2025-05-07T19:42:39.1586239Z Current runner version: '2.323.0' 2025-05-07T19:42:39.1593247Z Runner name: 'i-08ad04b373d870bec' 2025-05-07T19:42:39.1594198Z Machine name: 'ip-10-0-79-55' 2025-05-07T19:42:39.1596815Z ##[group]GITHUB_TOKEN Permissions 2025-05-07T19:42:39.1598890Z Contents: read 2025-05-07T19:42:39.1599380Z Metadata: read 2025-05-07T19:42:39.1600078Z Packages: read 2025-05-07T19:42:39.1600649Z ##[endgroup] 2025-05-07T19:42:39.1603074Z Secret source: None 2025-05-07T19:42:39.1604050Z Prepare workflow directory 2025-05-07T19:42:39.2225079Z Prepare all required actions 2025-05-07T19:42:39.2265175Z Getting action download info 2025-05-07T19:42:39.4010400Z Download action repository 'actions/checkout@v4' (SHA:11bd71901bbe5b1630ceea73d27597364c9af683) 2025-05-07T19:42:39.6000962Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-05-07T19:42:39.9772468Z Complete job name: build_artifact (x86, linux.24xlarge, default, 3.10, 12.6.3, clang) 2025-05-07T19:42:40.0608635Z A job started hook has been configured by the self-hosted runner administrator 2025-05-07T19:42:40.0726354Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh' 2025-05-07T19:42:40.0736525Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:40.0737530Z ##[endgroup] 2025-05-07T19:42:41.1818093Z Runner Type: linux.24xlarge 2025-05-07T19:42:41.1820030Z Instance Type: c5.24xlarge 2025-05-07T19:42:41.1820971Z AMI Name: unknown 2025-05-07T19:42:41.1857001Z AMI ID: ami-071226ecf16aa7d96 2025-05-07T19:42:46.2225400Z ##[group]Checking docker version 2025-05-07T19:42:46.2238746Z ##[command]/usr/bin/docker version --format '{{.Server.APIVersion}}' 2025-05-07T19:42:46.2462301Z '1.44' 2025-05-07T19:42:46.2483723Z Docker daemon API version: '1.44' 2025-05-07T19:42:46.2484240Z ##[command]/usr/bin/docker version --format '{{.Client.APIVersion}}' 2025-05-07T19:42:46.2692861Z '1.44' 2025-05-07T19:42:46.2708239Z Docker client API version: '1.44' 2025-05-07T19:42:46.2715078Z ##[endgroup] 2025-05-07T19:42:46.2719317Z ##[group]Clean up resources from previous jobs 2025-05-07T19:42:46.2725779Z ##[command]/usr/bin/docker ps --all --quiet --no-trunc --filter "label=1e1959" 2025-05-07T19:42:46.2915237Z ##[command]/usr/bin/docker network prune --force --filter "label=1e1959" 2025-05-07T19:42:46.3067302Z ##[endgroup] 2025-05-07T19:42:46.3067635Z ##[group]Create local container network 2025-05-07T19:42:46.3077034Z ##[command]/usr/bin/docker network create --label 1e1959 github_network_90ef945cace04aa28be488fe06f897dc 2025-05-07T19:42:46.5249063Z 271d6bbb5dedc25ba25b5d127988e7d711fd36bdcac64119ad770aaf3dc48c6a 2025-05-07T19:42:46.5267797Z ##[endgroup] 2025-05-07T19:42:46.5289925Z ##[group]Starting job container 2025-05-07T19:42:46.5308799Z ##[command]/usr/bin/docker pull amazonlinux:2023 2025-05-07T19:42:46.6814287Z 2023: Pulling from library/amazonlinux 2025-05-07T19:42:46.6873981Z Digest: sha256:cb5b4c509d62ae388f674c139ae5e8281fc160c217d474445e912043e1941988 2025-05-07T19:42:46.6876081Z Status: Image is up to date for amazonlinux:2023 2025-05-07T19:42:46.6889333Z docker.io/library/amazonlinux:2023 2025-05-07T19:42:46.6973708Z ##[command]/usr/bin/docker create --name a1849b04f9ef420595d98b94e6cdfef5_amazonlinux2023_b685f4 --label 1e1959 --workdir /__w/FBGEMM/FBGEMM --network github_network_90ef945cace04aa28be488fe06f897dc --user root -e "HOME=/github/home" -e GITHUB_ACTIONS=true -e CI=true -v "/var/run/docker.sock":"/var/run/docker.sock" -v "/home/ec2-user/actions-runner/_work":"/__w" -v "/home/ec2-user/actions-runner/externals":"/__e":ro -v "/home/ec2-user/actions-runner/_work/_temp":"/__w/_temp" -v "/home/ec2-user/actions-runner/_work/_actions":"/__w/_actions" -v "/home/ec2-user/actions-runner/_work/_tool":"/__w/_tool" -v "/home/ec2-user/actions-runner/_work/_temp/_github_home":"/github/home" -v "/home/ec2-user/actions-runner/_work/_temp/_github_workflow":"/github/workflow" --entrypoint "tail" amazonlinux:2023 "-f" "/dev/null" 2025-05-07T19:42:46.8890043Z 116e1204f840def26d5a12bed91c8919f60dd50f044201ed2ddf00f7f7c08ce4 2025-05-07T19:42:46.8917986Z ##[command]/usr/bin/docker start 116e1204f840def26d5a12bed91c8919f60dd50f044201ed2ddf00f7f7c08ce4 2025-05-07T19:42:47.3736766Z 116e1204f840def26d5a12bed91c8919f60dd50f044201ed2ddf00f7f7c08ce4 2025-05-07T19:42:47.3762107Z ##[command]/usr/bin/docker ps --all --filter id=116e1204f840def26d5a12bed91c8919f60dd50f044201ed2ddf00f7f7c08ce4 --filter status=running --no-trunc --format "{{.ID}} {{.Status}}" 2025-05-07T19:42:47.3918127Z 116e1204f840def26d5a12bed91c8919f60dd50f044201ed2ddf00f7f7c08ce4 Up Less than a second 2025-05-07T19:42:47.3940990Z ##[command]/usr/bin/docker inspect --format "{{range .Config.Env}}{{println .}}{{end}}" 116e1204f840def26d5a12bed91c8919f60dd50f044201ed2ddf00f7f7c08ce4 2025-05-07T19:42:47.4094291Z HOME=/github/home 2025-05-07T19:42:47.4094788Z GITHUB_ACTIONS=true 2025-05-07T19:42:47.4095231Z CI=true 2025-05-07T19:42:47.4095725Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:42:47.4126668Z ##[endgroup] 2025-05-07T19:42:47.4136917Z ##[group]Waiting for all services to be ready 2025-05-07T19:42:47.4138877Z ##[endgroup] 2025-05-07T19:42:47.4225736Z ##[group]Run yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:47.4226625Z yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:47.4227615Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:47.4228071Z env: 2025-05-07T19:42:47.4228393Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:47.4228782Z BUILD_ENV: build_binary 2025-05-07T19:42:47.4229142Z BUILD_TARGET: default 2025-05-07T19:42:47.4229450Z BUILD_VARIANT: cuda 2025-05-07T19:42:47.4229933Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:47.4230261Z ##[endgroup] 2025-05-07T19:42:48.0534413Z Amazon Linux 2023 repository 105 MB/s | 37 MB 00:00 2025-05-07T19:42:54.7180831Z Last metadata expiration check: 0:00:07 ago on Wed May 7 19:42:47 2025. 2025-05-07T19:42:55.2799016Z Dependencies resolved. 2025-05-07T19:42:55.2975826Z Nothing to do. 2025-05-07T19:42:55.2976524Z Complete! 2025-05-07T19:42:55.5303196Z Last metadata expiration check: 0:00:08 ago on Wed May 7 19:42:47 2025. 2025-05-07T19:42:55.5943124Z Dependencies resolved. 2025-05-07T19:42:55.6171803Z ======================================================================================== 2025-05-07T19:42:55.6173406Z Package Arch Version Repository Size 2025-05-07T19:42:55.6175063Z ======================================================================================== 2025-05-07T19:42:55.6175942Z Installing: 2025-05-07T19:42:55.6176458Z binutils x86_64 2.41-50.amzn2023.0.3 amazonlinux 5.3 M 2025-05-07T19:42:55.6177070Z findutils x86_64 1:4.8.0-2.amzn2023.0.2 amazonlinux 539 k 2025-05-07T19:42:55.6177626Z git x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 54 k 2025-05-07T19:42:55.6178242Z pciutils x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 93 k 2025-05-07T19:42:55.6178844Z sudo x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 1.3 M 2025-05-07T19:42:55.6179501Z tar x86_64 2:1.34-1.amzn2023.0.4 amazonlinux 879 k 2025-05-07T19:42:55.6180272Z wget x86_64 1.21.3-1.amzn2023.0.4 amazonlinux 779 k 2025-05-07T19:42:55.6180826Z which x86_64 2.21-26.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:55.6181425Z Installing dependencies: 2025-05-07T19:42:55.6181926Z cracklib x86_64 2.9.6-27.amzn2023.0.2 amazonlinux 82 k 2025-05-07T19:42:55.6182500Z cyrus-sasl-lib x86_64 2.1.27-18.amzn2023.0.3 amazonlinux 786 k 2025-05-07T19:42:55.6183302Z elfutils-debuginfod-client x86_64 0.188-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:55.6183958Z git-core x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 4.7 M 2025-05-07T19:42:55.6184907Z git-core-doc noarch 2.47.1-1.amzn2023.0.2 amazonlinux 2.8 M 2025-05-07T19:42:55.6185618Z gnutls x86_64 3.8.3-6.amzn2023.0.1 amazonlinux 1.1 M 2025-05-07T19:42:55.6186287Z groff-base x86_64 1.22.4-7.amzn2023.0.2 amazonlinux 1.0 M 2025-05-07T19:42:55.6186848Z gzip x86_64 1.12-1.amzn2023.0.1 amazonlinux 160 k 2025-05-07T19:42:55.6187421Z hwdata noarch 0.384-1.amzn2023.0.3 amazonlinux 1.6 M 2025-05-07T19:42:55.6188039Z jansson x86_64 2.14-0.amzn2023 amazonlinux 46 k 2025-05-07T19:42:55.6188612Z kmod-libs x86_64 29-2.amzn2023.0.5 amazonlinux 62 k 2025-05-07T19:42:55.6189190Z less x86_64 608-2.amzn2023.0.2 amazonlinux 168 k 2025-05-07T19:42:55.6189922Z libcbor x86_64 0.7.0-3.amzn2023.0.2 amazonlinux 57 k 2025-05-07T19:42:55.6190451Z libdb x86_64 5.3.28-49.amzn2023.0.2 amazonlinux 756 k 2025-05-07T19:42:55.6191078Z libeconf x86_64 0.4.0-1.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:42:55.6191677Z libedit x86_64 3.1-38.20210714cvs.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:55.6192209Z libfdisk x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 153 k 2025-05-07T19:42:55.6192819Z libfido2 x86_64 1.10.0-2.amzn2023.0.2 amazonlinux 95 k 2025-05-07T19:42:55.6193458Z libmetalink x86_64 0.1.3-14.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:55.6194097Z libpwquality x86_64 1.4.4-6.amzn2023.0.2 amazonlinux 106 k 2025-05-07T19:42:55.6194743Z libsemanage x86_64 3.4-5.amzn2023.0.2 amazonlinux 121 k 2025-05-07T19:42:55.6195314Z libutempter x86_64 1.2.1-4.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:55.6195892Z nano x86_64 8.3-1.amzn2023 amazonlinux 706 k 2025-05-07T19:42:55.6196408Z ncurses x86_64 6.2-4.20200222.amzn2023.0.6 amazonlinux 394 k 2025-05-07T19:42:55.6311099Z nettle x86_64 3.10.1-1.amzn2023.0.1 amazonlinux 573 k 2025-05-07T19:42:55.6311914Z openldap x86_64 2.4.57-6.amzn2023.0.7 amazonlinux 256 k 2025-05-07T19:42:55.6312486Z openssh x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 454 k 2025-05-07T19:42:55.6313072Z openssh-clients x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 708 k 2025-05-07T19:42:55.6313749Z pam x86_64 1.5.1-8.amzn2023.0.4 amazonlinux 542 k 2025-05-07T19:42:55.6314348Z pciutils-libs x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:55.6315042Z perl-AutoLoader noarch 5.74-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:55.6315713Z perl-B x86_64 1.80-477.amzn2023.0.6 amazonlinux 179 k 2025-05-07T19:42:55.6316342Z perl-Carp noarch 1.50-458.amzn2023.0.2 amazonlinux 29 k 2025-05-07T19:42:55.6316946Z perl-Class-Struct noarch 0.66-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:55.6317598Z perl-Data-Dumper x86_64 2.174-460.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:55.6318182Z perl-Digest noarch 1.20-1.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:55.6318788Z perl-Digest-MD5 x86_64 2.58-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:55.6319394Z perl-DynaLoader x86_64 1.47-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:55.6320233Z perl-Encode x86_64 4:3.15-462.amzn2023.0.2 amazonlinux 1.7 M 2025-05-07T19:42:55.6320806Z perl-Errno x86_64 1.30-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:55.6321361Z perl-Error noarch 1:0.17029-5.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:55.6322136Z perl-Exporter noarch 5.74-459.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:55.6322701Z perl-Fcntl x86_64 1.13-477.amzn2023.0.6 amazonlinux 21 k 2025-05-07T19:42:55.6323281Z perl-File-Basename noarch 2.85-477.amzn2023.0.6 amazonlinux 18 k 2025-05-07T19:42:55.6323898Z perl-File-Find noarch 1.37-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:55.6324497Z perl-File-Path noarch 2.18-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:55.6325080Z perl-File-Temp noarch 1:0.231.100-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:55.6325808Z perl-File-stat noarch 1.09-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:55.6326400Z perl-FileHandle noarch 2.03-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:55.6327043Z perl-Getopt-Long noarch 1:2.52-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:55.6327641Z perl-Getopt-Std noarch 1.12-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:55.6328227Z perl-Git noarch 2.47.1-1.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:55.6328812Z perl-HTTP-Tiny noarch 0.078-1.amzn2023.0.3 amazonlinux 56 k 2025-05-07T19:42:55.6329359Z perl-IO x86_64 1.43-477.amzn2023.0.6 amazonlinux 87 k 2025-05-07T19:42:55.6329921Z perl-IPC-Open3 noarch 1.21-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:55.6330502Z perl-MIME-Base64 x86_64 3.16-2.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:55.6331108Z perl-Net-SSLeay x86_64 1.94-1.amzn2023.0.1 amazonlinux 392 k 2025-05-07T19:42:55.6331744Z perl-POSIX x86_64 1.94-477.amzn2023.0.6 amazonlinux 97 k 2025-05-07T19:42:55.6332305Z perl-PathTools x86_64 3.78-459.amzn2023.0.2 amazonlinux 85 k 2025-05-07T19:42:55.6332915Z perl-Pod-Escapes noarch 1:1.07-458.amzn2023.0.2 amazonlinux 20 k 2025-05-07T19:42:55.6333512Z perl-Pod-Perldoc noarch 3.28.01-459.amzn2023.0.3 amazonlinux 84 k 2025-05-07T19:42:55.6334120Z perl-Pod-Simple noarch 1:3.42-2.amzn2023.0.2 amazonlinux 215 k 2025-05-07T19:42:55.6334839Z perl-Pod-Usage noarch 4:2.01-2.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:55.6335428Z perl-Scalar-List-Utils x86_64 4:1.56-459.amzn2023.0.2 amazonlinux 71 k 2025-05-07T19:42:55.6336055Z perl-SelectSaver noarch 1.02-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:55.6336614Z perl-Socket x86_64 4:2.032-1.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:55.6337164Z perl-Storable x86_64 1:3.21-458.amzn2023.0.2 amazonlinux 96 k 2025-05-07T19:42:55.6337707Z perl-Symbol noarch 1.08-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:55.6338303Z perl-Term-ANSIColor noarch 5.01-459.amzn2023.0.2 amazonlinux 48 k 2025-05-07T19:42:55.6338905Z perl-Term-Cap noarch 1.17-458.amzn2023.0.2 amazonlinux 22 k 2025-05-07T19:42:55.6339555Z perl-TermReadKey x86_64 2.38-9.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:55.6340364Z perl-Text-ParseWords noarch 3.30-458.amzn2023.0.2 amazonlinux 17 k 2025-05-07T19:42:55.6341006Z perl-Text-Tabs+Wrap noarch 2021.0726-1.amzn2023.0.1 amazonlinux 22 k 2025-05-07T19:42:55.6341752Z perl-Time-Local noarch 2:1.300-5.amzn2023.0.2 amazonlinux 34 k 2025-05-07T19:42:55.6342333Z perl-URI noarch 5.09-1.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:55.6342875Z perl-base noarch 2.27-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:55.6343453Z perl-constant noarch 1.33-459.amzn2023.0.2 amazonlinux 23 k 2025-05-07T19:42:55.6344005Z perl-if noarch 0.60.800-477.amzn2023.0.6 amazonlinux 14 k 2025-05-07T19:42:55.6344572Z perl-interpreter x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 71 k 2025-05-07T19:42:55.6345114Z perl-lib x86_64 0.65-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:55.6345659Z perl-libnet noarch 3.13-2.amzn2023.0.2 amazonlinux 126 k 2025-05-07T19:42:55.6346206Z perl-libs x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 2.0 M 2025-05-07T19:42:55.6346783Z perl-mro x86_64 1.23-477.amzn2023.0.6 amazonlinux 29 k 2025-05-07T19:42:55.6347334Z perl-overload noarch 1.31-477.amzn2023.0.6 amazonlinux 46 k 2025-05-07T19:42:55.6347917Z perl-overloading noarch 0.02-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:55.6348506Z perl-parent noarch 1:0.238-458.amzn2023.0.2 amazonlinux 14 k 2025-05-07T19:42:55.6349086Z perl-podlators noarch 1:4.14-458.amzn2023.0.2 amazonlinux 112 k 2025-05-07T19:42:55.6349642Z perl-subs noarch 1.03-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:55.6350193Z perl-vars noarch 1.05-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:55.6350729Z shadow-utils x86_64 2:4.9-12.amzn2023.0.4 amazonlinux 1.1 M 2025-05-07T19:42:55.6351323Z systemd-libs x86_64 252.23-3.amzn2023 amazonlinux 613 k 2025-05-07T19:42:55.6351871Z util-linux x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 2.2 M 2025-05-07T19:42:55.6352497Z util-linux-core x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 432 k 2025-05-07T19:42:55.6352928Z Installing weak dependencies: 2025-05-07T19:42:55.6353370Z nano-default-editor noarch 8.3-1.amzn2023 amazonlinux 10 k 2025-05-07T19:42:55.6353933Z perl-IO-Socket-IP noarch 0.41-3.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:55.6354498Z perl-IO-Socket-SSL noarch 2.075-1.amzn2023.0.2 amazonlinux 218 k 2025-05-07T19:42:55.6355072Z perl-Mozilla-CA noarch 20200520-4.amzn2023.0.2 amazonlinux 13 k 2025-05-07T19:42:55.6355610Z perl-NDBM_File x86_64 1.15-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:55.6356137Z sudo-python-plugin x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 56 k 2025-05-07T19:42:55.6356487Z 2025-05-07T19:42:55.6356587Z Transaction Summary 2025-05-07T19:42:55.6356865Z ======================================================================================== 2025-05-07T19:42:55.6357174Z Install 107 Packages 2025-05-07T19:42:55.6357321Z 2025-05-07T19:42:55.6357479Z Total download size: 38 M 2025-05-07T19:42:55.6357730Z Installed size: 151 M 2025-05-07T19:42:55.6357983Z Downloading Packages: 2025-05-07T19:42:55.9112512Z (1/107): cracklib-2.9.6-27.amzn2023.0.2.x86_64. 3.5 MB/s | 82 kB 00:00 2025-05-07T19:42:55.9248990Z (2/107): elfutils-debuginfod-client-0.188-3.amz 7.1 MB/s | 41 kB 00:00 2025-05-07T19:42:55.9479160Z (3/107): binutils-2.41-50.amzn2023.0.3.x86_64.r 89 MB/s | 5.3 MB 00:00 2025-05-07T19:42:55.9543987Z (4/107): cyrus-sasl-lib-2.1.27-18.amzn2023.0.3. 12 MB/s | 786 kB 00:00 2025-05-07T19:42:55.9581464Z (5/107): findutils-4.8.0-2.amzn2023.0.2.x86_64. 17 MB/s | 539 kB 00:00 2025-05-07T19:42:55.9603538Z (6/107): git-2.47.1-1.amzn2023.0.2.x86_64.rpm 4.9 MB/s | 54 kB 00:00 2025-05-07T19:42:55.9766516Z (7/107): gnutls-3.8.3-6.amzn2023.0.1.x86_64.rpm 68 MB/s | 1.1 MB 00:00 2025-05-07T19:42:56.0075998Z (8/107): git-core-doc-2.47.1-1.amzn2023.0.2.noa 57 MB/s | 2.8 MB 00:00 2025-05-07T19:42:56.0138651Z (9/107): groff-base-1.22.4-7.amzn2023.0.2.x86_6 33 MB/s | 1.0 MB 00:00 2025-05-07T19:42:56.0353279Z (10/107): git-core-2.47.1-1.amzn2023.0.2.x86_64 62 MB/s | 4.7 MB 00:00 2025-05-07T19:42:56.0408931Z (11/107): gzip-1.12-1.amzn2023.0.1.x86_64.rpm 6.0 MB/s | 160 kB 00:00 2025-05-07T19:42:56.0498592Z (12/107): hwdata-0.384-1.amzn2023.0.3.noarch.rp 46 MB/s | 1.6 MB 00:00 2025-05-07T19:42:56.0519221Z (13/107): jansson-2.14-0.amzn2023.x86_64.rpm 3.3 MB/s | 46 kB 00:00 2025-05-07T19:42:56.0535447Z (14/107): kmod-libs-29-2.amzn2023.0.5.x86_64.rp 5.0 MB/s | 62 kB 00:00 2025-05-07T19:42:56.0631361Z (15/107): less-608-2.amzn2023.0.2.x86_64.rpm 13 MB/s | 168 kB 00:00 2025-05-07T19:42:56.0644227Z (16/107): libcbor-0.7.0-3.amzn2023.0.2.x86_64.r 5.4 MB/s | 57 kB 00:00 2025-05-07T19:42:56.0703845Z (17/107): libdb-5.3.28-49.amzn2023.0.2.x86_64.r 47 MB/s | 756 kB 00:00 2025-05-07T19:42:56.0719719Z (18/107): libeconf-0.4.0-1.amzn2023.0.3.x86_64. 4.4 MB/s | 28 kB 00:00 2025-05-07T19:42:56.0741071Z (19/107): libedit-3.1-38.20210714cvs.amzn2023.0 13 MB/s | 108 kB 00:00 2025-05-07T19:42:56.0784679Z (20/107): libfdisk-2.37.4-1.amzn2023.0.4.x86_64 19 MB/s | 153 kB 00:00 2025-05-07T19:42:56.0810507Z (21/107): libfido2-1.10.0-2.amzn2023.0.2.x86_64 11 MB/s | 95 kB 00:00 2025-05-07T19:42:56.0826935Z (22/107): libmetalink-0.1.3-14.amzn2023.0.2.x86 3.6 MB/s | 31 kB 00:00 2025-05-07T19:42:56.0886059Z (23/107): libsemanage-3.4-5.amzn2023.0.2.x86_64 17 MB/s | 121 kB 00:00 2025-05-07T19:42:56.0904953Z (24/107): libpwquality-1.4.4-6.amzn2023.0.2.x86 8.9 MB/s | 106 kB 00:00 2025-05-07T19:42:56.0914018Z (25/107): libutempter-1.2.1-4.amzn2023.0.2.x86_ 3.1 MB/s | 26 kB 00:00 2025-05-07T19:42:56.1007035Z (26/107): nano-8.3-1.amzn2023.x86_64.rpm 59 MB/s | 706 kB 00:00 2025-05-07T19:42:56.1028018Z (27/107): nano-default-editor-8.3-1.amzn2023.no 941 kB/s | 10 kB 00:00 2025-05-07T19:42:56.1062980Z (28/107): ncurses-6.2-4.20200222.amzn2023.0.6.x 27 MB/s | 394 kB 00:00 2025-05-07T19:42:56.1112047Z (29/107): nettle-3.10.1-1.amzn2023.0.1.x86_64.r 58 MB/s | 573 kB 00:00 2025-05-07T19:42:56.1166046Z (30/107): openldap-2.4.57-6.amzn2023.0.7.x86_64 28 MB/s | 256 kB 00:00 2025-05-07T19:42:56.1224740Z (31/107): openssh-8.7p1-8.amzn2023.0.14.x86_64. 30 MB/s | 454 kB 00:00 2025-05-07T19:42:56.1279962Z (32/107): openssh-clients-8.7p1-8.amzn2023.0.14 43 MB/s | 708 kB 00:00 2025-05-07T19:42:56.1327194Z (33/107): pam-1.5.1-8.amzn2023.0.4.x86_64.rpm 37 MB/s | 542 kB 00:00 2025-05-07T19:42:56.1344448Z (34/107): pciutils-3.7.0-3.amzn2023.0.2.x86_64. 8.1 MB/s | 93 kB 00:00 2025-05-07T19:42:56.1369428Z (35/107): pciutils-libs-3.7.0-3.amzn2023.0.2.x8 5.1 MB/s | 41 kB 00:00 2025-05-07T19:42:56.1391898Z (36/107): perl-AutoLoader-5.74-477.amzn2023.0.6 5.6 MB/s | 22 kB 00:00 2025-05-07T19:42:56.1433479Z (37/107): perl-Carp-1.50-458.amzn2023.0.2.noarc 4.8 MB/s | 29 kB 00:00 2025-05-07T19:42:56.1465482Z (38/107): perl-B-1.80-477.amzn2023.0.6.x86_64.r 16 MB/s | 179 kB 00:00 2025-05-07T19:42:56.1475483Z (39/107): perl-Class-Struct-0.66-477.amzn2023.0 2.7 MB/s | 22 kB 00:00 2025-05-07T19:42:56.1502224Z (40/107): perl-Data-Dumper-2.174-460.amzn2023.0 8.6 MB/s | 55 kB 00:00 2025-05-07T19:42:56.1530856Z (41/107): perl-Digest-1.20-1.amzn2023.0.2.noarc 5.6 MB/s | 26 kB 00:00 2025-05-07T19:42:56.1546972Z (42/107): perl-Digest-MD5-2.58-2.amzn2023.0.2.x 5.8 MB/s | 36 kB 00:00 2025-05-07T19:42:56.1567646Z (43/107): perl-DynaLoader-1.47-477.amzn2023.0.6 4.2 MB/s | 26 kB 00:00 2025-05-07T19:42:56.1604739Z (44/107): perl-Errno-1.30-477.amzn2023.0.6.x86_ 3.1 MB/s | 15 kB 00:00 2025-05-07T19:42:56.1716064Z (45/107): perl-Encode-3.15-462.amzn2023.0.2.x86 94 MB/s | 1.7 MB 00:00 2025-05-07T19:42:56.1731650Z (46/107): perl-Error-0.17029-5.amzn2023.0.2.noa 2.5 MB/s | 41 kB 00:00 2025-05-07T19:42:56.1742056Z (47/107): perl-Exporter-5.74-459.amzn2023.0.2.n 2.9 MB/s | 31 kB 00:00 2025-05-07T19:42:56.1831998Z (48/107): perl-Fcntl-1.13-477.amzn2023.0.6.x86_ 1.8 MB/s | 21 kB 00:00 2025-05-07T19:42:56.1844548Z (49/107): perl-File-Basename-2.85-477.amzn2023. 1.8 MB/s | 18 kB 00:00 2025-05-07T19:42:56.1860409Z (50/107): perl-File-Find-1.37-477.amzn2023.0.6. 2.4 MB/s | 26 kB 00:00 2025-05-07T19:42:56.1909066Z (51/107): perl-File-Path-2.18-2.amzn2023.0.2.no 6.0 MB/s | 36 kB 00:00 2025-05-07T19:42:56.1928032Z (52/107): perl-File-stat-1.09-477.amzn2023.0.6. 2.7 MB/s | 17 kB 00:00 2025-05-07T19:42:56.1949949Z (53/107): perl-File-Temp-0.231.100-2.amzn2023.0 5.9 MB/s | 60 kB 00:00 2025-05-07T19:42:56.1968138Z (54/107): perl-FileHandle-2.03-477.amzn2023.0.6 2.7 MB/s | 16 kB 00:00 2025-05-07T19:42:56.1985474Z (55/107): perl-Getopt-Long-2.52-2.amzn2023.0.2. 11 MB/s | 60 kB 00:00 2025-05-07T19:42:56.2015586Z (56/107): perl-Getopt-Std-1.12-477.amzn2023.0.6 2.7 MB/s | 16 kB 00:00 2025-05-07T19:42:56.2041310Z (57/107): perl-Git-2.47.1-1.amzn2023.0.2.noarch 6.2 MB/s | 42 kB 00:00 2025-05-07T19:42:56.2055376Z (58/107): perl-HTTP-Tiny-0.078-1.amzn2023.0.3.n 8.3 MB/s | 56 kB 00:00 2025-05-07T19:42:56.2082618Z (59/107): perl-IO-1.43-477.amzn2023.0.6.x86_64. 15 MB/s | 87 kB 00:00 2025-05-07T19:42:56.2102528Z (60/107): perl-IO-Socket-IP-0.41-3.amzn2023.0.2 9.5 MB/s | 42 kB 00:00 2025-05-07T19:42:56.2130692Z (61/107): perl-IO-Socket-SSL-2.075-1.amzn2023.0 29 MB/s | 218 kB 00:00 2025-05-07T19:42:56.2144291Z (62/107): perl-IPC-Open3-1.21-477.amzn2023.0.6. 3.8 MB/s | 23 kB 00:00 2025-05-07T19:42:56.2165390Z (63/107): perl-MIME-Base64-3.16-2.amzn2023.0.2. 5.6 MB/s | 31 kB 00:00 2025-05-07T19:42:56.2191608Z (64/107): perl-Mozilla-CA-20200520-4.amzn2023.0 2.2 MB/s | 13 kB 00:00 2025-05-07T19:42:56.2203994Z (65/107): perl-NDBM_File-1.15-477.amzn2023.0.6. 4.1 MB/s | 23 kB 00:00 2025-05-07T19:42:56.2251891Z (66/107): perl-POSIX-1.94-477.amzn2023.0.6.x86_ 17 MB/s | 97 kB 00:00 2025-05-07T19:42:56.2274896Z (67/107): perl-PathTools-3.78-459.amzn2023.0.2. 13 MB/s | 85 kB 00:00 2025-05-07T19:42:56.2341999Z (68/107): perl-Pod-Perldoc-3.28.01-459.amzn2023 13 MB/s | 84 kB 00:00 2025-05-07T19:42:56.2359588Z (69/107): perl-Pod-Escapes-1.07-458.amzn2023.0. 1.9 MB/s | 20 kB 00:00 2025-05-07T19:42:56.2404406Z (70/107): perl-Net-SSLeay-1.94-1.amzn2023.0.1.x 16 MB/s | 392 kB 00:00 2025-05-07T19:42:56.2438572Z (71/107): perl-Pod-Simple-3.42-2.amzn2023.0.2.n 24 MB/s | 215 kB 00:00 2025-05-07T19:42:56.2461457Z (72/107): perl-Pod-Usage-2.01-2.amzn2023.0.2.no 4.5 MB/s | 41 kB 00:00 2025-05-07T19:42:56.2476114Z (73/107): perl-Scalar-List-Utils-1.56-459.amzn2 11 MB/s | 71 kB 00:00 2025-05-07T19:42:56.2501746Z (74/107): perl-SelectSaver-1.02-477.amzn2023.0. 2.2 MB/s | 12 kB 00:00 2025-05-07T19:42:56.2542832Z (75/107): perl-Socket-2.032-1.amzn2023.0.2.x86_ 9.5 MB/s | 55 kB 00:00 2025-05-07T19:42:56.2568192Z (76/107): perl-Storable-3.21-458.amzn2023.0.2.x 12 MB/s | 96 kB 00:00 2025-05-07T19:42:56.2578100Z (77/107): perl-Symbol-1.08-477.amzn2023.0.6.noa 1.9 MB/s | 15 kB 00:00 2025-05-07T19:42:56.2599418Z (78/107): perl-Term-ANSIColor-5.01-459.amzn2023 9.7 MB/s | 48 kB 00:00 2025-05-07T19:42:56.2628823Z (79/107): perl-Term-Cap-1.17-458.amzn2023.0.2.n 4.9 MB/s | 22 kB 00:00 2025-05-07T19:42:56.2651468Z (80/107): perl-Text-ParseWords-3.30-458.amzn202 3.4 MB/s | 17 kB 00:00 2025-05-07T19:42:56.2676083Z (81/107): perl-Text-Tabs+Wrap-2021.0726-1.amzn2 5.1 MB/s | 22 kB 00:00 2025-05-07T19:42:56.2719908Z (82/107): perl-Time-Local-1.300-5.amzn2023.0.2. 5.3 MB/s | 34 kB 00:00 2025-05-07T19:42:56.2744160Z (83/107): perl-URI-5.09-1.amzn2023.0.2.noarch.r 16 MB/s | 108 kB 00:00 2025-05-07T19:42:56.2779144Z (84/107): perl-base-2.27-477.amzn2023.0.6.noarc 3.1 MB/s | 17 kB 00:00 2025-05-07T19:42:56.2796108Z (85/107): perl-constant-1.33-459.amzn2023.0.2.n 4.7 MB/s | 23 kB 00:00 2025-05-07T19:42:56.2830964Z (86/107): perl-if-0.60.800-477.amzn2023.0.6.noa 3.2 MB/s | 14 kB 00:00 2025-05-07T19:42:56.2859532Z (87/107): perl-interpreter-5.32.1-477.amzn2023. 12 MB/s | 71 kB 00:00 2025-05-07T19:42:56.2931422Z (88/107): perl-lib-0.65-477.amzn2023.0.6.x86_64 1.6 MB/s | 15 kB 00:00 2025-05-07T19:42:56.2957416Z (89/107): perl-libnet-3.13-2.amzn2023.0.2.noarc 13 MB/s | 126 kB 00:00 2025-05-07T19:42:56.3124835Z (90/107): perl-libs-5.32.1-477.amzn2023.0.6.x86 110 MB/s | 2.0 MB 00:00 2025-05-07T19:42:56.3142465Z (91/107): perl-mro-1.23-477.amzn2023.0.6.x86_64 1.6 MB/s | 29 kB 00:00 2025-05-07T19:42:56.3180234Z (92/107): perl-overload-1.31-477.amzn2023.0.6.n 8.8 MB/s | 46 kB 00:00 2025-05-07T19:42:56.3193641Z (93/107): perl-overloading-0.02-477.amzn2023.0. 2.7 MB/s | 13 kB 00:00 2025-05-07T19:42:56.3230610Z (94/107): perl-parent-0.238-458.amzn2023.0.2.no 2.9 MB/s | 14 kB 00:00 2025-05-07T19:42:56.3256805Z (95/107): perl-podlators-4.14-458.amzn2023.0.2. 19 MB/s | 112 kB 00:00 2025-05-07T19:42:56.3275684Z (96/107): perl-subs-1.03-477.amzn2023.0.6.noarc 2.9 MB/s | 12 kB 00:00 2025-05-07T19:42:56.3305548Z (97/107): perl-vars-1.05-477.amzn2023.0.6.noarc 2.9 MB/s | 13 kB 00:00 2025-05-07T19:42:56.3350434Z (98/107): perl-TermReadKey-2.38-9.amzn2023.0.2. 472 kB/s | 36 kB 00:00 2025-05-07T19:42:56.3419092Z (99/107): shadow-utils-4.9-12.amzn2023.0.4.x86_ 80 MB/s | 1.1 MB 00:00 2025-05-07T19:42:56.3504253Z (100/107): sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 64 MB/s | 1.3 MB 00:00 2025-05-07T19:42:56.3530537Z (101/107): sudo-python-plugin-1.9.15-1.p5.amzn2 3.1 MB/s | 56 kB 00:00 2025-05-07T19:42:56.3571115Z (102/107): systemd-libs-252.23-3.amzn2023.x86_6 46 MB/s | 613 kB 00:00 2025-05-07T19:42:56.3635784Z (103/107): tar-1.34-1.amzn2023.0.4.x86_64.rpm 73 MB/s | 879 kB 00:00 2025-05-07T19:42:56.3710074Z (104/107): util-linux-core-2.37.4-1.amzn2023.0. 35 MB/s | 432 kB 00:00 2025-05-07T19:42:56.3829773Z (105/107): util-linux-2.37.4-1.amzn2023.0.4.x86 90 MB/s | 2.2 MB 00:00 2025-05-07T19:42:56.3892743Z (106/107): wget-1.21.3-1.amzn2023.0.4.x86_64.rp 32 MB/s | 779 kB 00:00 2025-05-07T19:42:56.3902205Z (107/107): which-2.21-26.amzn2023.0.2.x86_64.rp 2.4 MB/s | 42 kB 00:00 2025-05-07T19:42:56.3919578Z -------------------------------------------------------------------------------- 2025-05-07T19:42:56.3920088Z Total 49 MB/s | 38 MB 00:00 2025-05-07T19:42:57.4661014Z Running transaction check 2025-05-07T19:42:57.5117682Z Transaction check succeeded. 2025-05-07T19:42:57.5118556Z Running transaction test 2025-05-07T19:42:57.8780345Z Transaction test succeeded. 2025-05-07T19:42:57.8781308Z Running transaction 2025-05-07T19:42:58.5777752Z Preparing : 1/1 2025-05-07T19:42:58.5921740Z Installing : systemd-libs-252.23-3.amzn2023.x86_64 1/107 2025-05-07T19:42:58.6158932Z Installing : nettle-3.10.1-1.amzn2023.0.1.x86_64 2/107 2025-05-07T19:42:58.6359035Z Installing : gnutls-3.8.3-6.amzn2023.0.1.x86_64 3/107 2025-05-07T19:42:58.6404183Z Installing : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:58.6470933Z Running scriptlet: util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:58.6566178Z Installing : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:58.6833772Z Installing : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 6/107 2025-05-07T19:42:58.6893505Z Installing : nano-8.3-1.amzn2023.x86_64 7/107 2025-05-07T19:42:58.6944499Z Installing : nano-default-editor-8.3-1.amzn2023.noarch 8/107 2025-05-07T19:42:58.7445187Z Installing : libsemanage-3.4-5.amzn2023.0.2.x86_64 9/107 2025-05-07T19:42:58.7504052Z Installing : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 10/107 2025-05-07T19:42:58.7805525Z Running scriptlet: libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:58.7860573Z Installing : libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:58.7913605Z Installing : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 12/107 2025-05-07T19:42:58.7971634Z Installing : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 13/107 2025-05-07T19:42:58.8011706Z Installing : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 14/107 2025-05-07T19:42:58.8139545Z Installing : libeconf-0.4.0-1.amzn2023.0.3.x86_64 15/107 2025-05-07T19:42:58.8181623Z Installing : libdb-5.3.28-49.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:58.8227636Z Installing : libcbor-0.7.0-3.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:58.8295530Z Installing : libfido2-1.10.0-2.amzn2023.0.2.x86_64 18/107 2025-05-07T19:42:58.8342393Z Installing : less-608-2.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:58.8387347Z Installing : kmod-libs-29-2.amzn2023.0.5.x86_64 20/107 2025-05-07T19:42:58.8807999Z Installing : jansson-2.14-0.amzn2023.x86_64 21/107 2025-05-07T19:42:58.8876464Z Installing : hwdata-0.384-1.amzn2023.0.3.noarch 22/107 2025-05-07T19:42:58.9009548Z Installing : gzip-1.12-1.amzn2023.0.1.x86_64 23/107 2025-05-07T19:42:58.9420501Z Installing : cracklib-2.9.6-27.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:58.9581639Z Installing : pam-1.5.1-8.amzn2023.0.4.x86_64 25/107 2025-05-07T19:42:59.0374133Z Installing : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 26/107 2025-05-07T19:42:59.0375830Z Installing : util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:59.0377193Z warning: /etc/adjtime created as /etc/adjtime.rpmnew 2025-05-07T19:42:59.0377954Z 2025-05-07T19:42:59.0552929Z Running scriptlet: util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:59.0822770Z Running scriptlet: openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:59.1012575Z Installing : openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:59.1070532Z Installing : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:59.2182560Z Running scriptlet: openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:59.3678854Z Installing : git-core-2.47.1-1.amzn2023.0.2.x86_64 30/107 2025-05-07T19:42:59.3813237Z Installing : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 31/107 2025-05-07T19:42:59.4225067Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:59.4312330Z Installing : groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:59.4385003Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:59.4458783Z Installing : perl-Digest-1.20-1.amzn2023.0.2.noarch 33/107 2025-05-07T19:42:59.4548480Z Installing : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:59.4604514Z Installing : perl-B-1.80-477.amzn2023.0.6.x86_64 35/107 2025-05-07T19:42:59.4651986Z Installing : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:59.4703579Z Installing : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 37/107 2025-05-07T19:42:59.4791795Z Installing : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 38/107 2025-05-07T19:42:59.4861499Z Installing : perl-libnet-3.13-2.amzn2023.0.2.noarch 39/107 2025-05-07T19:42:59.4966284Z Installing : perl-base-2.27-477.amzn2023.0.6.noarch 40/107 2025-05-07T19:42:59.5179520Z Installing : perl-URI-5.09-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:59.5265004Z Installing : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 42/107 2025-05-07T19:42:59.5318102Z Installing : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 43/107 2025-05-07T19:42:59.5363580Z Installing : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 44/107 2025-05-07T19:42:59.5422801Z Installing : perl-if-0.60.800-477.amzn2023.0.6.noarch 45/107 2025-05-07T19:42:59.5481762Z Installing : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:59.5539733Z Installing : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:59.5624139Z Installing : perl-File-Path-2.18-2.amzn2023.0.2.noarch 48/107 2025-05-07T19:42:59.5694275Z Installing : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 49/107 2025-05-07T19:42:59.5742306Z Installing : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 50/107 2025-05-07T19:42:59.5808843Z Installing : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 51/107 2025-05-07T19:42:59.5871020Z Installing : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 52/107 2025-05-07T19:42:59.5928583Z Installing : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 53/107 2025-05-07T19:42:59.5971163Z Installing : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:59.6023127Z Installing : perl-subs-1.03-477.amzn2023.0.6.noarch 55/107 2025-05-07T19:42:59.6096477Z Installing : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 56/107 2025-05-07T19:42:59.6153018Z Installing : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 57/107 2025-05-07T19:42:59.6267236Z Installing : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 58/107 2025-05-07T19:42:59.6357399Z Installing : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 59/107 2025-05-07T19:42:59.6414595Z Installing : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 60/107 2025-05-07T19:42:59.6461337Z Installing : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 61/107 2025-05-07T19:42:59.6502891Z Installing : perl-Symbol-1.08-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:59.6580947Z Installing : perl-File-stat-1.09-477.amzn2023.0.6.noarch 63/107 2025-05-07T19:42:59.6680062Z Installing : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:59.6747793Z Installing : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 65/107 2025-05-07T19:42:59.6805442Z Installing : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 66/107 2025-05-07T19:42:59.6860206Z Installing : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 67/107 2025-05-07T19:42:59.6937359Z Installing : perl-mro-1.23-477.amzn2023.0.6.x86_64 68/107 2025-05-07T19:42:59.7003293Z Installing : perl-IO-1.43-477.amzn2023.0.6.x86_64 69/107 2025-05-07T19:42:59.7062223Z Installing : perl-overloading-0.02-477.amzn2023.0.6.noarch 70/107 2025-05-07T19:42:59.7134249Z Installing : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:42:59.7184709Z Installing : perl-Errno-1.30-477.amzn2023.0.6.x86_64 72/107 2025-05-07T19:42:59.7243122Z Installing : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 73/107 2025-05-07T19:42:59.7302155Z Installing : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:59.7381855Z Installing : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:59.7456478Z Installing : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 76/107 2025-05-07T19:42:59.7526120Z Installing : perl-constant-1.33-459.amzn2023.0.2.noarch 77/107 2025-05-07T19:42:59.7586843Z Installing : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 78/107 2025-05-07T19:42:59.7635423Z Installing : perl-overload-1.31-477.amzn2023.0.6.noarch 79/107 2025-05-07T19:42:59.7684964Z Installing : perl-parent-1:0.238-458.amzn2023.0.2.noarch 80/107 2025-05-07T19:42:59.7745476Z Installing : perl-vars-1.05-477.amzn2023.0.6.noarch 81/107 2025-05-07T19:42:59.7803869Z Installing : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 82/107 2025-05-07T19:42:59.7857968Z Installing : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 83/107 2025-05-07T19:42:59.7917898Z Installing : perl-Carp-1.50-458.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:59.7970124Z Installing : perl-Exporter-5.74-459.amzn2023.0.2.noarch 85/107 2025-05-07T19:42:59.8048694Z Installing : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 86/107 2025-05-07T19:42:59.8583353Z Installing : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 87/107 2025-05-07T19:42:59.9564689Z Installing : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 88/107 2025-05-07T19:42:59.9686672Z Installing : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:59.9770931Z Installing : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 90/107 2025-05-07T19:42:59.9837731Z Installing : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 91/107 2025-05-07T19:42:59.9902959Z Installing : perl-File-Find-1.37-477.amzn2023.0.6.noarch 92/107 2025-05-07T19:42:59.9973956Z Installing : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 93/107 2025-05-07T19:43:00.0023816Z Installing : perl-lib-0.65-477.amzn2023.0.6.x86_64 94/107 2025-05-07T19:43:00.0088331Z Installing : perl-Git-2.47.1-1.amzn2023.0.2.noarch 95/107 2025-05-07T19:43:00.0164871Z Installing : git-2.47.1-1.amzn2023.0.2.x86_64 96/107 2025-05-07T19:43:00.0367112Z Installing : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 97/107 2025-05-07T19:43:00.0489072Z Installing : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 98/107 2025-05-07T19:43:00.0569976Z Installing : openldap-2.4.57-6.amzn2023.0.7.x86_64 99/107 2025-05-07T19:43:00.0971981Z Installing : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 100/107 2025-05-07T19:43:00.2200699Z Installing : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 101/107 2025-05-07T19:43:00.2296575Z Installing : binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:43:00.2409527Z Running scriptlet: binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:43:00.2707207Z Installing : pciutils-3.7.0-3.amzn2023.0.2.x86_64 103/107 2025-05-07T19:43:00.2807455Z Installing : wget-1.21.3-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:43:00.3054441Z Installing : which-2.21-26.amzn2023.0.2.x86_64 105/107 2025-05-07T19:43:00.3263262Z Installing : tar-2:1.34-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:43:00.3342732Z Installing : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:43:00.3460566Z Running scriptlet: pam-1.5.1-8.amzn2023.0.4.x86_64 107/107 2025-05-07T19:43:01.0942865Z Running scriptlet: findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:43:01.0943700Z Verifying : binutils-2.41-50.amzn2023.0.3.x86_64 1/107 2025-05-07T19:43:01.0944369Z Verifying : cracklib-2.9.6-27.amzn2023.0.2.x86_64 2/107 2025-05-07T19:43:01.0945071Z Verifying : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 3/107 2025-05-07T19:43:01.0945784Z Verifying : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 4/107 2025-05-07T19:43:01.0946410Z Verifying : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 5/107 2025-05-07T19:43:01.0947079Z Verifying : git-2.47.1-1.amzn2023.0.2.x86_64 6/107 2025-05-07T19:43:01.0947707Z Verifying : git-core-2.47.1-1.amzn2023.0.2.x86_64 7/107 2025-05-07T19:43:01.0948307Z Verifying : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 8/107 2025-05-07T19:43:01.0949280Z Verifying : gnutls-3.8.3-6.amzn2023.0.1.x86_64 9/107 2025-05-07T19:43:01.0949889Z Verifying : groff-base-1.22.4-7.amzn2023.0.2.x86_64 10/107 2025-05-07T19:43:01.0950545Z Verifying : gzip-1.12-1.amzn2023.0.1.x86_64 11/107 2025-05-07T19:43:01.0951235Z Verifying : hwdata-0.384-1.amzn2023.0.3.noarch 12/107 2025-05-07T19:43:01.0951820Z Verifying : jansson-2.14-0.amzn2023.x86_64 13/107 2025-05-07T19:43:01.0952469Z Verifying : kmod-libs-29-2.amzn2023.0.5.x86_64 14/107 2025-05-07T19:43:01.0953067Z Verifying : less-608-2.amzn2023.0.2.x86_64 15/107 2025-05-07T19:43:01.0953750Z Verifying : libcbor-0.7.0-3.amzn2023.0.2.x86_64 16/107 2025-05-07T19:43:01.0954389Z Verifying : libdb-5.3.28-49.amzn2023.0.2.x86_64 17/107 2025-05-07T19:43:01.0954990Z Verifying : libeconf-0.4.0-1.amzn2023.0.3.x86_64 18/107 2025-05-07T19:43:01.0955673Z Verifying : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 19/107 2025-05-07T19:43:01.0956273Z Verifying : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 20/107 2025-05-07T19:43:01.0956930Z Verifying : libfido2-1.10.0-2.amzn2023.0.2.x86_64 21/107 2025-05-07T19:43:01.0957627Z Verifying : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 22/107 2025-05-07T19:43:01.0958247Z Verifying : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 23/107 2025-05-07T19:43:01.0958992Z Verifying : libsemanage-3.4-5.amzn2023.0.2.x86_64 24/107 2025-05-07T19:43:01.0959601Z Verifying : libutempter-1.2.1-4.amzn2023.0.2.x86_64 25/107 2025-05-07T19:43:01.0960253Z Verifying : nano-8.3-1.amzn2023.x86_64 26/107 2025-05-07T19:43:01.0960869Z Verifying : nano-default-editor-8.3-1.amzn2023.noarch 27/107 2025-05-07T19:43:01.0961569Z Verifying : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 28/107 2025-05-07T19:43:01.0962214Z Verifying : nettle-3.10.1-1.amzn2023.0.1.x86_64 29/107 2025-05-07T19:43:01.0962816Z Verifying : openldap-2.4.57-6.amzn2023.0.7.x86_64 30/107 2025-05-07T19:43:01.0963495Z Verifying : openssh-8.7p1-8.amzn2023.0.14.x86_64 31/107 2025-05-07T19:43:01.0964113Z Verifying : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 32/107 2025-05-07T19:43:01.0964770Z Verifying : pam-1.5.1-8.amzn2023.0.4.x86_64 33/107 2025-05-07T19:43:01.0965430Z Verifying : pciutils-3.7.0-3.amzn2023.0.2.x86_64 34/107 2025-05-07T19:43:01.0966044Z Verifying : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 35/107 2025-05-07T19:43:01.0966731Z Verifying : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:43:01.0967534Z Verifying : perl-B-1.80-477.amzn2023.0.6.x86_64 37/107 2025-05-07T19:43:01.0968181Z Verifying : perl-Carp-1.50-458.amzn2023.0.2.noarch 38/107 2025-05-07T19:43:01.0968861Z Verifying : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 39/107 2025-05-07T19:43:01.0969577Z Verifying : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 40/107 2025-05-07T19:43:01.0970291Z Verifying : perl-Digest-1.20-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:43:01.0971020Z Verifying : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 42/107 2025-05-07T19:43:01.0971645Z Verifying : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 43/107 2025-05-07T19:43:01.0972299Z Verifying : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 44/107 2025-05-07T19:43:01.0972890Z Verifying : perl-Errno-1.30-477.amzn2023.0.6.x86_64 45/107 2025-05-07T19:43:01.0973635Z Verifying : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 46/107 2025-05-07T19:43:01.0974242Z Verifying : perl-Exporter-5.74-459.amzn2023.0.2.noarch 47/107 2025-05-07T19:43:01.0974850Z Verifying : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 48/107 2025-05-07T19:43:01.0975386Z Verifying : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 49/107 2025-05-07T19:43:01.0975952Z Verifying : perl-File-Find-1.37-477.amzn2023.0.6.noarch 50/107 2025-05-07T19:43:01.0976506Z Verifying : perl-File-Path-2.18-2.amzn2023.0.2.noarch 51/107 2025-05-07T19:43:01.0977053Z Verifying : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 52/107 2025-05-07T19:43:01.0977593Z Verifying : perl-File-stat-1.09-477.amzn2023.0.6.noarch 53/107 2025-05-07T19:43:01.0978169Z Verifying : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:43:01.0978748Z Verifying : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 55/107 2025-05-07T19:43:01.0979437Z Verifying : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 56/107 2025-05-07T19:43:01.0980001Z Verifying : perl-Git-2.47.1-1.amzn2023.0.2.noarch 57/107 2025-05-07T19:43:01.0980538Z Verifying : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 58/107 2025-05-07T19:43:01.0981088Z Verifying : perl-IO-1.43-477.amzn2023.0.6.x86_64 59/107 2025-05-07T19:43:01.0981645Z Verifying : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 60/107 2025-05-07T19:43:01.0982201Z Verifying : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 61/107 2025-05-07T19:43:01.0982774Z Verifying : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:43:01.0983322Z Verifying : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 63/107 2025-05-07T19:43:01.0983891Z Verifying : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 64/107 2025-05-07T19:43:01.0984436Z Verifying : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 65/107 2025-05-07T19:43:01.0984991Z Verifying : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 66/107 2025-05-07T19:43:01.0985556Z Verifying : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 67/107 2025-05-07T19:43:01.0986097Z Verifying : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 68/107 2025-05-07T19:43:01.0986662Z Verifying : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 69/107 2025-05-07T19:43:01.0987213Z Verifying : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 70/107 2025-05-07T19:43:01.0987778Z Verifying : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:43:01.0988311Z Verifying : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 72/107 2025-05-07T19:43:01.0988862Z Verifying : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 73/107 2025-05-07T19:43:01.0989516Z Verifying : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:43:01.0990048Z Verifying : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 75/107 2025-05-07T19:43:01.0990576Z Verifying : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 76/107 2025-05-07T19:43:01.0991099Z Verifying : perl-Symbol-1.08-477.amzn2023.0.6.noarch 77/107 2025-05-07T19:43:01.0991663Z Verifying : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 78/107 2025-05-07T19:43:01.0992375Z Verifying : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 79/107 2025-05-07T19:43:01.0993005Z Verifying : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 80/107 2025-05-07T19:43:01.0993562Z Verifying : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 81/107 2025-05-07T19:43:01.0994089Z Verifying : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 82/107 2025-05-07T19:43:01.0994603Z Verifying : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 83/107 2025-05-07T19:43:01.0995171Z Verifying : perl-URI-5.09-1.amzn2023.0.2.noarch 84/107 2025-05-07T19:43:01.0995654Z Verifying : perl-base-2.27-477.amzn2023.0.6.noarch 85/107 2025-05-07T19:43:01.0996178Z Verifying : perl-constant-1.33-459.amzn2023.0.2.noarch 86/107 2025-05-07T19:43:01.0996671Z Verifying : perl-if-0.60.800-477.amzn2023.0.6.noarch 87/107 2025-05-07T19:43:01.0997172Z Verifying : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 88/107 2025-05-07T19:43:01.0997659Z Verifying : perl-lib-0.65-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:43:01.0998164Z Verifying : perl-libnet-3.13-2.amzn2023.0.2.noarch 90/107 2025-05-07T19:43:01.0998666Z Verifying : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 91/107 2025-05-07T19:43:01.0999134Z Verifying : perl-mro-1.23-477.amzn2023.0.6.x86_64 92/107 2025-05-07T19:43:01.0999652Z Verifying : perl-overload-1.31-477.amzn2023.0.6.noarch 93/107 2025-05-07T19:43:01.1000167Z Verifying : perl-overloading-0.02-477.amzn2023.0.6.noarch 94/107 2025-05-07T19:43:01.1000681Z Verifying : perl-parent-1:0.238-458.amzn2023.0.2.noarch 95/107 2025-05-07T19:43:01.1001181Z Verifying : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 96/107 2025-05-07T19:43:01.1001669Z Verifying : perl-subs-1.03-477.amzn2023.0.6.noarch 97/107 2025-05-07T19:43:01.1002170Z Verifying : perl-vars-1.05-477.amzn2023.0.6.noarch 98/107 2025-05-07T19:43:01.1002653Z Verifying : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 99/107 2025-05-07T19:43:01.1003140Z Verifying : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 100/107 2025-05-07T19:43:01.1003632Z Verifying : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 101/107 2025-05-07T19:43:01.1004164Z Verifying : systemd-libs-252.23-3.amzn2023.x86_64 102/107 2025-05-07T19:43:01.1004651Z Verifying : tar-2:1.34-1.amzn2023.0.4.x86_64 103/107 2025-05-07T19:43:01.1005116Z Verifying : util-linux-2.37.4-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:43:01.1005625Z Verifying : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 105/107 2025-05-07T19:43:01.1006103Z Verifying : wget-1.21.3-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:43:01.1970394Z Verifying : which-2.21-26.amzn2023.0.2.x86_64 107/107 2025-05-07T19:43:01.1971523Z 2025-05-07T19:43:01.1971781Z Installed: 2025-05-07T19:43:01.1972723Z binutils-2.41-50.amzn2023.0.3.x86_64 2025-05-07T19:43:01.1974289Z cracklib-2.9.6-27.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1975922Z cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 2025-05-07T19:43:01.1976929Z elfutils-debuginfod-client-0.188-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1977481Z findutils-1:4.8.0-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1977942Z git-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1978421Z git-core-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1978930Z git-core-doc-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.1979531Z gnutls-3.8.3-6.amzn2023.0.1.x86_64 2025-05-07T19:43:01.1980237Z groff-base-1.22.4-7.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1980803Z gzip-1.12-1.amzn2023.0.1.x86_64 2025-05-07T19:43:01.1981331Z hwdata-0.384-1.amzn2023.0.3.noarch 2025-05-07T19:43:01.1981969Z jansson-2.14-0.amzn2023.x86_64 2025-05-07T19:43:01.1982497Z kmod-libs-29-2.amzn2023.0.5.x86_64 2025-05-07T19:43:01.1983013Z less-608-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1983512Z libcbor-0.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1984031Z libdb-5.3.28-49.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1984533Z libeconf-0.4.0-1.amzn2023.0.3.x86_64 2025-05-07T19:43:01.1985091Z libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1985645Z libfdisk-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.1986162Z libfido2-1.10.0-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1986717Z libmetalink-0.1.3-14.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1987272Z libpwquality-1.4.4-6.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1987837Z libsemanage-3.4-5.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1988387Z libutempter-1.2.1-4.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1988913Z nano-8.3-1.amzn2023.x86_64 2025-05-07T19:43:01.1989464Z nano-default-editor-8.3-1.amzn2023.noarch 2025-05-07T19:43:01.1990032Z ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 2025-05-07T19:43:01.1990579Z nettle-3.10.1-1.amzn2023.0.1.x86_64 2025-05-07T19:43:01.1991106Z openldap-2.4.57-6.amzn2023.0.7.x86_64 2025-05-07T19:43:01.1991633Z openssh-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:43:01.1992283Z openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:43:01.1992771Z pam-1.5.1-8.amzn2023.0.4.x86_64 2025-05-07T19:43:01.1993247Z pciutils-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1993742Z pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1994305Z perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.1994827Z perl-B-1.80-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.1995355Z perl-Carp-1.50-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.1995926Z perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.1996482Z perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1997141Z perl-Digest-1.20-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.1997677Z perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1998251Z perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.1998784Z perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 2025-05-07T19:43:01.1999331Z perl-Errno-1.30-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.1999878Z perl-Error-1:0.17029-5.amzn2023.0.2.noarch 2025-05-07T19:43:01.2000402Z perl-Exporter-5.74-459.amzn2023.0.2.noarch 2025-05-07T19:43:01.2000961Z perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2001507Z perl-File-Basename-2.85-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2002154Z perl-File-Find-1.37-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2002700Z perl-File-Path-2.18-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.2003213Z perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.2003747Z perl-File-stat-1.09-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2004277Z perl-FileHandle-2.03-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2004823Z perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.2005347Z perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2005878Z perl-Git-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.2006407Z perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 2025-05-07T19:43:01.2006909Z perl-IO-1.43-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2007435Z perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 2025-05-07T19:43:01.2007966Z perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.2008527Z perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2009063Z perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2009624Z perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 2025-05-07T19:43:01.2010187Z perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2010700Z perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 2025-05-07T19:43:01.2011246Z perl-POSIX-1.94-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2011771Z perl-PathTools-3.78-459.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2012331Z perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.2012889Z perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 2025-05-07T19:43:01.2013450Z perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.2013984Z perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.2014510Z perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2015081Z perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2015636Z perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2016145Z perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2016699Z perl-Symbol-1.08-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2017323Z perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 2025-05-07T19:43:01.2017893Z perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.2018453Z perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2019022Z perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.2019900Z perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noarch 2025-05-07T19:43:01.2020496Z perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 2025-05-07T19:43:01.2021076Z perl-URI-5.09-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.2021629Z perl-base-2.27-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2022410Z perl-constant-1.33-459.amzn2023.0.2.noarch 2025-05-07T19:43:01.2023006Z perl-if-0.60.800-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2023777Z perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2024370Z perl-lib-0.65-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2024931Z perl-libnet-3.13-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.2025514Z perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2026048Z perl-mro-1.23-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2026642Z perl-overload-1.31-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2027270Z perl-overloading-0.02-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2027850Z perl-parent-1:0.238-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.2028433Z perl-podlators-1:4.14-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.2029008Z perl-subs-1.03-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2029588Z perl-vars-1.05-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2030162Z shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 2025-05-07T19:43:01.2030685Z sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:43:01.2031260Z sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:43:01.2031833Z systemd-libs-252.23-3.amzn2023.x86_64 2025-05-07T19:43:01.2032379Z tar-2:1.34-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.2032897Z util-linux-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.2033526Z util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.2034212Z wget-1.21.3-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.2034682Z which-2.21-26.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2035003Z 2025-05-07T19:43:01.2035094Z Complete! 2025-05-07T19:43:01.2704109Z ##[group]Run actions/checkout@v4 2025-05-07T19:43:01.2704517Z with: 2025-05-07T19:43:01.2704752Z submodules: true 2025-05-07T19:43:01.2705054Z repository: pytorch/FBGEMM 2025-05-07T19:43:01.2705579Z token: *** 2025-05-07T19:43:01.2705814Z ssh-strict: true 2025-05-07T19:43:01.2706087Z ssh-user: git 2025-05-07T19:43:01.2706343Z persist-credentials: true 2025-05-07T19:43:01.2706655Z clean: true 2025-05-07T19:43:01.2706915Z sparse-checkout-cone-mode: true 2025-05-07T19:43:01.2707251Z fetch-depth: 1 2025-05-07T19:43:01.2707493Z fetch-tags: false 2025-05-07T19:43:01.2707766Z show-progress: true 2025-05-07T19:43:01.2708003Z lfs: false 2025-05-07T19:43:01.2708258Z set-safe-directory: true 2025-05-07T19:43:01.2708750Z env: 2025-05-07T19:43:01.2709012Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:01.2709361Z BUILD_ENV: build_binary 2025-05-07T19:43:01.2709621Z BUILD_TARGET: default 2025-05-07T19:43:01.2709959Z BUILD_VARIANT: cuda 2025-05-07T19:43:01.2710300Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:01.2710610Z ##[endgroup] 2025-05-07T19:43:01.2755088Z ##[command]/usr/bin/docker exec 116e1204f840def26d5a12bed91c8919f60dd50f044201ed2ddf00f7f7c08ce4 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T19:43:01.6508826Z Syncing repository: pytorch/FBGEMM 2025-05-07T19:43:01.6510219Z ##[group]Getting Git version info 2025-05-07T19:43:01.6510557Z Working directory is '/__w/FBGEMM/FBGEMM' 2025-05-07T19:43:01.6511171Z [command]/usr/bin/git version 2025-05-07T19:43:01.6511450Z git version 2.47.1 2025-05-07T19:43:01.6512379Z ##[endgroup] 2025-05-07T19:43:01.6521903Z Temporarily overriding HOME='/__w/_temp/d4dc27fe-768e-43cb-9e33-1810914c8fb8' before making global git config changes 2025-05-07T19:43:01.6522899Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T19:43:01.6532598Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T19:43:01.6564653Z [command]/usr/bin/git config --local --get remote.origin.url 2025-05-07T19:43:01.6583604Z https://github.com/pytorch/FBGEMM 2025-05-07T19:43:01.6597668Z ##[group]Removing previously created refs, to avoid conflicts 2025-05-07T19:43:01.6600774Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-05-07T19:43:01.6620127Z HEAD 2025-05-07T19:43:01.6653201Z ##[endgroup] 2025-05-07T19:43:01.6653525Z [command]/usr/bin/git submodule status 2025-05-07T19:43:01.7022609Z e5d7c0bd5d9aec44d68830187138149e6a8c4e32 external/asmjit (e5d7c0b) 2025-05-07T19:43:01.7095157Z 4a61bdd4bd4ed730e078aebc7c0fcf046ff29406 external/composable_kernel (4a61bdd) 2025-05-07T19:43:01.7161523Z 6543fec09b2f04ac4a666882998b534afc9c1349 external/cpuinfo (6543fec) 2025-05-07T19:43:01.7234751Z 3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3 external/cutlass (3ed8d2e) 2025-05-07T19:43:01.7308525Z f8d7d77c06936315286eb55f8de22cd23c188571 external/googletest (f8d7d77) 2025-05-07T19:43:01.7382730Z 420084499c7c1e1c2d801922f40df202eac5f3a0 external/hipify_torch (4200844) 2025-05-07T19:43:01.7456004Z 9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03 external/json (9cca280) 2025-05-07T19:43:01.7460933Z ##[group]Cleaning the repository 2025-05-07T19:43:01.7463463Z [command]/usr/bin/git clean -ffdx 2025-05-07T19:43:02.0342028Z Removing build_only/ 2025-05-07T19:43:02.0342390Z Removing collect_env.py 2025-05-07T19:43:02.0342688Z Removing fbgemm_gpu/_skbuild/ 2025-05-07T19:43:02.0343076Z Removing fbgemm_gpu/codegen/genscript/__pycache__/ 2025-05-07T19:43:02.0343464Z Removing fbgemm_gpu/dist/ 2025-05-07T19:43:02.0343806Z Removing fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:43:02.0344305Z Removing fbgemm_gpu/fbgemm_gpu_nightly.egg-info/ 2025-05-07T19:43:02.0350921Z [command]/usr/bin/git reset --hard HEAD 2025-05-07T19:43:02.1430896Z HEAD is now at 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:02.1435001Z ##[endgroup] 2025-05-07T19:43:02.1437607Z ##[group]Disabling automatic garbage collection 2025-05-07T19:43:02.1442761Z [command]/usr/bin/git config --local gc.auto 0 2025-05-07T19:43:02.1472912Z ##[endgroup] 2025-05-07T19:43:02.1473355Z ##[group]Setting up auth 2025-05-07T19:43:02.1478655Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T19:43:02.1500906Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T19:43:02.1786528Z Entering 'external/asmjit' 2025-05-07T19:43:02.1860698Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.1927946Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.1990820Z Entering 'external/cutlass' 2025-05-07T19:43:02.2068113Z Entering 'external/googletest' 2025-05-07T19:43:02.2135854Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.2198122Z Entering 'external/json' 2025-05-07T19:43:02.2273353Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T19:43:02.2299736Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T19:43:02.2620748Z Entering 'external/asmjit' 2025-05-07T19:43:02.2674804Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.2731628Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.2799112Z Entering 'external/cutlass' 2025-05-07T19:43:02.2871611Z Entering 'external/googletest' 2025-05-07T19:43:02.2925246Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.2982409Z Entering 'external/json' 2025-05-07T19:43:02.3070290Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:43:02.3118515Z ##[endgroup] 2025-05-07T19:43:02.3119021Z ##[group]Fetching the repository 2025-05-07T19:43:02.3123300Z [command]/usr/bin/git -c protocol.version=2 fetch --no-tags --prune --no-recurse-submodules --depth=1 origin +a2f4c52051596e74bc8c16e3d2867a4ecdd271e0:refs/remotes/pull/4066/merge 2025-05-07T19:43:02.5056906Z From https://github.com/pytorch/FBGEMM 2025-05-07T19:43:02.5058053Z + 1c9ad64...a2f4c52 a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 -> pull/4066/merge (forced update) 2025-05-07T19:43:02.5071174Z ##[endgroup] 2025-05-07T19:43:02.5071621Z ##[group]Determining the checkout info 2025-05-07T19:43:02.5107479Z ##[endgroup] 2025-05-07T19:43:02.5107832Z [command]/usr/bin/git sparse-checkout disable 2025-05-07T19:43:02.5600086Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-05-07T19:43:02.5624947Z ##[group]Checking out the ref 2025-05-07T19:43:02.5626247Z [command]/usr/bin/git checkout --progress --force refs/remotes/pull/4066/merge 2025-05-07T19:43:02.6621584Z Warning: you are leaving 1 commit behind, not connected to 2025-05-07T19:43:02.6622687Z any of your branches: 2025-05-07T19:43:02.6622855Z 2025-05-07T19:43:02.6623284Z 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:02.6623759Z 2025-05-07T19:43:02.6623979Z If you want to keep it by creating a new branch, this may be a good time 2025-05-07T19:43:02.6624401Z to do so with: 2025-05-07T19:43:02.6624544Z 2025-05-07T19:43:02.6624684Z git branch 1c9ad64 2025-05-07T19:43:02.6624919Z 2025-05-07T19:43:02.6625326Z HEAD is now at a2f4c52 Merge 6060cd4b5f971680caecdcc657faccb5720d1c3e into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:02.6626607Z ##[endgroup] 2025-05-07T19:43:02.6627110Z ##[group]Setting up auth for fetching submodules 2025-05-07T19:43:02.6630365Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:43:02.6667970Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-05-07T19:43:02.6689217Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-05-07T19:43:02.6713867Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-05-07T19:43:02.6734369Z ##[endgroup] 2025-05-07T19:43:02.6735417Z ##[group]Fetching submodules 2025-05-07T19:43:02.6736739Z [command]/usr/bin/git submodule sync 2025-05-07T19:43:02.7051864Z Synchronizing submodule url for 'external/asmjit' 2025-05-07T19:43:02.7053240Z Synchronizing submodule url for 'external/composable_kernel' 2025-05-07T19:43:02.7054548Z Synchronizing submodule url for 'external/cpuinfo' 2025-05-07T19:43:02.7055681Z Synchronizing submodule url for 'external/cutlass' 2025-05-07T19:43:02.7056962Z Synchronizing submodule url for 'external/googletest' 2025-05-07T19:43:02.7057398Z Synchronizing submodule url for 'external/hipify_torch' 2025-05-07T19:43:02.7058107Z Synchronizing submodule url for 'external/json' 2025-05-07T19:43:02.7059083Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --depth=1 2025-05-07T19:43:02.7782241Z Submodule path 'external/asmjit': checked out 'e5d7c0bd5d9aec44d68830187138149e6a8c4e32' 2025-05-07T19:43:03.0452787Z Submodule path 'external/composable_kernel': checked out '4a61bdd4bd4ed730e078aebc7c0fcf046ff29406' 2025-05-07T19:43:03.1378286Z Submodule path 'external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-05-07T19:43:03.8063113Z Submodule path 'external/cutlass': checked out '3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3' 2025-05-07T19:43:03.8448716Z Submodule path 'external/googletest': checked out 'f8d7d77c06936315286eb55f8de22cd23c188571' 2025-05-07T19:43:03.8525463Z Submodule path 'external/hipify_torch': checked out '420084499c7c1e1c2d801922f40df202eac5f3a0' 2025-05-07T19:43:03.9590880Z Submodule path 'external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-05-07T19:43:03.9604983Z [command]/usr/bin/git submodule foreach git config --local gc.auto 0 2025-05-07T19:43:03.9942691Z Entering 'external/asmjit' 2025-05-07T19:43:03.9969561Z Entering 'external/composable_kernel' 2025-05-07T19:43:04.0004403Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.0027272Z Entering 'external/cutlass' 2025-05-07T19:43:04.0060285Z Entering 'external/googletest' 2025-05-07T19:43:04.0088354Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.0121698Z Entering 'external/json' 2025-05-07T19:43:04.0164564Z ##[endgroup] 2025-05-07T19:43:04.0165001Z ##[group]Persisting credentials for submodules 2025-05-07T19:43:04.0168544Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-05-07T19:43:04.0471510Z Entering 'external/asmjit' 2025-05-07T19:43:04.0513493Z url.https://github.com/.insteadof 2025-05-07T19:43:04.0514897Z url.https://github.com/.insteadof 2025-05-07T19:43:04.0550392Z Entering 'external/composable_kernel' 2025-05-07T19:43:04.0587290Z url.https://github.com/.insteadof 2025-05-07T19:43:04.0588283Z url.https://github.com/.insteadof 2025-05-07T19:43:04.0628255Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.0671431Z url.https://github.com/.insteadof 2025-05-07T19:43:04.0672457Z url.https://github.com/.insteadof 2025-05-07T19:43:04.0708899Z Entering 'external/cutlass' 2025-05-07T19:43:04.0753068Z url.https://github.com/.insteadof 2025-05-07T19:43:04.0754082Z url.https://github.com/.insteadof 2025-05-07T19:43:04.0801243Z Entering 'external/googletest' 2025-05-07T19:43:04.0845777Z url.https://github.com/.insteadof 2025-05-07T19:43:04.0846738Z url.https://github.com/.insteadof 2025-05-07T19:43:04.0883653Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.0919138Z url.https://github.com/.insteadof 2025-05-07T19:43:04.0919642Z url.https://github.com/.insteadof 2025-05-07T19:43:04.0959675Z Entering 'external/json' 2025-05-07T19:43:04.0997378Z url.https://github.com/.insteadof 2025-05-07T19:43:04.0997795Z url.https://github.com/.insteadof 2025-05-07T19:43:04.1055221Z [command]/usr/bin/git submodule foreach sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-05-07T19:43:04.1330911Z Entering 'external/asmjit' 2025-05-07T19:43:04.1383375Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/asmjit/config remote.origin.url 2025-05-07T19:43:04.1391608Z Entering 'external/composable_kernel' 2025-05-07T19:43:04.1446278Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/composable_kernel/config remote.origin.url 2025-05-07T19:43:04.1446888Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.1502373Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cpuinfo/config remote.origin.url 2025-05-07T19:43:04.1507763Z Entering 'external/cutlass' 2025-05-07T19:43:04.1555491Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cutlass/config remote.origin.url 2025-05-07T19:43:04.1557982Z Entering 'external/googletest' 2025-05-07T19:43:04.1607514Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/googletest/config remote.origin.url 2025-05-07T19:43:04.1608148Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.1655528Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/hipify_torch/config remote.origin.url 2025-05-07T19:43:04.1656287Z Entering 'external/json' 2025-05-07T19:43:04.1702213Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/json/config remote.origin.url 2025-05-07T19:43:04.1768572Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-05-07T19:43:04.2043568Z Entering 'external/asmjit' 2025-05-07T19:43:04.2065385Z Entering 'external/composable_kernel' 2025-05-07T19:43:04.2091515Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.2111606Z Entering 'external/cutlass' 2025-05-07T19:43:04.2134753Z Entering 'external/googletest' 2025-05-07T19:43:04.2162433Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.2196943Z Entering 'external/json' 2025-05-07T19:43:04.2237692Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-05-07T19:43:04.2503106Z Entering 'external/asmjit' 2025-05-07T19:43:04.2526233Z Entering 'external/composable_kernel' 2025-05-07T19:43:04.2549481Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.2583401Z Entering 'external/cutlass' 2025-05-07T19:43:04.2614704Z Entering 'external/googletest' 2025-05-07T19:43:04.2651033Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.2686127Z Entering 'external/json' 2025-05-07T19:43:04.2725587Z ##[endgroup] 2025-05-07T19:43:04.2754812Z [command]/usr/bin/git log -1 --format=%H 2025-05-07T19:43:04.2777307Z a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:04.2998800Z ##[group]Run . $PRELUDE; print_system_info 2025-05-07T19:43:04.2999220Z . $PRELUDE; print_system_info 2025-05-07T19:43:04.2999774Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:04.3000139Z env: 2025-05-07T19:43:04.3000403Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:04.3000715Z BUILD_ENV: build_binary 2025-05-07T19:43:04.3000999Z BUILD_TARGET: default 2025-05-07T19:43:04.3001246Z BUILD_VARIANT: cuda 2025-05-07T19:43:04.3001521Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:04.3001781Z ##[endgroup] 2025-05-07T19:43:04.7400760Z ################################################################################ 2025-05-07T19:43:04.7401157Z # Print System Info 2025-05-07T19:43:04.7401454Z # 2025-05-07T19:43:04.7418529Z # [2025-05-07T19:43:04.741Z] + print_system_info 2025-05-07T19:43:04.7418999Z ################################################################################ 2025-05-07T19:43:04.7419379Z 2025-05-07T19:43:04.7419601Z ################################################################################ 2025-05-07T19:43:04.7419960Z [INFO] Printing environment variables ... 2025-05-07T19:43:04.7420325Z + printenv 2025-05-07T19:43:04.7420449Z 2025-05-07T19:43:04.7431995Z GITHUB_WORKSPACE=/__w/FBGEMM/FBGEMM 2025-05-07T19:43:04.7432935Z BUILD_VARIANT=cuda 2025-05-07T19:43:04.7433592Z HOSTNAME=116e1204f840 2025-05-07T19:43:04.7434792Z GITHUB_PATH=/__w/_temp/_runner_file_commands/add_path_78b34970-a86f-4ce8-b2de-0b870bda1a0c 2025-05-07T19:43:04.7436197Z GITHUB_ACTION=__run_2 2025-05-07T19:43:04.7436937Z GITHUB_RUN_NUMBER=10601 2025-05-07T19:43:04.7437670Z RUNNER_NAME=i-08ad04b373d870bec 2025-05-07T19:43:04.7438471Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-05-07T19:43:04.7439369Z PLATFORM_NAME_LC=linux-x86_64 2025-05-07T19:43:04.7440161Z MACHINE_NAME_LC=x86_64 2025-05-07T19:43:04.7440850Z GITHUB_TRIGGERING_ACTOR=q10 2025-05-07T19:43:04.7441681Z PRELUDE=.github/scripts/setup_env.bash 2025-05-07T19:43:04.7442535Z GITHUB_REF_TYPE=branch 2025-05-07T19:43:04.7443449Z *** 2025-05-07T19:43:04.7443678Z GITHUB_REPOSITORY_ID=150154628 2025-05-07T19:43:04.7443990Z GITHUB_ACTIONS=true 2025-05-07T19:43:04.7447356Z GITHUB_SHA=a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:04.7447979Z GITHUB_WORKFLOW_REF=pytorch/FBGEMM/.github/workflows/fbgemm_gpu_ci_cuda.yml@refs/pull/4066/merge 2025-05-07T19:43:04.7448569Z RUNNER_ENVIRONMENT=self-hosted 2025-05-07T19:43:04.7448866Z GITHUB_REF=refs/pull/4066/merge 2025-05-07T19:43:04.7449275Z RUNNER_OS=Linux 2025-05-07T19:43:04.7449511Z GITHUB_REF_PROTECTED=false 2025-05-07T19:43:04.7449802Z HOME=/github/home 2025-05-07T19:43:04.7450062Z GITHUB_API_URL=https://api.github.com 2025-05-07T19:43:04.7450387Z RUNNER_ARCH=X64 2025-05-07T19:43:04.7450618Z RUNNER_TEMP=/__w/_temp 2025-05-07T19:43:04.7450893Z BUILD_TARGET=default 2025-05-07T19:43:04.7451342Z GITHUB_STATE=/__w/_temp/_runner_file_commands/save_state_78b34970-a86f-4ce8-b2de-0b870bda1a0c 2025-05-07T19:43:04.7451985Z GITHUB_ENV=/__w/_temp/_runner_file_commands/set_env_78b34970-a86f-4ce8-b2de-0b870bda1a0c 2025-05-07T19:43:04.7452504Z GITHUB_EVENT_PATH=/github/workflow/event.json 2025-05-07T19:43:04.7452834Z GITHUB_EVENT_NAME=pull_request 2025-05-07T19:43:04.7453138Z GITHUB_RUN_ID=14891846252 2025-05-07T19:43:04.7453607Z GITHUB_STEP_SUMMARY=/__w/_temp/_runner_file_commands/step_summary_78b34970-a86f-4ce8-b2de-0b870bda1a0c 2025-05-07T19:43:04.7454142Z BUILD_ENV=build_binary 2025-05-07T19:43:04.7454379Z GITHUB_ACTOR=q10 2025-05-07T19:43:04.7454627Z GITHUB_RUN_ATTEMPT=1 2025-05-07T19:43:04.7454882Z KERN_NAME_LC=linux 2025-05-07T19:43:04.7455115Z BUILD_CUDA_VERSION=12.6.3 2025-05-07T19:43:04.7455442Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-05-07T19:43:04.7455964Z PLATFORM_NAME=Linux-x86_64 2025-05-07T19:43:04.7456382Z GITHUB_SERVER_URL=https://github.com 2025-05-07T19:43:04.7456705Z SHLVL=1 2025-05-07T19:43:04.7456932Z GITHUB_ACTOR_ID=255046 2025-05-07T19:43:04.7457182Z RUNNER_TOOL_CACHE=/__w/_tool 2025-05-07T19:43:04.7457724Z GITHUB_WORKFLOW_SHA=6060cd4b5f971680caecdcc657faccb5720d1c3e 2025-05-07T19:43:04.7458123Z GITHUB_REF_NAME=4066/merge 2025-05-07T19:43:04.7458445Z KERN_NAME=Linux 2025-05-07T19:43:04.7458703Z GITHUB_JOB=build_artifact 2025-05-07T19:43:04.7458983Z GITHUB_REPOSITORY=pytorch/FBGEMM 2025-05-07T19:43:04.7459393Z GITHUB_RETENTION_DAYS=90 2025-05-07T19:43:04.7459660Z RUNNER_WORKSPACE=/__w/FBGEMM 2025-05-07T19:43:04.7459961Z GITHUB_ACTION_REPOSITORY= 2025-05-07T19:43:04.7460473Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:43:04.7460908Z GITHUB_BASE_REF=main 2025-05-07T19:43:04.7461145Z CI=true 2025-05-07T19:43:04.7461390Z GITHUB_REPOSITORY_OWNER=pytorch 2025-05-07T19:43:04.7461685Z GITHUB_HEAD_REF=bm/genai-rocm-oss-6 2025-05-07T19:43:04.7462005Z GITHUB_ACTION_REF= 2025-05-07T19:43:04.7462266Z GITHUB_WORKFLOW=FBGEMM GPU/GenAI CUDA CI 2025-05-07T19:43:04.7462793Z GITHUB_OUTPUT=/__w/_temp/_runner_file_commands/set_output_78b34970-a86f-4ce8-b2de-0b870bda1a0c 2025-05-07T19:43:04.7463311Z MACHINE_NAME=x86_64 2025-05-07T19:43:04.7463554Z _=/usr/bin/printenv 2025-05-07T19:43:04.7463734Z 2025-05-07T19:43:04.7463858Z ################################################################################ 2025-05-07T19:43:04.7464211Z [INFO] Print ldd version ... 2025-05-07T19:43:04.7464518Z + ldd --version 2025-05-07T19:43:04.7464661Z 2025-05-07T19:43:04.7464785Z ldd (GNU libc) 2.34 2025-05-07T19:43:04.7465076Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:43:04.7465572Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:43:04.7466141Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:43:04.7466645Z Written by Roland McGrath and Ulrich Drepper. 2025-05-07T19:43:04.7466879Z 2025-05-07T19:43:04.7467000Z ################################################################################ 2025-05-07T19:43:04.7467358Z [INFO] Print CPU info ... 2025-05-07T19:43:04.7467613Z + nproc 2025-05-07T19:43:04.7467756Z 2025-05-07T19:43:04.7470336Z 96 2025-05-07T19:43:04.7471655Z 2025-05-07T19:43:04.7472206Z + lscpu 2025-05-07T19:43:04.7472386Z 2025-05-07T19:43:04.7738026Z Architecture: x86_64 2025-05-07T19:43:04.7738694Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:43:04.7739326Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7739782Z Byte Order: Little Endian 2025-05-07T19:43:04.7740122Z CPU(s): 96 2025-05-07T19:43:04.7740546Z On-line CPU(s) list: 0-95 2025-05-07T19:43:04.7740885Z Vendor ID: GenuineIntel 2025-05-07T19:43:04.7741320Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7741723Z CPU family: 6 2025-05-07T19:43:04.7742045Z Model: 85 2025-05-07T19:43:04.7742381Z Thread(s) per core: 2 2025-05-07T19:43:04.7742695Z Core(s) per socket: 24 2025-05-07T19:43:04.7743022Z Socket(s): 2 2025-05-07T19:43:04.7743318Z Stepping: 7 2025-05-07T19:43:04.7743663Z BogoMIPS: 5999.99 2025-05-07T19:43:04.7746051Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7748469Z Hypervisor vendor: KVM 2025-05-07T19:43:04.7748924Z Virtualization type: full 2025-05-07T19:43:04.7749316Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:43:04.7749703Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:43:04.7750118Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:43:04.7750498Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:43:04.7750866Z NUMA node(s): 2 2025-05-07T19:43:04.7751211Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:43:04.7751552Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:43:04.7752148Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:43:04.7752707Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:43:04.7753180Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:43:04.7753780Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:04.7754341Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:43:04.7754947Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:04.7755738Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:43:04.7756119Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:43:04.7756517Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:43:04.7757072Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:43:04.7757683Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:43:04.7758550Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:43:04.7759243Z Vulnerability Srbds: Not affected 2025-05-07T19:43:04.7759839Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:43:04.7760099Z 2025-05-07T19:43:04.7760199Z + cat /proc/cpuinfo 2025-05-07T19:43:04.7760345Z 2025-05-07T19:43:04.7760584Z processor : 0 2025-05-07T19:43:04.7760887Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7761187Z cpu family : 6 2025-05-07T19:43:04.7761410Z model : 85 2025-05-07T19:43:04.7761747Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7762133Z stepping : 7 2025-05-07T19:43:04.7762385Z microcode : 0x5003901 2025-05-07T19:43:04.7762655Z cpu MHz : 1559.160 2025-05-07T19:43:04.7762883Z cache size : 36608 KB 2025-05-07T19:43:04.7763151Z physical id : 0 2025-05-07T19:43:04.7763378Z siblings : 48 2025-05-07T19:43:04.7763633Z core id : 0 2025-05-07T19:43:04.7763852Z cpu cores : 24 2025-05-07T19:43:04.7764099Z apicid : 0 2025-05-07T19:43:04.7764322Z initial apicid : 0 2025-05-07T19:43:04.7764578Z fpu : yes 2025-05-07T19:43:04.7764804Z fpu_exception : yes 2025-05-07T19:43:04.7765075Z cpuid level : 13 2025-05-07T19:43:04.7765332Z wp : yes 2025-05-07T19:43:04.7767630Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7770308Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7770947Z bogomips : 5999.99 2025-05-07T19:43:04.7771182Z clflush size : 64 2025-05-07T19:43:04.7771514Z cache_alignment : 64 2025-05-07T19:43:04.7771807Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7772248Z power management: 2025-05-07T19:43:04.7772391Z 2025-05-07T19:43:04.7772489Z processor : 1 2025-05-07T19:43:04.7772740Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7773033Z cpu family : 6 2025-05-07T19:43:04.7773258Z model : 85 2025-05-07T19:43:04.7773584Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7773949Z stepping : 7 2025-05-07T19:43:04.7774195Z microcode : 0x5003901 2025-05-07T19:43:04.7774441Z cpu MHz : 1622.617 2025-05-07T19:43:04.7774693Z cache size : 36608 KB 2025-05-07T19:43:04.7774943Z physical id : 0 2025-05-07T19:43:04.7775190Z siblings : 48 2025-05-07T19:43:04.7775408Z core id : 1 2025-05-07T19:43:04.7775643Z cpu cores : 24 2025-05-07T19:43:04.7775876Z apicid : 2 2025-05-07T19:43:04.7776116Z initial apicid : 2 2025-05-07T19:43:04.7776379Z fpu : yes 2025-05-07T19:43:04.7776593Z fpu_exception : yes 2025-05-07T19:43:04.7776857Z cpuid level : 13 2025-05-07T19:43:04.7777144Z wp : yes 2025-05-07T19:43:04.7779542Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7782222Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7782826Z bogomips : 5999.99 2025-05-07T19:43:04.7783091Z clflush size : 64 2025-05-07T19:43:04.7783333Z cache_alignment : 64 2025-05-07T19:43:04.7783652Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7783998Z power management: 2025-05-07T19:43:04.7784173Z 2025-05-07T19:43:04.7784268Z processor : 2 2025-05-07T19:43:04.7784527Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7784779Z cpu family : 6 2025-05-07T19:43:04.7785093Z model : 85 2025-05-07T19:43:04.7785385Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7785783Z stepping : 7 2025-05-07T19:43:04.7786024Z microcode : 0x5003901 2025-05-07T19:43:04.7786307Z cpu MHz : 2976.098 2025-05-07T19:43:04.7786544Z cache size : 36608 KB 2025-05-07T19:43:04.7786809Z physical id : 0 2025-05-07T19:43:04.7787032Z siblings : 48 2025-05-07T19:43:04.7787265Z core id : 2 2025-05-07T19:43:04.7787476Z cpu cores : 24 2025-05-07T19:43:04.7787721Z apicid : 4 2025-05-07T19:43:04.7787952Z initial apicid : 4 2025-05-07T19:43:04.7788176Z fpu : yes 2025-05-07T19:43:04.7788410Z fpu_exception : yes 2025-05-07T19:43:04.7788640Z cpuid level : 13 2025-05-07T19:43:04.7788880Z wp : yes 2025-05-07T19:43:04.7791149Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7793940Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7794540Z bogomips : 5999.99 2025-05-07T19:43:04.7794761Z clflush size : 64 2025-05-07T19:43:04.7795008Z cache_alignment : 64 2025-05-07T19:43:04.7795287Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7795645Z power management: 2025-05-07T19:43:04.7795782Z 2025-05-07T19:43:04.7795896Z processor : 3 2025-05-07T19:43:04.7796190Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7796465Z cpu family : 6 2025-05-07T19:43:04.7796687Z model : 85 2025-05-07T19:43:04.7796996Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7797354Z stepping : 7 2025-05-07T19:43:04.7797600Z microcode : 0x5003901 2025-05-07T19:43:04.7797835Z cpu MHz : 2999.996 2025-05-07T19:43:04.7798085Z cache size : 36608 KB 2025-05-07T19:43:04.7798321Z physical id : 0 2025-05-07T19:43:04.7798562Z siblings : 48 2025-05-07T19:43:04.7798774Z core id : 3 2025-05-07T19:43:04.7799009Z cpu cores : 24 2025-05-07T19:43:04.7799242Z apicid : 6 2025-05-07T19:43:04.7799450Z initial apicid : 6 2025-05-07T19:43:04.7799698Z fpu : yes 2025-05-07T19:43:04.7799909Z fpu_exception : yes 2025-05-07T19:43:04.7800172Z cpuid level : 13 2025-05-07T19:43:04.7800391Z wp : yes 2025-05-07T19:43:04.7802642Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7805257Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7805847Z bogomips : 5999.99 2025-05-07T19:43:04.7806110Z clflush size : 64 2025-05-07T19:43:04.7806348Z cache_alignment : 64 2025-05-07T19:43:04.7806666Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7807009Z power management: 2025-05-07T19:43:04.7807176Z 2025-05-07T19:43:04.7807270Z processor : 4 2025-05-07T19:43:04.7807528Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7807786Z cpu family : 6 2025-05-07T19:43:04.7808026Z model : 85 2025-05-07T19:43:04.7808315Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7808761Z stepping : 7 2025-05-07T19:43:04.7808983Z microcode : 0x5003901 2025-05-07T19:43:04.7809256Z cpu MHz : 3247.810 2025-05-07T19:43:04.7809655Z cache size : 36608 KB 2025-05-07T19:43:04.7809954Z physical id : 0 2025-05-07T19:43:04.7810191Z siblings : 48 2025-05-07T19:43:04.7810429Z core id : 4 2025-05-07T19:43:04.7810640Z cpu cores : 24 2025-05-07T19:43:04.7810885Z apicid : 8 2025-05-07T19:43:04.7811118Z initial apicid : 8 2025-05-07T19:43:04.7811342Z fpu : yes 2025-05-07T19:43:04.7811579Z fpu_exception : yes 2025-05-07T19:43:04.7811812Z cpuid level : 13 2025-05-07T19:43:04.7812046Z wp : yes 2025-05-07T19:43:04.7814316Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7816967Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7817585Z bogomips : 5999.99 2025-05-07T19:43:04.7817819Z clflush size : 64 2025-05-07T19:43:04.7818071Z cache_alignment : 64 2025-05-07T19:43:04.7818360Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7818726Z power management: 2025-05-07T19:43:04.7818867Z 2025-05-07T19:43:04.7818988Z processor : 5 2025-05-07T19:43:04.7819292Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7819580Z cpu family : 6 2025-05-07T19:43:04.7819886Z model : 85 2025-05-07T19:43:04.7820253Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7820623Z stepping : 7 2025-05-07T19:43:04.7820878Z microcode : 0x5003901 2025-05-07T19:43:04.7821119Z cpu MHz : 2372.645 2025-05-07T19:43:04.7821379Z cache size : 36608 KB 2025-05-07T19:43:04.7821624Z physical id : 0 2025-05-07T19:43:04.7821880Z siblings : 48 2025-05-07T19:43:04.7822388Z core id : 5 2025-05-07T19:43:04.7822667Z cpu cores : 24 2025-05-07T19:43:04.7822913Z apicid : 10 2025-05-07T19:43:04.7823137Z initial apicid : 10 2025-05-07T19:43:04.7823396Z fpu : yes 2025-05-07T19:43:04.7823610Z fpu_exception : yes 2025-05-07T19:43:04.7823866Z cpuid level : 13 2025-05-07T19:43:04.7824085Z wp : yes 2025-05-07T19:43:04.7826376Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7829026Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7829626Z bogomips : 5999.99 2025-05-07T19:43:04.7829878Z clflush size : 64 2025-05-07T19:43:04.7830107Z cache_alignment : 64 2025-05-07T19:43:04.7830414Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7830753Z power management: 2025-05-07T19:43:04.7830915Z 2025-05-07T19:43:04.7831007Z processor : 6 2025-05-07T19:43:04.7831252Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7831500Z cpu family : 6 2025-05-07T19:43:04.7831737Z model : 85 2025-05-07T19:43:04.7832029Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7832411Z stepping : 7 2025-05-07T19:43:04.7832630Z microcode : 0x5003901 2025-05-07T19:43:04.7833014Z cpu MHz : 2999.996 2025-05-07T19:43:04.7833241Z cache size : 36608 KB 2025-05-07T19:43:04.7833508Z physical id : 0 2025-05-07T19:43:04.7833727Z siblings : 48 2025-05-07T19:43:04.7833964Z core id : 6 2025-05-07T19:43:04.7834199Z cpu cores : 24 2025-05-07T19:43:04.7834515Z apicid : 12 2025-05-07T19:43:04.7834748Z initial apicid : 12 2025-05-07T19:43:04.7834972Z fpu : yes 2025-05-07T19:43:04.7835203Z fpu_exception : yes 2025-05-07T19:43:04.7835426Z cpuid level : 13 2025-05-07T19:43:04.7835671Z wp : yes 2025-05-07T19:43:04.7837878Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7840460Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7841072Z bogomips : 5999.99 2025-05-07T19:43:04.7841301Z clflush size : 64 2025-05-07T19:43:04.7841558Z cache_alignment : 64 2025-05-07T19:43:04.7841842Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7842199Z power management: 2025-05-07T19:43:04.7842339Z 2025-05-07T19:43:04.7842457Z processor : 7 2025-05-07T19:43:04.7842677Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7842942Z cpu family : 6 2025-05-07T19:43:04.7843153Z model : 85 2025-05-07T19:43:04.7843456Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7843892Z stepping : 7 2025-05-07T19:43:04.7844140Z microcode : 0x5003901 2025-05-07T19:43:04.7844383Z cpu MHz : 2999.996 2025-05-07T19:43:04.7844648Z cache size : 36608 KB 2025-05-07T19:43:04.7844885Z physical id : 0 2025-05-07T19:43:04.7845118Z siblings : 48 2025-05-07T19:43:04.7845332Z core id : 7 2025-05-07T19:43:04.7845557Z cpu cores : 24 2025-05-07T19:43:04.7845777Z apicid : 14 2025-05-07T19:43:04.7845997Z initial apicid : 14 2025-05-07T19:43:04.7846251Z fpu : yes 2025-05-07T19:43:04.7846467Z fpu_exception : yes 2025-05-07T19:43:04.7846726Z cpuid level : 13 2025-05-07T19:43:04.7846935Z wp : yes 2025-05-07T19:43:04.7849176Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7851775Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7852362Z bogomips : 5999.99 2025-05-07T19:43:04.7852625Z clflush size : 64 2025-05-07T19:43:04.7852855Z cache_alignment : 64 2025-05-07T19:43:04.7853171Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7853508Z power management: 2025-05-07T19:43:04.7853677Z 2025-05-07T19:43:04.7853772Z processor : 8 2025-05-07T19:43:04.7854024Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7854276Z cpu family : 6 2025-05-07T19:43:04.7854533Z model : 85 2025-05-07T19:43:04.7854814Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7855194Z stepping : 7 2025-05-07T19:43:04.7855414Z microcode : 0x5003901 2025-05-07T19:43:04.7855675Z cpu MHz : 2999.996 2025-05-07T19:43:04.7855898Z cache size : 36608 KB 2025-05-07T19:43:04.7856147Z physical id : 0 2025-05-07T19:43:04.7856409Z siblings : 48 2025-05-07T19:43:04.7856639Z core id : 8 2025-05-07T19:43:04.7856877Z cpu cores : 24 2025-05-07T19:43:04.7857083Z apicid : 16 2025-05-07T19:43:04.7857318Z initial apicid : 16 2025-05-07T19:43:04.7857541Z fpu : yes 2025-05-07T19:43:04.7857769Z fpu_exception : yes 2025-05-07T19:43:04.7857999Z cpuid level : 13 2025-05-07T19:43:04.7858235Z wp : yes 2025-05-07T19:43:04.7860734Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7863402Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7864021Z bogomips : 5999.99 2025-05-07T19:43:04.7864260Z clflush size : 64 2025-05-07T19:43:04.7864518Z cache_alignment : 64 2025-05-07T19:43:04.7864808Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7865173Z power management: 2025-05-07T19:43:04.7865311Z 2025-05-07T19:43:04.7865423Z processor : 9 2025-05-07T19:43:04.7865648Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7865925Z cpu family : 6 2025-05-07T19:43:04.7866138Z model : 85 2025-05-07T19:43:04.7866446Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7866810Z stepping : 7 2025-05-07T19:43:04.7867051Z microcode : 0x5003901 2025-05-07T19:43:04.7867355Z cpu MHz : 2999.996 2025-05-07T19:43:04.7867608Z cache size : 36608 KB 2025-05-07T19:43:04.7867848Z physical id : 0 2025-05-07T19:43:04.7868103Z siblings : 48 2025-05-07T19:43:04.7868325Z core id : 9 2025-05-07T19:43:04.7868569Z cpu cores : 24 2025-05-07T19:43:04.7868816Z apicid : 18 2025-05-07T19:43:04.7869035Z initial apicid : 18 2025-05-07T19:43:04.7869286Z fpu : yes 2025-05-07T19:43:04.7869496Z fpu_exception : yes 2025-05-07T19:43:04.7869747Z cpuid level : 13 2025-05-07T19:43:04.7869962Z wp : yes 2025-05-07T19:43:04.7872331Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7874772Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7875331Z bogomips : 5999.99 2025-05-07T19:43:04.7875565Z clflush size : 64 2025-05-07T19:43:04.7875788Z cache_alignment : 64 2025-05-07T19:43:04.7876078Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7876391Z power management: 2025-05-07T19:43:04.7876543Z 2025-05-07T19:43:04.7876633Z processor : 10 2025-05-07T19:43:04.7876869Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7877105Z cpu family : 6 2025-05-07T19:43:04.7877326Z model : 85 2025-05-07T19:43:04.7877592Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7877955Z stepping : 7 2025-05-07T19:43:04.7878158Z microcode : 0x5003901 2025-05-07T19:43:04.7878406Z cpu MHz : 3202.839 2025-05-07T19:43:04.7878621Z cache size : 36608 KB 2025-05-07T19:43:04.7878861Z physical id : 0 2025-05-07T19:43:04.7879071Z siblings : 48 2025-05-07T19:43:04.7879294Z core id : 10 2025-05-07T19:43:04.7879576Z cpu cores : 24 2025-05-07T19:43:04.7879776Z apicid : 20 2025-05-07T19:43:04.7880001Z initial apicid : 20 2025-05-07T19:43:04.7880220Z fpu : yes 2025-05-07T19:43:04.7880450Z fpu_exception : yes 2025-05-07T19:43:04.7880665Z cpuid level : 13 2025-05-07T19:43:04.7880897Z wp : yes 2025-05-07T19:43:04.7882993Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7885826Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7886456Z bogomips : 5999.99 2025-05-07T19:43:04.7886689Z clflush size : 64 2025-05-07T19:43:04.7886942Z cache_alignment : 64 2025-05-07T19:43:04.7887224Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7887567Z power management: 2025-05-07T19:43:04.7887713Z 2025-05-07T19:43:04.7887829Z processor : 11 2025-05-07T19:43:04.7888058Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7888332Z cpu family : 6 2025-05-07T19:43:04.7888547Z model : 85 2025-05-07T19:43:04.7888866Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7889222Z stepping : 7 2025-05-07T19:43:04.7889436Z microcode : 0x5003901 2025-05-07T19:43:04.7890215Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:04.7890607Z cpu MHz : 1492.327 2025-05-07T19:43:04.7890819Z cache size : 36608 KB 2025-05-07T19:43:04.7891054Z physical id : 0 2025-05-07T19:43:04.7891274Z siblings : 48 2025-05-07T19:43:04.7891476Z core id : 11 2025-05-07T19:43:04.7891700Z cpu cores : 24 2025-05-07T19:43:04.7891896Z apicid : 22 2025-05-07T19:43:04.7892111Z initial apicid : 22 2025-05-07T19:43:04.7892319Z fpu : yes 2025-05-07T19:43:04.7892623Z fpu_exception : yes 2025-05-07T19:43:04.7892859Z cpuid level : 13 2025-05-07T19:43:04.7893106Z wp : yes 2025-05-07T19:43:04.7895397Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7898026Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7898657Z bogomips : 5999.99 2025-05-07T19:43:04.7898890Z clflush size : 64 2025-05-07T19:43:04.7899145Z cache_alignment : 64 2025-05-07T19:43:04.7899531Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7899872Z power management: 2025-05-07T19:43:04.7900008Z 2025-05-07T19:43:04.7900124Z processor : 12 2025-05-07T19:43:04.7900351Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7900681Z cpu family : 6 2025-05-07T19:43:04.7900875Z model : 85 2025-05-07T19:43:04.7901152Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7901494Z stepping : 7 2025-05-07T19:43:04.7901706Z microcode : 0x5003901 2025-05-07T19:43:04.7901925Z cpu MHz : 2999.996 2025-05-07T19:43:04.7902150Z cache size : 36608 KB 2025-05-07T19:43:04.7902385Z physical id : 0 2025-05-07T19:43:04.7902587Z siblings : 48 2025-05-07T19:43:04.7902792Z core id : 12 2025-05-07T19:43:04.7902985Z cpu cores : 24 2025-05-07T19:43:04.7903278Z apicid : 24 2025-05-07T19:43:04.7903478Z initial apicid : 24 2025-05-07T19:43:04.7903696Z fpu : yes 2025-05-07T19:43:04.7903887Z fpu_exception : yes 2025-05-07T19:43:04.7904111Z cpuid level : 13 2025-05-07T19:43:04.7904317Z wp : yes 2025-05-07T19:43:04.7906566Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7909187Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7909769Z bogomips : 5999.99 2025-05-07T19:43:04.7909994Z clflush size : 64 2025-05-07T19:43:04.7910220Z cache_alignment : 64 2025-05-07T19:43:04.7910484Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7910817Z power management: 2025-05-07T19:43:04.7910947Z 2025-05-07T19:43:04.7911031Z processor : 13 2025-05-07T19:43:04.7911255Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7911486Z cpu family : 6 2025-05-07T19:43:04.7911695Z model : 85 2025-05-07T19:43:04.7911962Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7912316Z stepping : 7 2025-05-07T19:43:04.7912645Z microcode : 0x5003901 2025-05-07T19:43:04.7912979Z cpu MHz : 2999.996 2025-05-07T19:43:04.7913185Z cache size : 36608 KB 2025-05-07T19:43:04.7913554Z physical id : 0 2025-05-07T19:43:04.7913886Z siblings : 48 2025-05-07T19:43:04.7914076Z core id : 13 2025-05-07T19:43:04.7914280Z cpu cores : 24 2025-05-07T19:43:04.7914472Z apicid : 26 2025-05-07T19:43:04.7914849Z initial apicid : 26 2025-05-07T19:43:04.7915059Z fpu : yes 2025-05-07T19:43:04.7915380Z fpu_exception : yes 2025-05-07T19:43:04.7915595Z cpuid level : 13 2025-05-07T19:43:04.7915833Z wp : yes 2025-05-07T19:43:04.7918102Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7920736Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7921329Z bogomips : 5999.99 2025-05-07T19:43:04.7921540Z clflush size : 64 2025-05-07T19:43:04.7921762Z cache_alignment : 64 2025-05-07T19:43:04.7922192Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7922512Z power management: 2025-05-07T19:43:04.7922643Z 2025-05-07T19:43:04.7922741Z processor : 14 2025-05-07T19:43:04.7922952Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7923199Z cpu family : 6 2025-05-07T19:43:04.7923395Z model : 85 2025-05-07T19:43:04.7923676Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7924019Z stepping : 7 2025-05-07T19:43:04.7924235Z microcode : 0x5003901 2025-05-07T19:43:04.7924455Z cpu MHz : 2999.996 2025-05-07T19:43:04.7924684Z cache size : 36608 KB 2025-05-07T19:43:04.7924920Z physical id : 0 2025-05-07T19:43:04.7925126Z siblings : 48 2025-05-07T19:43:04.7925341Z core id : 14 2025-05-07T19:43:04.7925536Z cpu cores : 24 2025-05-07T19:43:04.7925744Z apicid : 28 2025-05-07T19:43:04.7925940Z initial apicid : 28 2025-05-07T19:43:04.7926258Z fpu : yes 2025-05-07T19:43:04.7926451Z fpu_exception : yes 2025-05-07T19:43:04.7926678Z cpuid level : 13 2025-05-07T19:43:04.7926880Z wp : yes 2025-05-07T19:43:04.7929149Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7931774Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7932351Z bogomips : 5999.99 2025-05-07T19:43:04.7932582Z clflush size : 64 2025-05-07T19:43:04.7932807Z cache_alignment : 64 2025-05-07T19:43:04.7933073Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7933407Z power management: 2025-05-07T19:43:04.7933538Z 2025-05-07T19:43:04.7933624Z processor : 15 2025-05-07T19:43:04.7933853Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7934084Z cpu family : 6 2025-05-07T19:43:04.7934289Z model : 85 2025-05-07T19:43:04.7934560Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7934922Z stepping : 7 2025-05-07T19:43:04.7935127Z microcode : 0x5003901 2025-05-07T19:43:04.7935365Z cpu MHz : 2999.996 2025-05-07T19:43:04.7935591Z cache size : 36608 KB 2025-05-07T19:43:04.7935814Z physical id : 0 2025-05-07T19:43:04.7936036Z siblings : 48 2025-05-07T19:43:04.7936235Z core id : 15 2025-05-07T19:43:04.7936518Z cpu cores : 24 2025-05-07T19:43:04.7936716Z apicid : 30 2025-05-07T19:43:04.7936931Z initial apicid : 30 2025-05-07T19:43:04.7937143Z fpu : yes 2025-05-07T19:43:04.7937356Z fpu_exception : yes 2025-05-07T19:43:04.7937568Z cpuid level : 13 2025-05-07T19:43:04.7937782Z wp : yes 2025-05-07T19:43:04.7940115Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7942732Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7943320Z bogomips : 5999.99 2025-05-07T19:43:04.7943532Z clflush size : 64 2025-05-07T19:43:04.7943794Z cache_alignment : 64 2025-05-07T19:43:04.7944103Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7944441Z power management: 2025-05-07T19:43:04.7944583Z 2025-05-07T19:43:04.7944697Z processor : 16 2025-05-07T19:43:04.7944923Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7945200Z cpu family : 6 2025-05-07T19:43:04.7945411Z model : 85 2025-05-07T19:43:04.7945721Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7946083Z stepping : 7 2025-05-07T19:43:04.7946313Z microcode : 0x5003901 2025-05-07T19:43:04.7946531Z cpu MHz : 2999.996 2025-05-07T19:43:04.7946751Z cache size : 36608 KB 2025-05-07T19:43:04.7946982Z physical id : 0 2025-05-07T19:43:04.7947184Z siblings : 48 2025-05-07T19:43:04.7947394Z core id : 16 2025-05-07T19:43:04.7947606Z cpu cores : 24 2025-05-07T19:43:04.7947843Z apicid : 32 2025-05-07T19:43:04.7948060Z initial apicid : 32 2025-05-07T19:43:04.7948309Z fpu : yes 2025-05-07T19:43:04.7948517Z fpu_exception : yes 2025-05-07T19:43:04.7948831Z cpuid level : 13 2025-05-07T19:43:04.7949051Z wp : yes 2025-05-07T19:43:04.7951350Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7954000Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7954603Z bogomips : 5999.99 2025-05-07T19:43:04.7954866Z clflush size : 64 2025-05-07T19:43:04.7955109Z cache_alignment : 64 2025-05-07T19:43:04.7955377Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7955726Z power management: 2025-05-07T19:43:04.7955864Z 2025-05-07T19:43:04.7955961Z processor : 17 2025-05-07T19:43:04.7956213Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7956464Z cpu family : 6 2025-05-07T19:43:04.7956699Z model : 85 2025-05-07T19:43:04.7956981Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7957371Z stepping : 7 2025-05-07T19:43:04.7957591Z microcode : 0x5003901 2025-05-07T19:43:04.7957852Z cpu MHz : 2999.996 2025-05-07T19:43:04.7958106Z cache size : 36608 KB 2025-05-07T19:43:04.7958343Z physical id : 0 2025-05-07T19:43:04.7958589Z siblings : 48 2025-05-07T19:43:04.7958801Z core id : 17 2025-05-07T19:43:04.7959042Z cpu cores : 24 2025-05-07T19:43:04.7959258Z apicid : 34 2025-05-07T19:43:04.7959550Z initial apicid : 34 2025-05-07T19:43:04.7959782Z fpu : yes 2025-05-07T19:43:04.7960022Z fpu_exception : yes 2025-05-07T19:43:04.7960255Z cpuid level : 13 2025-05-07T19:43:04.7960566Z wp : yes 2025-05-07T19:43:04.7962863Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7965485Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7966104Z bogomips : 5999.99 2025-05-07T19:43:04.7966360Z clflush size : 64 2025-05-07T19:43:04.7966592Z cache_alignment : 64 2025-05-07T19:43:04.7966895Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7967235Z power management: 2025-05-07T19:43:04.7967381Z 2025-05-07T19:43:04.7967504Z processor : 18 2025-05-07T19:43:04.7967737Z vendor_id : GenuineIntel 2025-05-07T19:43:04.7968015Z cpu family : 6 2025-05-07T19:43:04.7968235Z model : 85 2025-05-07T19:43:04.7968547Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.7968915Z stepping : 7 2025-05-07T19:43:04.7969165Z microcode : 0x5003901 2025-05-07T19:43:04.7969406Z cpu MHz : 2999.996 2025-05-07T19:43:04.7969656Z cache size : 36608 KB 2025-05-07T19:43:04.7969922Z physical id : 0 2025-05-07T19:43:04.7970142Z siblings : 48 2025-05-07T19:43:04.7970377Z core id : 18 2025-05-07T19:43:04.7970591Z cpu cores : 24 2025-05-07T19:43:04.7970839Z apicid : 36 2025-05-07T19:43:04.7971053Z initial apicid : 36 2025-05-07T19:43:04.7971305Z fpu : yes 2025-05-07T19:43:04.7971521Z fpu_exception : yes 2025-05-07T19:43:04.7971786Z cpuid level : 13 2025-05-07T19:43:04.7972075Z wp : yes 2025-05-07T19:43:04.7974379Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.7978949Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.7979659Z bogomips : 5999.99 2025-05-07T19:43:04.7979930Z clflush size : 64 2025-05-07T19:43:04.7980205Z cache_alignment : 64 2025-05-07T19:43:04.7980570Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.7980943Z power management: 2025-05-07T19:43:04.7981096Z 2025-05-07T19:43:04.7981192Z processor : 19 2025-05-07T19:43:04.7981468Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8003101Z cpu family : 6 2025-05-07T19:43:04.8003564Z model : 85 2025-05-07T19:43:04.8004207Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8004693Z stepping : 7 2025-05-07T19:43:04.8004934Z microcode : 0x5003901 2025-05-07T19:43:04.8005209Z cpu MHz : 2999.996 2025-05-07T19:43:04.8005445Z cache size : 36608 KB 2025-05-07T19:43:04.8005715Z physical id : 0 2025-05-07T19:43:04.8005945Z siblings : 48 2025-05-07T19:43:04.8006199Z core id : 19 2025-05-07T19:43:04.8006417Z cpu cores : 24 2025-05-07T19:43:04.8006657Z apicid : 38 2025-05-07T19:43:04.8006874Z initial apicid : 38 2025-05-07T19:43:04.8007107Z fpu : yes 2025-05-07T19:43:04.8007319Z fpu_exception : yes 2025-05-07T19:43:04.8007718Z cpuid level : 13 2025-05-07T19:43:04.8007968Z wp : yes 2025-05-07T19:43:04.8010234Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8012911Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8013532Z bogomips : 5999.99 2025-05-07T19:43:04.8013772Z clflush size : 64 2025-05-07T19:43:04.8014035Z cache_alignment : 64 2025-05-07T19:43:04.8014329Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8014699Z power management: 2025-05-07T19:43:04.8014845Z 2025-05-07T19:43:04.8014948Z processor : 20 2025-05-07T19:43:04.8015195Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8015446Z cpu family : 6 2025-05-07T19:43:04.8015682Z model : 85 2025-05-07T19:43:04.8015969Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8016357Z stepping : 7 2025-05-07T19:43:04.8016580Z microcode : 0x5003901 2025-05-07T19:43:04.8016833Z cpu MHz : 2999.996 2025-05-07T19:43:04.8017161Z cache size : 36608 KB 2025-05-07T19:43:04.8017408Z physical id : 0 2025-05-07T19:43:04.8017616Z siblings : 48 2025-05-07T19:43:04.8017838Z core id : 20 2025-05-07T19:43:04.8018037Z cpu cores : 24 2025-05-07T19:43:04.8018266Z apicid : 40 2025-05-07T19:43:04.8018503Z initial apicid : 40 2025-05-07T19:43:04.8018723Z fpu : yes 2025-05-07T19:43:04.8018938Z fpu_exception : yes 2025-05-07T19:43:04.8019249Z cpuid level : 13 2025-05-07T19:43:04.8019496Z wp : yes 2025-05-07T19:43:04.8021920Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8024944Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8025546Z bogomips : 5999.99 2025-05-07T19:43:04.8025768Z clflush size : 64 2025-05-07T19:43:04.8026014Z cache_alignment : 64 2025-05-07T19:43:04.8026295Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8026642Z power management: 2025-05-07T19:43:04.8026777Z 2025-05-07T19:43:04.8026890Z processor : 21 2025-05-07T19:43:04.8027106Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8027358Z cpu family : 6 2025-05-07T19:43:04.8027552Z model : 85 2025-05-07T19:43:04.8027822Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8028176Z stepping : 7 2025-05-07T19:43:04.8028398Z microcode : 0x5003901 2025-05-07T19:43:04.8028612Z cpu MHz : 2999.996 2025-05-07T19:43:04.8028820Z cache size : 36608 KB 2025-05-07T19:43:04.8029041Z physical id : 0 2025-05-07T19:43:04.8029265Z siblings : 48 2025-05-07T19:43:04.8029469Z core id : 21 2025-05-07T19:43:04.8029684Z cpu cores : 24 2025-05-07T19:43:04.8029896Z apicid : 42 2025-05-07T19:43:04.8030096Z initial apicid : 42 2025-05-07T19:43:04.8030326Z fpu : yes 2025-05-07T19:43:04.8030528Z fpu_exception : yes 2025-05-07T19:43:04.8030756Z cpuid level : 13 2025-05-07T19:43:04.8030961Z wp : yes 2025-05-07T19:43:04.8033316Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8036009Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8036571Z bogomips : 5999.99 2025-05-07T19:43:04.8036799Z clflush size : 64 2025-05-07T19:43:04.8037013Z cache_alignment : 64 2025-05-07T19:43:04.8037296Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8037621Z power management: 2025-05-07T19:43:04.8037770Z 2025-05-07T19:43:04.8037855Z processor : 22 2025-05-07T19:43:04.8038088Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8038322Z cpu family : 6 2025-05-07T19:43:04.8038544Z model : 85 2025-05-07T19:43:04.8038810Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8039172Z stepping : 7 2025-05-07T19:43:04.8039373Z microcode : 0x5003901 2025-05-07T19:43:04.8039607Z cpu MHz : 2999.996 2025-05-07T19:43:04.8039820Z cache size : 36608 KB 2025-05-07T19:43:04.8040054Z physical id : 0 2025-05-07T19:43:04.8040259Z siblings : 48 2025-05-07T19:43:04.8040466Z core id : 22 2025-05-07T19:43:04.8040663Z cpu cores : 24 2025-05-07T19:43:04.8040864Z apicid : 44 2025-05-07T19:43:04.8041083Z initial apicid : 44 2025-05-07T19:43:04.8041289Z fpu : yes 2025-05-07T19:43:04.8041500Z fpu_exception : yes 2025-05-07T19:43:04.8041713Z cpuid level : 13 2025-05-07T19:43:04.8041925Z wp : yes 2025-05-07T19:43:04.8044116Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8046735Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8047288Z bogomips : 5999.99 2025-05-07T19:43:04.8047489Z clflush size : 64 2025-05-07T19:43:04.8047712Z cache_alignment : 64 2025-05-07T19:43:04.8047969Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8048295Z power management: 2025-05-07T19:43:04.8048420Z 2025-05-07T19:43:04.8048526Z processor : 23 2025-05-07T19:43:04.8048727Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8048969Z cpu family : 6 2025-05-07T19:43:04.8049163Z model : 85 2025-05-07T19:43:04.8049432Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8049760Z stepping : 7 2025-05-07T19:43:04.8049966Z microcode : 0x5003901 2025-05-07T19:43:04.8050182Z cpu MHz : 2999.996 2025-05-07T19:43:04.8050402Z cache size : 36608 KB 2025-05-07T19:43:04.8050612Z physical id : 0 2025-05-07T19:43:04.8050827Z siblings : 48 2025-05-07T19:43:04.8051014Z core id : 23 2025-05-07T19:43:04.8051219Z cpu cores : 24 2025-05-07T19:43:04.8051424Z apicid : 46 2025-05-07T19:43:04.8051611Z initial apicid : 46 2025-05-07T19:43:04.8051825Z fpu : yes 2025-05-07T19:43:04.8052017Z fpu_exception : yes 2025-05-07T19:43:04.8052238Z cpuid level : 13 2025-05-07T19:43:04.8052427Z wp : yes 2025-05-07T19:43:04.8054571Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8057013Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8057559Z bogomips : 5999.99 2025-05-07T19:43:04.8057785Z clflush size : 64 2025-05-07T19:43:04.8057996Z cache_alignment : 64 2025-05-07T19:43:04.8058279Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8058585Z power management: 2025-05-07T19:43:04.8058729Z 2025-05-07T19:43:04.8058810Z processor : 24 2025-05-07T19:43:04.8059042Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8059337Z cpu family : 6 2025-05-07T19:43:04.8059553Z model : 85 2025-05-07T19:43:04.8060000Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8060363Z stepping : 7 2025-05-07T19:43:04.8060645Z microcode : 0x5003901 2025-05-07T19:43:04.8060873Z cpu MHz : 3255.437 2025-05-07T19:43:04.8061090Z cache size : 36608 KB 2025-05-07T19:43:04.8061332Z physical id : 1 2025-05-07T19:43:04.8061535Z siblings : 48 2025-05-07T19:43:04.8061746Z core id : 0 2025-05-07T19:43:04.8061955Z cpu cores : 24 2025-05-07T19:43:04.8062151Z apicid : 64 2025-05-07T19:43:04.8062363Z initial apicid : 64 2025-05-07T19:43:04.8062577Z fpu : yes 2025-05-07T19:43:04.8062786Z fpu_exception : yes 2025-05-07T19:43:04.8063001Z cpuid level : 13 2025-05-07T19:43:04.8063223Z wp : yes 2025-05-07T19:43:04.8065474Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8068164Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8068823Z bogomips : 5999.99 2025-05-07T19:43:04.8069033Z clflush size : 64 2025-05-07T19:43:04.8069265Z cache_alignment : 64 2025-05-07T19:43:04.8069530Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8069860Z power management: 2025-05-07T19:43:04.8069992Z 2025-05-07T19:43:04.8070088Z processor : 25 2025-05-07T19:43:04.8070305Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8070558Z cpu family : 6 2025-05-07T19:43:04.8070764Z model : 85 2025-05-07T19:43:04.8071032Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8071376Z stepping : 7 2025-05-07T19:43:04.8071576Z microcode : 0x5003901 2025-05-07T19:43:04.8071896Z cpu MHz : 3235.602 2025-05-07T19:43:04.8072085Z cache size : 36608 KB 2025-05-07T19:43:04.8072280Z physical id : 1 2025-05-07T19:43:04.8072467Z siblings : 48 2025-05-07T19:43:04.8072640Z core id : 1 2025-05-07T19:43:04.8072827Z cpu cores : 24 2025-05-07T19:43:04.8073002Z apicid : 66 2025-05-07T19:43:04.8073189Z initial apicid : 66 2025-05-07T19:43:04.8073368Z fpu : yes 2025-05-07T19:43:04.8073549Z fpu_exception : yes 2025-05-07T19:43:04.8073741Z cpuid level : 13 2025-05-07T19:43:04.8073917Z wp : yes 2025-05-07T19:43:04.8076040Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8078467Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8078996Z bogomips : 5999.99 2025-05-07T19:43:04.8079191Z clflush size : 64 2025-05-07T19:43:04.8079383Z cache_alignment : 64 2025-05-07T19:43:04.8079625Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8079908Z power management: 2025-05-07T19:43:04.8080031Z 2025-05-07T19:43:04.8080109Z processor : 26 2025-05-07T19:43:04.8080300Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8080517Z cpu family : 6 2025-05-07T19:43:04.8080701Z model : 85 2025-05-07T19:43:04.8080934Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8081248Z stepping : 7 2025-05-07T19:43:04.8081430Z microcode : 0x5003901 2025-05-07T19:43:04.8081644Z cpu MHz : 2999.996 2025-05-07T19:43:04.8081849Z cache size : 36608 KB 2025-05-07T19:43:04.8082060Z physical id : 1 2025-05-07T19:43:04.8082248Z siblings : 48 2025-05-07T19:43:04.8082440Z core id : 2 2025-05-07T19:43:04.8082619Z cpu cores : 24 2025-05-07T19:43:04.8082822Z apicid : 68 2025-05-07T19:43:04.8083009Z initial apicid : 68 2025-05-07T19:43:04.8083216Z fpu : yes 2025-05-07T19:43:04.8083399Z fpu_exception : yes 2025-05-07T19:43:04.8083588Z cpuid level : 13 2025-05-07T19:43:04.8083769Z wp : yes 2025-05-07T19:43:04.8085847Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8088297Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8088828Z bogomips : 5999.99 2025-05-07T19:43:04.8089028Z clflush size : 64 2025-05-07T19:43:04.8089245Z cache_alignment : 64 2025-05-07T19:43:04.8089491Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8089798Z power management: 2025-05-07T19:43:04.8089923Z 2025-05-07T19:43:04.8090008Z processor : 27 2025-05-07T19:43:04.8090213Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8090438Z cpu family : 6 2025-05-07T19:43:04.8090632Z model : 85 2025-05-07T19:43:04.8090884Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8091199Z stepping : 7 2025-05-07T19:43:04.8091392Z microcode : 0x5003901 2025-05-07T19:43:04.8091605Z cpu MHz : 3264.105 2025-05-07T19:43:04.8091798Z cache size : 36608 KB 2025-05-07T19:43:04.8091997Z physical id : 1 2025-05-07T19:43:04.8092197Z siblings : 48 2025-05-07T19:43:04.8092381Z core id : 3 2025-05-07T19:43:04.8092573Z cpu cores : 24 2025-05-07T19:43:04.8092754Z apicid : 70 2025-05-07T19:43:04.8092951Z initial apicid : 70 2025-05-07T19:43:04.8093144Z fpu : yes 2025-05-07T19:43:04.8093337Z fpu_exception : yes 2025-05-07T19:43:04.8093543Z cpuid level : 13 2025-05-07T19:43:04.8093731Z wp : yes 2025-05-07T19:43:04.8095887Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8098324Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8098858Z bogomips : 5999.99 2025-05-07T19:43:04.8099072Z clflush size : 64 2025-05-07T19:43:04.8099343Z cache_alignment : 64 2025-05-07T19:43:04.8099770Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8100089Z power management: 2025-05-07T19:43:04.8100242Z 2025-05-07T19:43:04.8100326Z processor : 28 2025-05-07T19:43:04.8100630Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8100872Z cpu family : 6 2025-05-07T19:43:04.8101082Z model : 85 2025-05-07T19:43:04.8101351Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8101712Z stepping : 7 2025-05-07T19:43:04.8101919Z microcode : 0x5003901 2025-05-07T19:43:04.8102140Z cpu MHz : 3230.983 2025-05-07T19:43:04.8102365Z cache size : 36608 KB 2025-05-07T19:43:04.8102598Z physical id : 1 2025-05-07T19:43:04.8102802Z siblings : 48 2025-05-07T19:43:04.8103007Z core id : 4 2025-05-07T19:43:04.8103202Z cpu cores : 24 2025-05-07T19:43:04.8103405Z apicid : 72 2025-05-07T19:43:04.8103603Z initial apicid : 72 2025-05-07T19:43:04.8103822Z fpu : yes 2025-05-07T19:43:04.8104031Z fpu_exception : yes 2025-05-07T19:43:04.8104251Z cpuid level : 13 2025-05-07T19:43:04.8104455Z wp : yes 2025-05-07T19:43:04.8106727Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8109413Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8110008Z bogomips : 5999.99 2025-05-07T19:43:04.8110221Z clflush size : 64 2025-05-07T19:43:04.8110445Z cache_alignment : 64 2025-05-07T19:43:04.8110711Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8111046Z power management: 2025-05-07T19:43:04.8111178Z 2025-05-07T19:43:04.8111264Z processor : 29 2025-05-07T19:43:04.8111491Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8111747Z cpu family : 6 2025-05-07T19:43:04.8112041Z model : 85 2025-05-07T19:43:04.8112288Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8112607Z stepping : 7 2025-05-07T19:43:04.8112799Z microcode : 0x5003901 2025-05-07T19:43:04.8112994Z cpu MHz : 3196.269 2025-05-07T19:43:04.8113193Z cache size : 36608 KB 2025-05-07T19:43:04.8113397Z physical id : 1 2025-05-07T19:43:04.8113589Z siblings : 48 2025-05-07T19:43:04.8113767Z core id : 5 2025-05-07T19:43:04.8113960Z cpu cores : 24 2025-05-07T19:43:04.8114172Z apicid : 74 2025-05-07T19:43:04.8114398Z initial apicid : 74 2025-05-07T19:43:04.8114621Z fpu : yes 2025-05-07T19:43:04.8114856Z fpu_exception : yes 2025-05-07T19:43:04.8115104Z cpuid level : 13 2025-05-07T19:43:04.8115321Z wp : yes 2025-05-07T19:43:04.8117526Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8119980Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8120548Z bogomips : 5999.99 2025-05-07T19:43:04.8120798Z clflush size : 64 2025-05-07T19:43:04.8121026Z cache_alignment : 64 2025-05-07T19:43:04.8121327Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8121645Z power management: 2025-05-07T19:43:04.8121799Z 2025-05-07T19:43:04.8121887Z processor : 30 2025-05-07T19:43:04.8122280Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8122719Z cpu family : 6 2025-05-07T19:43:04.8122953Z model : 85 2025-05-07T19:43:04.8123296Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8123687Z stepping : 7 2025-05-07T19:43:04.8123908Z microcode : 0x5003901 2025-05-07T19:43:04.8124172Z cpu MHz : 2999.996 2025-05-07T19:43:04.8124402Z cache size : 36608 KB 2025-05-07T19:43:04.8124667Z physical id : 1 2025-05-07T19:43:04.8124896Z siblings : 48 2025-05-07T19:43:04.8125142Z core id : 6 2025-05-07T19:43:04.8125359Z cpu cores : 24 2025-05-07T19:43:04.8125600Z apicid : 76 2025-05-07T19:43:04.8125819Z initial apicid : 76 2025-05-07T19:43:04.8126071Z fpu : yes 2025-05-07T19:43:04.8126308Z fpu_exception : yes 2025-05-07T19:43:04.8126539Z cpuid level : 13 2025-05-07T19:43:04.8126782Z wp : yes 2025-05-07T19:43:04.8129047Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8131798Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8132420Z bogomips : 5999.99 2025-05-07T19:43:04.8132662Z clflush size : 64 2025-05-07T19:43:04.8132931Z cache_alignment : 64 2025-05-07T19:43:04.8133224Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8133599Z power management: 2025-05-07T19:43:04.8133738Z 2025-05-07T19:43:04.8133836Z processor : 31 2025-05-07T19:43:04.8134094Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8134371Z cpu family : 6 2025-05-07T19:43:04.8134587Z model : 85 2025-05-07T19:43:04.8134896Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8135464Z stepping : 7 2025-05-07T19:43:04.8135691Z microcode : 0x5003901 2025-05-07T19:43:04.8135918Z cpu MHz : 2999.996 2025-05-07T19:43:04.8136157Z cache size : 36608 KB 2025-05-07T19:43:04.8136382Z physical id : 1 2025-05-07T19:43:04.8136609Z siblings : 48 2025-05-07T19:43:04.8136806Z core id : 7 2025-05-07T19:43:04.8137025Z cpu cores : 24 2025-05-07T19:43:04.8137225Z apicid : 78 2025-05-07T19:43:04.8137449Z initial apicid : 78 2025-05-07T19:43:04.8137661Z fpu : yes 2025-05-07T19:43:04.8137883Z fpu_exception : yes 2025-05-07T19:43:04.8138118Z cpuid level : 13 2025-05-07T19:43:04.8138329Z wp : yes 2025-05-07T19:43:04.8140832Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8143496Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8144103Z bogomips : 5999.99 2025-05-07T19:43:04.8144352Z clflush size : 64 2025-05-07T19:43:04.8144583Z cache_alignment : 64 2025-05-07T19:43:04.8144889Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8145227Z power management: 2025-05-07T19:43:04.8145388Z 2025-05-07T19:43:04.8145479Z processor : 32 2025-05-07T19:43:04.8145707Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8145981Z cpu family : 6 2025-05-07T19:43:04.8146214Z model : 85 2025-05-07T19:43:04.8146499Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8146890Z stepping : 7 2025-05-07T19:43:04.8147108Z microcode : 0x5003901 2025-05-07T19:43:04.8147371Z cpu MHz : 2999.996 2025-05-07T19:43:04.8147605Z cache size : 36608 KB 2025-05-07T19:43:04.8147875Z physical id : 1 2025-05-07T19:43:04.8148102Z siblings : 48 2025-05-07T19:43:04.8148347Z core id : 8 2025-05-07T19:43:04.8148563Z cpu cores : 24 2025-05-07T19:43:04.8148803Z apicid : 80 2025-05-07T19:43:04.8149014Z initial apicid : 80 2025-05-07T19:43:04.8149263Z fpu : yes 2025-05-07T19:43:04.8149494Z fpu_exception : yes 2025-05-07T19:43:04.8149726Z cpuid level : 13 2025-05-07T19:43:04.8149971Z wp : yes 2025-05-07T19:43:04.8152334Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8154790Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8155402Z bogomips : 5999.99 2025-05-07T19:43:04.8155605Z clflush size : 64 2025-05-07T19:43:04.8155818Z cache_alignment : 64 2025-05-07T19:43:04.8156085Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8156412Z power management: 2025-05-07T19:43:04.8156536Z 2025-05-07T19:43:04.8156612Z processor : 33 2025-05-07T19:43:04.8156838Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8157069Z cpu family : 6 2025-05-07T19:43:04.8157266Z model : 85 2025-05-07T19:43:04.8157531Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8157860Z stepping : 7 2025-05-07T19:43:04.8158059Z microcode : 0x5003901 2025-05-07T19:43:04.8158263Z cpu MHz : 2999.996 2025-05-07T19:43:04.8158481Z cache size : 36608 KB 2025-05-07T19:43:04.8158691Z physical id : 1 2025-05-07T19:43:04.8158903Z siblings : 48 2025-05-07T19:43:04.8159087Z core id : 9 2025-05-07T19:43:04.8159285Z cpu cores : 24 2025-05-07T19:43:04.8159472Z apicid : 82 2025-05-07T19:43:04.8159679Z initial apicid : 82 2025-05-07T19:43:04.8159884Z fpu : yes 2025-05-07T19:43:04.8160082Z fpu_exception : yes 2025-05-07T19:43:04.8160313Z cpuid level : 13 2025-05-07T19:43:04.8160497Z wp : yes 2025-05-07T19:43:04.8162603Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8165095Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8165635Z bogomips : 5999.99 2025-05-07T19:43:04.8165854Z clflush size : 64 2025-05-07T19:43:04.8166052Z cache_alignment : 64 2025-05-07T19:43:04.8166323Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8166615Z power management: 2025-05-07T19:43:04.8166759Z 2025-05-07T19:43:04.8166839Z processor : 34 2025-05-07T19:43:04.8167037Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8167262Z cpu family : 6 2025-05-07T19:43:04.8167457Z model : 85 2025-05-07T19:43:04.8167715Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8168048Z stepping : 7 2025-05-07T19:43:04.8168245Z microcode : 0x5003901 2025-05-07T19:43:04.8168471Z cpu MHz : 2999.996 2025-05-07T19:43:04.8168660Z cache size : 36608 KB 2025-05-07T19:43:04.8168875Z physical id : 1 2025-05-07T19:43:04.8169069Z siblings : 48 2025-05-07T19:43:04.8169257Z core id : 10 2025-05-07T19:43:04.8169437Z cpu cores : 24 2025-05-07T19:43:04.8169622Z apicid : 84 2025-05-07T19:43:04.8169809Z initial apicid : 84 2025-05-07T19:43:04.8170016Z fpu : yes 2025-05-07T19:43:04.8170223Z fpu_exception : yes 2025-05-07T19:43:04.8170305Z cpuid level : 13 2025-05-07T19:43:04.8170376Z wp : yes 2025-05-07T19:43:04.8172372Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8172759Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8172835Z bogomips : 5999.99 2025-05-07T19:43:04.8172932Z clflush size : 64 2025-05-07T19:43:04.8173072Z cache_alignment : 64 2025-05-07T19:43:04.8173193Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8173276Z power management: 2025-05-07T19:43:04.8173280Z 2025-05-07T19:43:04.8173376Z processor : 35 2025-05-07T19:43:04.8173469Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8173546Z cpu family : 6 2025-05-07T19:43:04.8173640Z model : 85 2025-05-07T19:43:04.8173794Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8173874Z stepping : 7 2025-05-07T19:43:04.8173956Z microcode : 0x5003901 2025-05-07T19:43:04.8174046Z cpu MHz : 2999.996 2025-05-07T19:43:04.8174124Z cache size : 36608 KB 2025-05-07T19:43:04.8174204Z physical id : 1 2025-05-07T19:43:04.8174293Z siblings : 48 2025-05-07T19:43:04.8174370Z core id : 11 2025-05-07T19:43:04.8174446Z cpu cores : 24 2025-05-07T19:43:04.8174520Z apicid : 86 2025-05-07T19:43:04.8174617Z initial apicid : 86 2025-05-07T19:43:04.8174692Z fpu : yes 2025-05-07T19:43:04.8174777Z fpu_exception : yes 2025-05-07T19:43:04.8174854Z cpuid level : 13 2025-05-07T19:43:04.8174947Z wp : yes 2025-05-07T19:43:04.8177103Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8177566Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8177650Z bogomips : 5999.99 2025-05-07T19:43:04.8177735Z clflush size : 64 2025-05-07T19:43:04.8177835Z cache_alignment : 64 2025-05-07T19:43:04.8177967Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8178057Z power management: 2025-05-07T19:43:04.8178061Z 2025-05-07T19:43:04.8178141Z processor : 36 2025-05-07T19:43:04.8178254Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8178341Z cpu family : 6 2025-05-07T19:43:04.8178425Z model : 85 2025-05-07T19:43:04.8178599Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8178681Z stepping : 7 2025-05-07T19:43:04.8178767Z microcode : 0x5003901 2025-05-07T19:43:04.8178847Z cpu MHz : 3216.937 2025-05-07T19:43:04.8178941Z cache size : 36608 KB 2025-05-07T19:43:04.8179024Z physical id : 1 2025-05-07T19:43:04.8179107Z siblings : 48 2025-05-07T19:43:04.8179258Z core id : 12 2025-05-07T19:43:04.8179348Z cpu cores : 24 2025-05-07T19:43:04.8179432Z apicid : 88 2025-05-07T19:43:04.8179531Z initial apicid : 88 2025-05-07T19:43:04.8179787Z fpu : yes 2025-05-07T19:43:04.8179880Z fpu_exception : yes 2025-05-07T19:43:04.8179965Z cpuid level : 13 2025-05-07T19:43:04.8180047Z wp : yes 2025-05-07T19:43:04.8182196Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8182590Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8182696Z bogomips : 5999.99 2025-05-07T19:43:04.8182776Z clflush size : 64 2025-05-07T19:43:04.8182861Z cache_alignment : 64 2025-05-07T19:43:04.8182995Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8183164Z power management: 2025-05-07T19:43:04.8183169Z 2025-05-07T19:43:04.8183252Z processor : 37 2025-05-07T19:43:04.8183348Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8183451Z cpu family : 6 2025-05-07T19:43:04.8183530Z model : 85 2025-05-07T19:43:04.8183688Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8183788Z stepping : 7 2025-05-07T19:43:04.8183877Z microcode : 0x5003901 2025-05-07T19:43:04.8183955Z cpu MHz : 3275.641 2025-05-07T19:43:04.8184042Z cache size : 36608 KB 2025-05-07T19:43:04.8184152Z physical id : 1 2025-05-07T19:43:04.8184229Z siblings : 48 2025-05-07T19:43:04.8184302Z core id : 13 2025-05-07T19:43:04.8184377Z cpu cores : 24 2025-05-07T19:43:04.8184473Z apicid : 90 2025-05-07T19:43:04.8184555Z initial apicid : 90 2025-05-07T19:43:04.8184633Z fpu : yes 2025-05-07T19:43:04.8184730Z fpu_exception : yes 2025-05-07T19:43:04.8184812Z cpuid level : 13 2025-05-07T19:43:04.8184888Z wp : yes 2025-05-07T19:43:04.8187060Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8187449Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8187533Z bogomips : 5999.99 2025-05-07T19:43:04.8187669Z clflush size : 64 2025-05-07T19:43:04.8187765Z cache_alignment : 64 2025-05-07T19:43:04.8187893Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8187979Z power management: 2025-05-07T19:43:04.8188000Z 2025-05-07T19:43:04.8188088Z processor : 38 2025-05-07T19:43:04.8188176Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8188258Z cpu family : 6 2025-05-07T19:43:04.8188353Z model : 85 2025-05-07T19:43:04.8188513Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8188591Z stepping : 7 2025-05-07T19:43:04.8188677Z microcode : 0x5003901 2025-05-07T19:43:04.8188783Z cpu MHz : 3269.726 2025-05-07T19:43:04.8188866Z cache size : 36608 KB 2025-05-07T19:43:04.8188942Z physical id : 1 2025-05-07T19:43:04.8189034Z siblings : 48 2025-05-07T19:43:04.8189120Z core id : 14 2025-05-07T19:43:04.8189204Z cpu cores : 24 2025-05-07T19:43:04.8189284Z apicid : 92 2025-05-07T19:43:04.8189392Z initial apicid : 92 2025-05-07T19:43:04.8189479Z fpu : yes 2025-05-07T19:43:04.8189568Z fpu_exception : yes 2025-05-07T19:43:04.8189672Z cpuid level : 13 2025-05-07T19:43:04.8189753Z wp : yes 2025-05-07T19:43:04.8191982Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8192361Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8192439Z bogomips : 5999.99 2025-05-07T19:43:04.8192519Z clflush size : 64 2025-05-07T19:43:04.8192603Z cache_alignment : 64 2025-05-07T19:43:04.8192720Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8192798Z power management: 2025-05-07T19:43:04.8192849Z 2025-05-07T19:43:04.8192922Z processor : 39 2025-05-07T19:43:04.8193008Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8193075Z cpu family : 6 2025-05-07T19:43:04.8193140Z model : 85 2025-05-07T19:43:04.8193285Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8193360Z stepping : 7 2025-05-07T19:43:04.8193438Z microcode : 0x5003901 2025-05-07T19:43:04.8193508Z cpu MHz : 3109.629 2025-05-07T19:43:04.8193589Z cache size : 36608 KB 2025-05-07T19:43:04.8193666Z physical id : 1 2025-05-07T19:43:04.8193735Z siblings : 48 2025-05-07T19:43:04.8193814Z core id : 15 2025-05-07T19:43:04.8193886Z cpu cores : 24 2025-05-07T19:43:04.8193956Z apicid : 94 2025-05-07T19:43:04.8194030Z initial apicid : 94 2025-05-07T19:43:04.8194108Z fpu : yes 2025-05-07T19:43:04.8194183Z fpu_exception : yes 2025-05-07T19:43:04.8194260Z cpuid level : 13 2025-05-07T19:43:04.8194337Z wp : yes 2025-05-07T19:43:04.8196301Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8196660Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8196746Z bogomips : 5999.99 2025-05-07T19:43:04.8196821Z clflush size : 64 2025-05-07T19:43:04.8196900Z cache_alignment : 64 2025-05-07T19:43:04.8197077Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8197160Z power management: 2025-05-07T19:43:04.8197164Z 2025-05-07T19:43:04.8197246Z processor : 40 2025-05-07T19:43:04.8197338Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8197435Z cpu family : 6 2025-05-07T19:43:04.8197513Z model : 85 2025-05-07T19:43:04.8197664Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8197755Z stepping : 7 2025-05-07T19:43:04.8197838Z microcode : 0x5003901 2025-05-07T19:43:04.8197921Z cpu MHz : 3169.229 2025-05-07T19:43:04.8197998Z cache size : 36608 KB 2025-05-07T19:43:04.8198090Z physical id : 1 2025-05-07T19:43:04.8198173Z siblings : 48 2025-05-07T19:43:04.8198254Z core id : 16 2025-05-07T19:43:04.8198342Z cpu cores : 24 2025-05-07T19:43:04.8198413Z apicid : 96 2025-05-07T19:43:04.8198499Z initial apicid : 96 2025-05-07T19:43:04.8198574Z fpu : yes 2025-05-07T19:43:04.8198662Z fpu_exception : yes 2025-05-07T19:43:04.8198738Z cpuid level : 13 2025-05-07T19:43:04.8198813Z wp : yes 2025-05-07T19:43:04.8200795Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8201155Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8201226Z bogomips : 5999.99 2025-05-07T19:43:04.8201304Z clflush size : 64 2025-05-07T19:43:04.8201381Z cache_alignment : 64 2025-05-07T19:43:04.8201498Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8201590Z power management: 2025-05-07T19:43:04.8201594Z 2025-05-07T19:43:04.8201668Z processor : 41 2025-05-07T19:43:04.8201746Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8201862Z cpu family : 6 2025-05-07T19:43:04.8201941Z model : 85 2025-05-07T19:43:04.8202082Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8202151Z stepping : 7 2025-05-07T19:43:04.8202237Z microcode : 0x5003901 2025-05-07T19:43:04.8202311Z cpu MHz : 2999.996 2025-05-07T19:43:04.8202383Z cache size : 36608 KB 2025-05-07T19:43:04.8202456Z physical id : 1 2025-05-07T19:43:04.8202534Z siblings : 48 2025-05-07T19:43:04.8202604Z core id : 17 2025-05-07T19:43:04.8202673Z cpu cores : 24 2025-05-07T19:43:04.8202750Z apicid : 98 2025-05-07T19:43:04.8202829Z initial apicid : 98 2025-05-07T19:43:04.8202897Z fpu : yes 2025-05-07T19:43:04.8202972Z fpu_exception : yes 2025-05-07T19:43:04.8203049Z cpuid level : 13 2025-05-07T19:43:04.8203115Z wp : yes 2025-05-07T19:43:04.8205082Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8205445Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8205518Z bogomips : 5999.99 2025-05-07T19:43:04.8205591Z clflush size : 64 2025-05-07T19:43:04.8205672Z cache_alignment : 64 2025-05-07T19:43:04.8205788Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8205860Z power management: 2025-05-07T19:43:04.8205907Z 2025-05-07T19:43:04.8205984Z processor : 42 2025-05-07T19:43:04.8206063Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8206133Z cpu family : 6 2025-05-07T19:43:04.8206205Z model : 85 2025-05-07T19:43:04.8206354Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8206427Z stepping : 7 2025-05-07T19:43:04.8206503Z microcode : 0x5003901 2025-05-07T19:43:04.8206580Z cpu MHz : 2999.996 2025-05-07T19:43:04.8206656Z cache size : 36608 KB 2025-05-07T19:43:04.8206731Z physical id : 1 2025-05-07T19:43:04.8206801Z siblings : 48 2025-05-07T19:43:04.8206878Z core id : 18 2025-05-07T19:43:04.8206953Z cpu cores : 24 2025-05-07T19:43:04.8207024Z apicid : 100 2025-05-07T19:43:04.8207112Z initial apicid : 100 2025-05-07T19:43:04.8207180Z fpu : yes 2025-05-07T19:43:04.8207260Z fpu_exception : yes 2025-05-07T19:43:04.8207335Z cpuid level : 13 2025-05-07T19:43:04.8207412Z wp : yes 2025-05-07T19:43:04.8209394Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8209754Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8209836Z bogomips : 5999.99 2025-05-07T19:43:04.8209911Z clflush size : 64 2025-05-07T19:43:04.8209990Z cache_alignment : 64 2025-05-07T19:43:04.8210117Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8210190Z power management: 2025-05-07T19:43:04.8210194Z 2025-05-07T19:43:04.8210271Z processor : 43 2025-05-07T19:43:04.8210365Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8210437Z cpu family : 6 2025-05-07T19:43:04.8210505Z model : 85 2025-05-07T19:43:04.8210707Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8210787Z stepping : 7 2025-05-07T19:43:04.8210867Z microcode : 0x5003901 2025-05-07T19:43:04.8210940Z cpu MHz : 2999.996 2025-05-07T19:43:04.8211026Z cache size : 36608 KB 2025-05-07T19:43:04.8211098Z physical id : 1 2025-05-07T19:43:04.8211166Z siblings : 48 2025-05-07T19:43:04.8211235Z core id : 19 2025-05-07T19:43:04.8211317Z cpu cores : 24 2025-05-07T19:43:04.8211385Z apicid : 102 2025-05-07T19:43:04.8211464Z initial apicid : 102 2025-05-07T19:43:04.8211534Z fpu : yes 2025-05-07T19:43:04.8211619Z fpu_exception : yes 2025-05-07T19:43:04.8211694Z cpuid level : 13 2025-05-07T19:43:04.8211761Z wp : yes 2025-05-07T19:43:04.8213738Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8214095Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8214169Z bogomips : 5999.99 2025-05-07T19:43:04.8214249Z clflush size : 64 2025-05-07T19:43:04.8214326Z cache_alignment : 64 2025-05-07T19:43:04.8214443Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8214527Z power management: 2025-05-07T19:43:04.8214532Z 2025-05-07T19:43:04.8214605Z processor : 44 2025-05-07T19:43:04.8215851Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8215939Z cpu family : 6 2025-05-07T19:43:04.8216011Z model : 85 2025-05-07T19:43:04.8216156Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8216236Z stepping : 7 2025-05-07T19:43:04.8216326Z microcode : 0x5003901 2025-05-07T19:43:04.8216396Z cpu MHz : 2999.996 2025-05-07T19:43:04.8216470Z cache size : 36608 KB 2025-05-07T19:43:04.8216543Z physical id : 1 2025-05-07T19:43:04.8216626Z siblings : 48 2025-05-07T19:43:04.8216693Z core id : 20 2025-05-07T19:43:04.8216764Z cpu cores : 24 2025-05-07T19:43:04.8216846Z apicid : 104 2025-05-07T19:43:04.8216923Z initial apicid : 104 2025-05-07T19:43:04.8216992Z fpu : yes 2025-05-07T19:43:04.8217065Z fpu_exception : yes 2025-05-07T19:43:04.8217144Z cpuid level : 13 2025-05-07T19:43:04.8217212Z wp : yes 2025-05-07T19:43:04.8219248Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8219784Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8219865Z bogomips : 5999.99 2025-05-07T19:43:04.8219943Z clflush size : 64 2025-05-07T19:43:04.8220034Z cache_alignment : 64 2025-05-07T19:43:04.8220159Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8220237Z power management: 2025-05-07T19:43:04.8220242Z 2025-05-07T19:43:04.8220325Z processor : 45 2025-05-07T19:43:04.8220413Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8220496Z cpu family : 6 2025-05-07T19:43:04.8220603Z model : 85 2025-05-07T19:43:04.8220764Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8220912Z stepping : 7 2025-05-07T19:43:04.8220994Z microcode : 0x5003901 2025-05-07T19:43:04.8221080Z cpu MHz : 2999.996 2025-05-07T19:43:04.8221160Z cache size : 36608 KB 2025-05-07T19:43:04.8221244Z physical id : 1 2025-05-07T19:43:04.8221319Z siblings : 48 2025-05-07T19:43:04.8221401Z core id : 21 2025-05-07T19:43:04.8221480Z cpu cores : 24 2025-05-07T19:43:04.8221555Z apicid : 106 2025-05-07T19:43:04.8221643Z initial apicid : 106 2025-05-07T19:43:04.8221716Z fpu : yes 2025-05-07T19:43:04.8221800Z fpu_exception : yes 2025-05-07T19:43:04.8221879Z cpuid level : 13 2025-05-07T19:43:04.8222119Z wp : yes 2025-05-07T19:43:04.8224272Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8224672Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8224750Z bogomips : 5999.99 2025-05-07T19:43:04.8224829Z clflush size : 64 2025-05-07T19:43:04.8224909Z cache_alignment : 64 2025-05-07T19:43:04.8225043Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8225124Z power management: 2025-05-07T19:43:04.8225129Z 2025-05-07T19:43:04.8225207Z processor : 46 2025-05-07T19:43:04.8225299Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8225375Z cpu family : 6 2025-05-07T19:43:04.8225449Z model : 85 2025-05-07T19:43:04.8225707Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8225815Z stepping : 7 2025-05-07T19:43:04.8225910Z microcode : 0x5003901 2025-05-07T19:43:04.8225997Z cpu MHz : 2999.996 2025-05-07T19:43:04.8226089Z cache size : 36608 KB 2025-05-07T19:43:04.8226174Z physical id : 1 2025-05-07T19:43:04.8226265Z siblings : 48 2025-05-07T19:43:04.8226347Z core id : 22 2025-05-07T19:43:04.8226445Z cpu cores : 24 2025-05-07T19:43:04.8226522Z apicid : 108 2025-05-07T19:43:04.8226614Z initial apicid : 108 2025-05-07T19:43:04.8226715Z fpu : yes 2025-05-07T19:43:04.8226804Z fpu_exception : yes 2025-05-07T19:43:04.8226891Z cpuid level : 13 2025-05-07T19:43:04.8226967Z wp : yes 2025-05-07T19:43:04.8229139Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8229532Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8229624Z bogomips : 5999.99 2025-05-07T19:43:04.8229704Z clflush size : 64 2025-05-07T19:43:04.8229785Z cache_alignment : 64 2025-05-07T19:43:04.8229914Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8230018Z power management: 2025-05-07T19:43:04.8230022Z 2025-05-07T19:43:04.8230102Z processor : 47 2025-05-07T19:43:04.8230191Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8230286Z cpu family : 6 2025-05-07T19:43:04.8230377Z model : 85 2025-05-07T19:43:04.8230541Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8230625Z stepping : 7 2025-05-07T19:43:04.8230730Z microcode : 0x5003901 2025-05-07T19:43:04.8230806Z cpu MHz : 3237.433 2025-05-07T19:43:04.8230960Z cache size : 36608 KB 2025-05-07T19:43:04.8231064Z physical id : 1 2025-05-07T19:43:04.8231146Z siblings : 48 2025-05-07T19:43:04.8231232Z core id : 23 2025-05-07T19:43:04.8231315Z cpu cores : 24 2025-05-07T19:43:04.8231422Z apicid : 110 2025-05-07T19:43:04.8231509Z initial apicid : 110 2025-05-07T19:43:04.8231584Z fpu : yes 2025-05-07T19:43:04.8231689Z fpu_exception : yes 2025-05-07T19:43:04.8231772Z cpuid level : 13 2025-05-07T19:43:04.8231858Z wp : yes 2025-05-07T19:43:04.8234112Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8234587Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8234663Z bogomips : 5999.99 2025-05-07T19:43:04.8234767Z clflush size : 64 2025-05-07T19:43:04.8234848Z cache_alignment : 64 2025-05-07T19:43:04.8234968Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8235056Z power management: 2025-05-07T19:43:04.8235060Z 2025-05-07T19:43:04.8235148Z processor : 48 2025-05-07T19:43:04.8235230Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8235305Z cpu family : 6 2025-05-07T19:43:04.8235396Z model : 85 2025-05-07T19:43:04.8235543Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8235662Z stepping : 7 2025-05-07T19:43:04.8235739Z microcode : 0x5003901 2025-05-07T19:43:04.8235833Z cpu MHz : 2999.996 2025-05-07T19:43:04.8235915Z cache size : 36608 KB 2025-05-07T19:43:04.8236001Z physical id : 0 2025-05-07T19:43:04.8236088Z siblings : 48 2025-05-07T19:43:04.8236160Z core id : 0 2025-05-07T19:43:04.8236239Z cpu cores : 24 2025-05-07T19:43:04.8236318Z apicid : 1 2025-05-07T19:43:04.8236413Z initial apicid : 1 2025-05-07T19:43:04.8236480Z fpu : yes 2025-05-07T19:43:04.8236562Z fpu_exception : yes 2025-05-07T19:43:04.8236668Z cpuid level : 13 2025-05-07T19:43:04.8236738Z wp : yes 2025-05-07T19:43:04.8238706Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8239081Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8239163Z bogomips : 5999.99 2025-05-07T19:43:04.8239244Z clflush size : 64 2025-05-07T19:43:04.8239344Z cache_alignment : 64 2025-05-07T19:43:04.8239467Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8239543Z power management: 2025-05-07T19:43:04.8239547Z 2025-05-07T19:43:04.8239634Z processor : 49 2025-05-07T19:43:04.8239731Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8239807Z cpu family : 6 2025-05-07T19:43:04.8239888Z model : 85 2025-05-07T19:43:04.8240049Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8240124Z stepping : 7 2025-05-07T19:43:04.8240209Z microcode : 0x5003901 2025-05-07T19:43:04.8240289Z cpu MHz : 2999.996 2025-05-07T19:43:04.8240380Z cache size : 36608 KB 2025-05-07T19:43:04.8240466Z physical id : 0 2025-05-07T19:43:04.8240592Z siblings : 48 2025-05-07T19:43:04.8240680Z core id : 1 2025-05-07T19:43:04.8240760Z cpu cores : 24 2025-05-07T19:43:04.8240830Z apicid : 3 2025-05-07T19:43:04.8240906Z initial apicid : 3 2025-05-07T19:43:04.8240979Z fpu : yes 2025-05-07T19:43:04.8241055Z fpu_exception : yes 2025-05-07T19:43:04.8241127Z cpuid level : 13 2025-05-07T19:43:04.8241198Z wp : yes 2025-05-07T19:43:04.8243169Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8243534Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8243627Z bogomips : 5999.99 2025-05-07T19:43:04.8243700Z clflush size : 64 2025-05-07T19:43:04.8243774Z cache_alignment : 64 2025-05-07T19:43:04.8243894Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8243970Z power management: 2025-05-07T19:43:04.8243974Z 2025-05-07T19:43:04.8244049Z processor : 50 2025-05-07T19:43:04.8244135Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8244226Z cpu family : 6 2025-05-07T19:43:04.8244298Z model : 85 2025-05-07T19:43:04.8244451Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8244544Z stepping : 7 2025-05-07T19:43:04.8244626Z microcode : 0x5003901 2025-05-07T19:43:04.8244750Z cpu MHz : 2999.996 2025-05-07T19:43:04.8244825Z cache size : 36608 KB 2025-05-07T19:43:04.8244926Z physical id : 0 2025-05-07T19:43:04.8245001Z siblings : 48 2025-05-07T19:43:04.8245075Z core id : 2 2025-05-07T19:43:04.8245146Z cpu cores : 24 2025-05-07T19:43:04.8245253Z apicid : 5 2025-05-07T19:43:04.8245333Z initial apicid : 5 2025-05-07T19:43:04.8245408Z fpu : yes 2025-05-07T19:43:04.8245502Z fpu_exception : yes 2025-05-07T19:43:04.8245581Z cpuid level : 13 2025-05-07T19:43:04.8245659Z wp : yes 2025-05-07T19:43:04.8247653Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8248013Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8248093Z bogomips : 5999.99 2025-05-07T19:43:04.8248193Z clflush size : 64 2025-05-07T19:43:04.8248268Z cache_alignment : 64 2025-05-07T19:43:04.8248393Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8248469Z power management: 2025-05-07T19:43:04.8248490Z 2025-05-07T19:43:04.8248569Z processor : 51 2025-05-07T19:43:04.8248649Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8248730Z cpu family : 6 2025-05-07T19:43:04.8248812Z model : 85 2025-05-07T19:43:04.8248953Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8249026Z stepping : 7 2025-05-07T19:43:04.8249114Z microcode : 0x5003901 2025-05-07T19:43:04.8249197Z cpu MHz : 1486.500 2025-05-07T19:43:04.8249275Z cache size : 36608 KB 2025-05-07T19:43:04.8249357Z physical id : 0 2025-05-07T19:43:04.8249431Z siblings : 48 2025-05-07T19:43:04.8249501Z core id : 3 2025-05-07T19:43:04.8249640Z cpu cores : 24 2025-05-07T19:43:04.8249718Z apicid : 7 2025-05-07T19:43:04.8249810Z initial apicid : 7 2025-05-07T19:43:04.8249881Z fpu : yes 2025-05-07T19:43:04.8249966Z fpu_exception : yes 2025-05-07T19:43:04.8250063Z cpuid level : 13 2025-05-07T19:43:04.8250137Z wp : yes 2025-05-07T19:43:04.8252105Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8252485Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8252565Z bogomips : 5999.99 2025-05-07T19:43:04.8252643Z clflush size : 64 2025-05-07T19:43:04.8252741Z cache_alignment : 64 2025-05-07T19:43:04.8252866Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8252948Z power management: 2025-05-07T19:43:04.8252952Z 2025-05-07T19:43:04.8253042Z processor : 52 2025-05-07T19:43:04.8253133Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8253213Z cpu family : 6 2025-05-07T19:43:04.8253289Z model : 85 2025-05-07T19:43:04.8253449Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8253530Z stepping : 7 2025-05-07T19:43:04.8253616Z microcode : 0x5003901 2025-05-07T19:43:04.8253700Z cpu MHz : 2999.996 2025-05-07T19:43:04.8253792Z cache size : 36608 KB 2025-05-07T19:43:04.8253920Z physical id : 0 2025-05-07T19:43:04.8253994Z siblings : 48 2025-05-07T19:43:04.8254090Z core id : 4 2025-05-07T19:43:04.8254172Z cpu cores : 24 2025-05-07T19:43:04.8254248Z apicid : 9 2025-05-07T19:43:04.8254338Z initial apicid : 9 2025-05-07T19:43:04.8254440Z fpu : yes 2025-05-07T19:43:04.8254530Z fpu_exception : yes 2025-05-07T19:43:04.8254606Z cpuid level : 13 2025-05-07T19:43:04.8254698Z wp : yes 2025-05-07T19:43:04.8256680Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8257037Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8257138Z bogomips : 5999.99 2025-05-07T19:43:04.8257219Z clflush size : 64 2025-05-07T19:43:04.8257301Z cache_alignment : 64 2025-05-07T19:43:04.8257432Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8257516Z power management: 2025-05-07T19:43:04.8257520Z 2025-05-07T19:43:04.8257599Z processor : 53 2025-05-07T19:43:04.8257692Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8257779Z cpu family : 6 2025-05-07T19:43:04.8257859Z model : 85 2025-05-07T19:43:04.8258006Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8258090Z stepping : 7 2025-05-07T19:43:04.8258166Z microcode : 0x5003901 2025-05-07T19:43:04.8258239Z cpu MHz : 1469.916 2025-05-07T19:43:04.8258319Z cache size : 36608 KB 2025-05-07T19:43:04.8258406Z physical id : 0 2025-05-07T19:43:04.8258480Z siblings : 48 2025-05-07T19:43:04.8258556Z core id : 5 2025-05-07T19:43:04.8258647Z cpu cores : 24 2025-05-07T19:43:04.8258726Z apicid : 11 2025-05-07T19:43:04.8258804Z initial apicid : 11 2025-05-07T19:43:04.8258945Z fpu : yes 2025-05-07T19:43:04.8259043Z fpu_exception : yes 2025-05-07T19:43:04.8259122Z cpuid level : 13 2025-05-07T19:43:04.8259271Z wp : yes 2025-05-07T19:43:04.8261577Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8261976Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8262062Z bogomips : 5999.99 2025-05-07T19:43:04.8262189Z clflush size : 64 2025-05-07T19:43:04.8262279Z cache_alignment : 64 2025-05-07T19:43:04.8262412Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8262523Z power management: 2025-05-07T19:43:04.8262527Z 2025-05-07T19:43:04.8262620Z processor : 54 2025-05-07T19:43:04.8262714Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8262799Z cpu family : 6 2025-05-07T19:43:04.8262906Z model : 85 2025-05-07T19:43:04.8263070Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8263152Z stepping : 7 2025-05-07T19:43:04.8263261Z microcode : 0x5003901 2025-05-07T19:43:04.8263341Z cpu MHz : 2999.996 2025-05-07T19:43:04.8263432Z cache size : 36608 KB 2025-05-07T19:43:04.8263516Z physical id : 0 2025-05-07T19:43:04.8263619Z siblings : 48 2025-05-07T19:43:04.8263699Z core id : 6 2025-05-07T19:43:04.8263837Z cpu cores : 24 2025-05-07T19:43:04.8263950Z apicid : 13 2025-05-07T19:43:04.8264036Z initial apicid : 13 2025-05-07T19:43:04.8264116Z fpu : yes 2025-05-07T19:43:04.8264213Z fpu_exception : yes 2025-05-07T19:43:04.8264320Z cpuid level : 13 2025-05-07T19:43:04.8264400Z wp : yes 2025-05-07T19:43:04.8266542Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8266960Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8267044Z bogomips : 5999.99 2025-05-07T19:43:04.8267127Z clflush size : 64 2025-05-07T19:43:04.8267246Z cache_alignment : 64 2025-05-07T19:43:04.8267384Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8267470Z power management: 2025-05-07T19:43:04.8267475Z 2025-05-07T19:43:04.8267579Z processor : 55 2025-05-07T19:43:04.8267675Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8267763Z cpu family : 6 2025-05-07T19:43:04.8267841Z model : 85 2025-05-07T19:43:04.8268022Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8268109Z stepping : 7 2025-05-07T19:43:04.8268195Z microcode : 0x5003901 2025-05-07T19:43:04.8268299Z cpu MHz : 3278.112 2025-05-07T19:43:04.8268388Z cache size : 36608 KB 2025-05-07T19:43:04.8268481Z physical id : 0 2025-05-07T19:43:04.8268561Z siblings : 48 2025-05-07T19:43:04.8268662Z core id : 7 2025-05-07T19:43:04.8268751Z cpu cores : 24 2025-05-07T19:43:04.8268844Z apicid : 15 2025-05-07T19:43:04.8268948Z initial apicid : 15 2025-05-07T19:43:04.8269025Z fpu : yes 2025-05-07T19:43:04.8269114Z fpu_exception : yes 2025-05-07T19:43:04.8269248Z cpuid level : 13 2025-05-07T19:43:04.8269344Z wp : yes 2025-05-07T19:43:04.8271483Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8271982Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8272073Z bogomips : 5999.99 2025-05-07T19:43:04.8272155Z clflush size : 64 2025-05-07T19:43:04.8272240Z cache_alignment : 64 2025-05-07T19:43:04.8272382Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8272463Z power management: 2025-05-07T19:43:04.8272467Z 2025-05-07T19:43:04.8272547Z processor : 56 2025-05-07T19:43:04.8272664Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8272742Z cpu family : 6 2025-05-07T19:43:04.8272816Z model : 85 2025-05-07T19:43:04.8272968Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8273063Z stepping : 7 2025-05-07T19:43:04.8273141Z microcode : 0x5003901 2025-05-07T19:43:04.8273217Z cpu MHz : 1444.388 2025-05-07T19:43:04.8273312Z cache size : 36608 KB 2025-05-07T19:43:04.8273388Z physical id : 0 2025-05-07T19:43:04.8273468Z siblings : 48 2025-05-07T19:43:04.8273548Z core id : 8 2025-05-07T19:43:04.8273632Z cpu cores : 24 2025-05-07T19:43:04.8273709Z apicid : 17 2025-05-07T19:43:04.8273843Z initial apicid : 17 2025-05-07T19:43:04.8273924Z fpu : yes 2025-05-07T19:43:04.8274022Z fpu_exception : yes 2025-05-07T19:43:04.8274100Z cpuid level : 13 2025-05-07T19:43:04.8274180Z wp : yes 2025-05-07T19:43:04.8276159Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8276514Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8276596Z bogomips : 5999.99 2025-05-07T19:43:04.8276695Z clflush size : 64 2025-05-07T19:43:04.8276771Z cache_alignment : 64 2025-05-07T19:43:04.8276895Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8276992Z power management: 2025-05-07T19:43:04.8276997Z 2025-05-07T19:43:04.8277069Z processor : 57 2025-05-07T19:43:04.8277155Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8277243Z cpu family : 6 2025-05-07T19:43:04.8277307Z model : 85 2025-05-07T19:43:04.8277457Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8277531Z stepping : 7 2025-05-07T19:43:04.8277629Z microcode : 0x5003901 2025-05-07T19:43:04.8277709Z cpu MHz : 3572.209 2025-05-07T19:43:04.8277786Z cache size : 36608 KB 2025-05-07T19:43:04.8277861Z physical id : 0 2025-05-07T19:43:04.8277951Z siblings : 48 2025-05-07T19:43:04.8278031Z core id : 9 2025-05-07T19:43:04.8278104Z cpu cores : 24 2025-05-07T19:43:04.8278191Z apicid : 19 2025-05-07T19:43:04.8278273Z initial apicid : 19 2025-05-07T19:43:04.8278351Z fpu : yes 2025-05-07T19:43:04.8278438Z fpu_exception : yes 2025-05-07T19:43:04.8278544Z cpuid level : 13 2025-05-07T19:43:04.8278623Z wp : yes 2025-05-07T19:43:04.8280588Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8281004Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8281089Z bogomips : 5999.99 2025-05-07T19:43:04.8281171Z clflush size : 64 2025-05-07T19:43:04.8281267Z cache_alignment : 64 2025-05-07T19:43:04.8281388Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8281473Z power management: 2025-05-07T19:43:04.8281481Z 2025-05-07T19:43:04.8281578Z processor : 58 2025-05-07T19:43:04.8281667Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8281747Z cpu family : 6 2025-05-07T19:43:04.8281826Z model : 85 2025-05-07T19:43:04.8281994Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8282078Z stepping : 7 2025-05-07T19:43:04.8282160Z microcode : 0x5003901 2025-05-07T19:43:04.8282254Z cpu MHz : 3249.631 2025-05-07T19:43:04.8282335Z cache size : 36608 KB 2025-05-07T19:43:04.8282418Z physical id : 0 2025-05-07T19:43:04.8282495Z siblings : 48 2025-05-07T19:43:04.8282585Z core id : 10 2025-05-07T19:43:04.8282663Z cpu cores : 24 2025-05-07T19:43:04.8282743Z apicid : 21 2025-05-07T19:43:04.8282837Z initial apicid : 21 2025-05-07T19:43:04.8282914Z fpu : yes 2025-05-07T19:43:04.8282996Z fpu_exception : yes 2025-05-07T19:43:04.8283135Z cpuid level : 13 2025-05-07T19:43:04.8283219Z wp : yes 2025-05-07T19:43:04.8285195Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8285576Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8285655Z bogomips : 5999.99 2025-05-07T19:43:04.8285733Z clflush size : 64 2025-05-07T19:43:04.8285818Z cache_alignment : 64 2025-05-07T19:43:04.8285963Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8286052Z power management: 2025-05-07T19:43:04.8286057Z 2025-05-07T19:43:04.8286144Z processor : 59 2025-05-07T19:43:04.8286262Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8286337Z cpu family : 6 2025-05-07T19:43:04.8286411Z model : 85 2025-05-07T19:43:04.8286563Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8286658Z stepping : 7 2025-05-07T19:43:04.8286739Z microcode : 0x5003901 2025-05-07T19:43:04.8286816Z cpu MHz : 2999.996 2025-05-07T19:43:04.8286911Z cache size : 36608 KB 2025-05-07T19:43:04.8286989Z physical id : 0 2025-05-07T19:43:04.8287063Z siblings : 48 2025-05-07T19:43:04.8287139Z core id : 11 2025-05-07T19:43:04.8287233Z cpu cores : 24 2025-05-07T19:43:04.8287313Z apicid : 23 2025-05-07T19:43:04.8287392Z initial apicid : 23 2025-05-07T19:43:04.8287480Z fpu : yes 2025-05-07T19:43:04.8287559Z fpu_exception : yes 2025-05-07T19:43:04.8287635Z cpuid level : 13 2025-05-07T19:43:04.8287708Z wp : yes 2025-05-07T19:43:04.8289704Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8290108Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8290199Z bogomips : 5999.99 2025-05-07T19:43:04.8290277Z clflush size : 64 2025-05-07T19:43:04.8290360Z cache_alignment : 64 2025-05-07T19:43:04.8290481Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8290579Z power management: 2025-05-07T19:43:04.8290583Z 2025-05-07T19:43:04.8290665Z processor : 60 2025-05-07T19:43:04.8290761Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8290853Z cpu family : 6 2025-05-07T19:43:04.8290937Z model : 85 2025-05-07T19:43:04.8291087Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8291166Z stepping : 7 2025-05-07T19:43:04.8291264Z microcode : 0x5003901 2025-05-07T19:43:04.8291348Z cpu MHz : 1618.415 2025-05-07T19:43:04.8291427Z cache size : 36608 KB 2025-05-07T19:43:04.8291523Z physical id : 0 2025-05-07T19:43:04.8291603Z siblings : 48 2025-05-07T19:43:04.8291674Z core id : 12 2025-05-07T19:43:04.8291757Z cpu cores : 24 2025-05-07T19:43:04.8291854Z apicid : 25 2025-05-07T19:43:04.8291939Z initial apicid : 25 2025-05-07T19:43:04.8292012Z fpu : yes 2025-05-07T19:43:04.8292117Z fpu_exception : yes 2025-05-07T19:43:04.8292203Z cpuid level : 13 2025-05-07T19:43:04.8292275Z wp : yes 2025-05-07T19:43:04.8294306Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8294672Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8294758Z bogomips : 5999.99 2025-05-07T19:43:04.8294846Z clflush size : 64 2025-05-07T19:43:04.8294933Z cache_alignment : 64 2025-05-07T19:43:04.8295064Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8295150Z power management: 2025-05-07T19:43:04.8295155Z 2025-05-07T19:43:04.8295241Z processor : 61 2025-05-07T19:43:04.8295330Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8295411Z cpu family : 6 2025-05-07T19:43:04.8295500Z model : 85 2025-05-07T19:43:04.8295648Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8295731Z stepping : 7 2025-05-07T19:43:04.8295816Z microcode : 0x5003901 2025-05-07T19:43:04.8295919Z cpu MHz : 1416.720 2025-05-07T19:43:04.8296006Z cache size : 36608 KB 2025-05-07T19:43:04.8296095Z physical id : 0 2025-05-07T19:43:04.8296208Z siblings : 48 2025-05-07T19:43:04.8296295Z core id : 13 2025-05-07T19:43:04.8296385Z cpu cores : 24 2025-05-07T19:43:04.8296474Z apicid : 27 2025-05-07T19:43:04.8296592Z initial apicid : 27 2025-05-07T19:43:04.8296679Z fpu : yes 2025-05-07T19:43:04.8296774Z fpu_exception : yes 2025-05-07T19:43:04.8296863Z cpuid level : 13 2025-05-07T19:43:04.8296980Z wp : yes 2025-05-07T19:43:04.8298966Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8299483Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8299740Z bogomips : 5999.99 2025-05-07T19:43:04.8299838Z clflush size : 64 2025-05-07T19:43:04.8299965Z cache_alignment : 64 2025-05-07T19:43:04.8300113Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8300213Z power management: 2025-05-07T19:43:04.8300217Z 2025-05-07T19:43:04.8300318Z processor : 62 2025-05-07T19:43:04.8300449Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8300596Z cpu family : 6 2025-05-07T19:43:04.8300687Z model : 85 2025-05-07T19:43:04.8300890Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8300984Z stepping : 7 2025-05-07T19:43:04.8301083Z microcode : 0x5003901 2025-05-07T19:43:04.8301178Z cpu MHz : 1452.088 2025-05-07T19:43:04.8301304Z cache size : 36608 KB 2025-05-07T19:43:04.8301403Z physical id : 0 2025-05-07T19:43:04.8301499Z siblings : 48 2025-05-07T19:43:04.8301619Z core id : 14 2025-05-07T19:43:04.8301715Z cpu cores : 24 2025-05-07T19:43:04.8301814Z apicid : 29 2025-05-07T19:43:04.8301916Z initial apicid : 29 2025-05-07T19:43:04.8302034Z fpu : yes 2025-05-07T19:43:04.8302138Z fpu_exception : yes 2025-05-07T19:43:04.8302233Z cpuid level : 13 2025-05-07T19:43:04.8302326Z wp : yes 2025-05-07T19:43:04.8304561Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8304970Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8305094Z bogomips : 5999.99 2025-05-07T19:43:04.8305191Z clflush size : 64 2025-05-07T19:43:04.8305287Z cache_alignment : 64 2025-05-07T19:43:04.8305435Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8305553Z power management: 2025-05-07T19:43:04.8305558Z 2025-05-07T19:43:04.8305654Z processor : 63 2025-05-07T19:43:04.8305763Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8305885Z cpu family : 6 2025-05-07T19:43:04.8305977Z model : 85 2025-05-07T19:43:04.8306150Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8306275Z stepping : 7 2025-05-07T19:43:04.8306369Z microcode : 0x5003901 2025-05-07T19:43:04.8306460Z cpu MHz : 3247.617 2025-05-07T19:43:04.8306552Z cache size : 36608 KB 2025-05-07T19:43:04.8306670Z physical id : 0 2025-05-07T19:43:04.8306757Z siblings : 48 2025-05-07T19:43:04.8306845Z core id : 15 2025-05-07T19:43:04.8306933Z cpu cores : 24 2025-05-07T19:43:04.8307047Z apicid : 31 2025-05-07T19:43:04.8307142Z initial apicid : 31 2025-05-07T19:43:04.8307230Z fpu : yes 2025-05-07T19:43:04.8307349Z fpu_exception : yes 2025-05-07T19:43:04.8307441Z cpuid level : 13 2025-05-07T19:43:04.8307532Z wp : yes 2025-05-07T19:43:04.8309698Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8310148Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8310244Z bogomips : 5999.99 2025-05-07T19:43:04.8310365Z clflush size : 64 2025-05-07T19:43:04.8310465Z cache_alignment : 64 2025-05-07T19:43:04.8310604Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8310696Z power management: 2025-05-07T19:43:04.8310723Z 2025-05-07T19:43:04.8310816Z processor : 64 2025-05-07T19:43:04.8310917Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8311011Z cpu family : 6 2025-05-07T19:43:04.8311127Z model : 85 2025-05-07T19:43:04.8311302Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8311401Z stepping : 7 2025-05-07T19:43:04.8311497Z microcode : 0x5003901 2025-05-07T19:43:04.8311616Z cpu MHz : 1488.723 2025-05-07T19:43:04.8311715Z cache size : 36608 KB 2025-05-07T19:43:04.8311920Z physical id : 0 2025-05-07T19:43:04.8312038Z siblings : 48 2025-05-07T19:43:04.8312122Z core id : 16 2025-05-07T19:43:04.8312210Z cpu cores : 24 2025-05-07T19:43:04.8312295Z apicid : 33 2025-05-07T19:43:04.8312405Z initial apicid : 33 2025-05-07T19:43:04.8312489Z fpu : yes 2025-05-07T19:43:04.8312582Z fpu_exception : yes 2025-05-07T19:43:04.8312689Z cpuid level : 13 2025-05-07T19:43:04.8312772Z wp : yes 2025-05-07T19:43:04.8314797Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8315194Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8315284Z bogomips : 5999.99 2025-05-07T19:43:04.8315375Z clflush size : 64 2025-05-07T19:43:04.8315488Z cache_alignment : 64 2025-05-07T19:43:04.8315622Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8315711Z power management: 2025-05-07T19:43:04.8315715Z 2025-05-07T19:43:04.8315805Z processor : 65 2025-05-07T19:43:04.8315927Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8316014Z cpu family : 6 2025-05-07T19:43:04.8316104Z model : 85 2025-05-07T19:43:04.8316289Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8316376Z stepping : 7 2025-05-07T19:43:04.8316472Z microcode : 0x5003901 2025-05-07T19:43:04.8316561Z cpu MHz : 1506.166 2025-05-07T19:43:04.8316674Z cache size : 36608 KB 2025-05-07T19:43:04.8316761Z physical id : 0 2025-05-07T19:43:04.8316846Z siblings : 48 2025-05-07T19:43:04.8316954Z core id : 17 2025-05-07T19:43:04.8317040Z cpu cores : 24 2025-05-07T19:43:04.8317124Z apicid : 35 2025-05-07T19:43:04.8317213Z initial apicid : 35 2025-05-07T19:43:04.8317324Z fpu : yes 2025-05-07T19:43:04.8317415Z fpu_exception : yes 2025-05-07T19:43:04.8317502Z cpuid level : 13 2025-05-07T19:43:04.8317610Z wp : yes 2025-05-07T19:43:04.8319592Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8320004Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8320115Z bogomips : 5999.99 2025-05-07T19:43:04.8320201Z clflush size : 64 2025-05-07T19:43:04.8320290Z cache_alignment : 64 2025-05-07T19:43:04.8320445Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8320535Z power management: 2025-05-07T19:43:04.8320540Z 2025-05-07T19:43:04.8320626Z processor : 66 2025-05-07T19:43:04.8320724Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8320831Z cpu family : 6 2025-05-07T19:43:04.8320916Z model : 85 2025-05-07T19:43:04.8321078Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8321187Z stepping : 7 2025-05-07T19:43:04.8321275Z microcode : 0x5003901 2025-05-07T19:43:04.8321368Z cpu MHz : 1609.775 2025-05-07T19:43:04.8321453Z cache size : 36608 KB 2025-05-07T19:43:04.8321562Z physical id : 0 2025-05-07T19:43:04.8321647Z siblings : 48 2025-05-07T19:43:04.8321895Z core id : 18 2025-05-07T19:43:04.8322179Z cpu cores : 24 2025-05-07T19:43:04.8322303Z apicid : 37 2025-05-07T19:43:04.8322415Z initial apicid : 37 2025-05-07T19:43:04.8322666Z fpu : yes 2025-05-07T19:43:04.8322785Z fpu_exception : yes 2025-05-07T19:43:04.8322877Z cpuid level : 13 2025-05-07T19:43:04.8322969Z wp : yes 2025-05-07T19:43:04.8325267Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8325673Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8325768Z bogomips : 5999.99 2025-05-07T19:43:04.8325889Z clflush size : 64 2025-05-07T19:43:04.8325987Z cache_alignment : 64 2025-05-07T19:43:04.8326126Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8326248Z power management: 2025-05-07T19:43:04.8326252Z 2025-05-07T19:43:04.8326343Z processor : 67 2025-05-07T19:43:04.8326442Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8326533Z cpu family : 6 2025-05-07T19:43:04.8326646Z model : 85 2025-05-07T19:43:04.8326816Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8326905Z stepping : 7 2025-05-07T19:43:04.8327021Z microcode : 0x5003901 2025-05-07T19:43:04.8327110Z cpu MHz : 2999.996 2025-05-07T19:43:04.8327205Z cache size : 36608 KB 2025-05-07T19:43:04.8327298Z physical id : 0 2025-05-07T19:43:04.8327407Z siblings : 48 2025-05-07T19:43:04.8327498Z core id : 19 2025-05-07T19:43:04.8327587Z cpu cores : 24 2025-05-07T19:43:04.8327699Z apicid : 39 2025-05-07T19:43:04.8327794Z initial apicid : 39 2025-05-07T19:43:04.8327883Z fpu : yes 2025-05-07T19:43:04.8327978Z fpu_exception : yes 2025-05-07T19:43:04.8328092Z cpuid level : 13 2025-05-07T19:43:04.8328181Z wp : yes 2025-05-07T19:43:04.8330326Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8330819Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8330919Z bogomips : 5999.99 2025-05-07T19:43:04.8331013Z clflush size : 64 2025-05-07T19:43:04.8331134Z cache_alignment : 64 2025-05-07T19:43:04.8331276Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8331375Z power management: 2025-05-07T19:43:04.8331379Z 2025-05-07T19:43:04.8331503Z processor : 68 2025-05-07T19:43:04.8331608Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8331705Z cpu family : 6 2025-05-07T19:43:04.8331795Z model : 85 2025-05-07T19:43:04.8331994Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8332088Z stepping : 7 2025-05-07T19:43:04.8332189Z microcode : 0x5003901 2025-05-07T19:43:04.8332307Z cpu MHz : 1466.792 2025-05-07T19:43:04.8332401Z cache size : 36608 KB 2025-05-07T19:43:04.8332497Z physical id : 0 2025-05-07T19:43:04.8332585Z siblings : 48 2025-05-07T19:43:04.8332696Z core id : 20 2025-05-07T19:43:04.8332789Z cpu cores : 24 2025-05-07T19:43:04.8332879Z apicid : 41 2025-05-07T19:43:04.8333004Z initial apicid : 41 2025-05-07T19:43:04.8333091Z fpu : yes 2025-05-07T19:43:04.8333187Z fpu_exception : yes 2025-05-07T19:43:04.8333279Z cpuid level : 13 2025-05-07T19:43:04.8333390Z wp : yes 2025-05-07T19:43:04.8335751Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8336125Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8336232Z bogomips : 5999.99 2025-05-07T19:43:04.8336323Z clflush size : 64 2025-05-07T19:43:04.8336414Z cache_alignment : 64 2025-05-07T19:43:04.8336568Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8336660Z power management: 2025-05-07T19:43:04.8336664Z 2025-05-07T19:43:04.8336758Z processor : 69 2025-05-07T19:43:04.8336888Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8336979Z cpu family : 6 2025-05-07T19:43:04.8337068Z model : 85 2025-05-07T19:43:04.8337230Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8337350Z stepping : 7 2025-05-07T19:43:04.8337446Z microcode : 0x5003901 2025-05-07T19:43:04.8337542Z cpu MHz : 1465.103 2025-05-07T19:43:04.8337664Z cache size : 36608 KB 2025-05-07T19:43:04.8337758Z physical id : 0 2025-05-07T19:43:04.8337852Z siblings : 48 2025-05-07T19:43:04.8337939Z core id : 21 2025-05-07T19:43:04.8338057Z cpu cores : 24 2025-05-07T19:43:04.8338143Z apicid : 43 2025-05-07T19:43:04.8338239Z initial apicid : 43 2025-05-07T19:43:04.8338325Z fpu : yes 2025-05-07T19:43:04.8338449Z fpu_exception : yes 2025-05-07T19:43:04.8338544Z cpuid level : 13 2025-05-07T19:43:04.8338631Z wp : yes 2025-05-07T19:43:04.8340970Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8341824Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8341918Z bogomips : 5999.99 2025-05-07T19:43:04.8342044Z clflush size : 64 2025-05-07T19:43:04.8342145Z cache_alignment : 64 2025-05-07T19:43:04.8342288Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8342415Z power management: 2025-05-07T19:43:04.8342420Z 2025-05-07T19:43:04.8342519Z processor : 70 2025-05-07T19:43:04.8342626Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8342751Z cpu family : 6 2025-05-07T19:43:04.8342844Z model : 85 2025-05-07T19:43:04.8343022Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8343116Z stepping : 7 2025-05-07T19:43:04.8343235Z microcode : 0x5003901 2025-05-07T19:43:04.8343326Z cpu MHz : 1467.915 2025-05-07T19:43:04.8343429Z cache size : 36608 KB 2025-05-07T19:43:04.8343551Z physical id : 0 2025-05-07T19:43:04.8343646Z siblings : 48 2025-05-07T19:43:04.8343738Z core id : 22 2025-05-07T19:43:04.8343836Z cpu cores : 24 2025-05-07T19:43:04.8343953Z apicid : 45 2025-05-07T19:43:04.8344045Z initial apicid : 45 2025-05-07T19:43:04.8344135Z fpu : yes 2025-05-07T19:43:04.8344231Z fpu_exception : yes 2025-05-07T19:43:04.8344352Z cpuid level : 13 2025-05-07T19:43:04.8344444Z wp : yes 2025-05-07T19:43:04.8346636Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8347059Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8347158Z bogomips : 5999.99 2025-05-07T19:43:04.8347256Z clflush size : 64 2025-05-07T19:43:04.8347374Z cache_alignment : 64 2025-05-07T19:43:04.8347512Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8347611Z power management: 2025-05-07T19:43:04.8347615Z 2025-05-07T19:43:04.8347736Z processor : 71 2025-05-07T19:43:04.8347836Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8347928Z cpu family : 6 2025-05-07T19:43:04.8348019Z model : 85 2025-05-07T19:43:04.8348223Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8348314Z stepping : 7 2025-05-07T19:43:04.8348410Z microcode : 0x5003901 2025-05-07T19:43:04.8348522Z cpu MHz : 1503.950 2025-05-07T19:43:04.8348617Z cache size : 36608 KB 2025-05-07T19:43:04.8348719Z physical id : 0 2025-05-07T19:43:04.8362180Z siblings : 48 2025-05-07T19:43:04.8362314Z core id : 23 2025-05-07T19:43:04.8362396Z cpu cores : 24 2025-05-07T19:43:04.8362497Z apicid : 47 2025-05-07T19:43:04.8362583Z initial apicid : 47 2025-05-07T19:43:04.8362669Z fpu : yes 2025-05-07T19:43:04.8362758Z fpu_exception : yes 2025-05-07T19:43:04.8362837Z cpuid level : 13 2025-05-07T19:43:04.8362989Z wp : yes 2025-05-07T19:43:04.8365014Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8365380Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8365575Z bogomips : 5999.99 2025-05-07T19:43:04.8365675Z clflush size : 64 2025-05-07T19:43:04.8365755Z cache_alignment : 64 2025-05-07T19:43:04.8365878Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8365962Z power management: 2025-05-07T19:43:04.8365987Z 2025-05-07T19:43:04.8366066Z processor : 72 2025-05-07T19:43:04.8366153Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8366227Z cpu family : 6 2025-05-07T19:43:04.8366318Z model : 85 2025-05-07T19:43:04.8366473Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8366557Z stepping : 7 2025-05-07T19:43:04.8366648Z microcode : 0x5003901 2025-05-07T19:43:04.8366744Z cpu MHz : 2999.996 2025-05-07T19:43:04.8366820Z cache size : 36608 KB 2025-05-07T19:43:04.8366896Z physical id : 1 2025-05-07T19:43:04.8366992Z siblings : 48 2025-05-07T19:43:04.8367065Z core id : 0 2025-05-07T19:43:04.8367142Z cpu cores : 24 2025-05-07T19:43:04.8367215Z apicid : 65 2025-05-07T19:43:04.8367307Z initial apicid : 65 2025-05-07T19:43:04.8367379Z fpu : yes 2025-05-07T19:43:04.8367454Z fpu_exception : yes 2025-05-07T19:43:04.8367541Z cpuid level : 13 2025-05-07T19:43:04.8367613Z wp : yes 2025-05-07T19:43:04.8369598Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8370028Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8370111Z bogomips : 5999.99 2025-05-07T19:43:04.8370190Z clflush size : 64 2025-05-07T19:43:04.8370286Z cache_alignment : 64 2025-05-07T19:43:04.8370409Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8370486Z power management: 2025-05-07T19:43:04.8370491Z 2025-05-07T19:43:04.8370564Z processor : 73 2025-05-07T19:43:04.8370671Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8370743Z cpu family : 6 2025-05-07T19:43:04.8370818Z model : 85 2025-05-07T19:43:04.8370986Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8371060Z stepping : 7 2025-05-07T19:43:04.8371134Z microcode : 0x5003901 2025-05-07T19:43:04.8371209Z cpu MHz : 3269.511 2025-05-07T19:43:04.8371296Z cache size : 36608 KB 2025-05-07T19:43:04.8371374Z physical id : 1 2025-05-07T19:43:04.8371447Z siblings : 48 2025-05-07T19:43:04.8371541Z core id : 1 2025-05-07T19:43:04.8371618Z cpu cores : 24 2025-05-07T19:43:04.8371691Z apicid : 67 2025-05-07T19:43:04.8371775Z initial apicid : 67 2025-05-07T19:43:04.8371865Z fpu : yes 2025-05-07T19:43:04.8371950Z fpu_exception : yes 2025-05-07T19:43:04.8372025Z cpuid level : 13 2025-05-07T19:43:04.8372117Z wp : yes 2025-05-07T19:43:04.8374091Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8374455Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8374552Z bogomips : 5999.99 2025-05-07T19:43:04.8374696Z clflush size : 64 2025-05-07T19:43:04.8374774Z cache_alignment : 64 2025-05-07T19:43:04.8374908Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8374985Z power management: 2025-05-07T19:43:04.8374989Z 2025-05-07T19:43:04.8375067Z processor : 74 2025-05-07T19:43:04.8375151Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8375244Z cpu family : 6 2025-05-07T19:43:04.8375319Z model : 85 2025-05-07T19:43:04.8375465Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8375560Z stepping : 7 2025-05-07T19:43:04.8375641Z microcode : 0x5003901 2025-05-07T19:43:04.8375718Z cpu MHz : 3158.139 2025-05-07T19:43:04.8375796Z cache size : 36608 KB 2025-05-07T19:43:04.8375901Z physical id : 1 2025-05-07T19:43:04.8375989Z siblings : 48 2025-05-07T19:43:04.8376065Z core id : 2 2025-05-07T19:43:04.8376152Z cpu cores : 24 2025-05-07T19:43:04.8376226Z apicid : 69 2025-05-07T19:43:04.8376308Z initial apicid : 69 2025-05-07T19:43:04.8376382Z fpu : yes 2025-05-07T19:43:04.8376466Z fpu_exception : yes 2025-05-07T19:43:04.8376540Z cpuid level : 13 2025-05-07T19:43:04.8376609Z wp : yes 2025-05-07T19:43:04.8378595Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8379003Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8379080Z bogomips : 5999.99 2025-05-07T19:43:04.8379247Z clflush size : 64 2025-05-07T19:43:04.8379336Z cache_alignment : 64 2025-05-07T19:43:04.8379463Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8379555Z power management: 2025-05-07T19:43:04.8379560Z 2025-05-07T19:43:04.8379804Z processor : 75 2025-05-07T19:43:04.8379900Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8379981Z cpu family : 6 2025-05-07T19:43:04.8380073Z model : 85 2025-05-07T19:43:04.8380232Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8380320Z stepping : 7 2025-05-07T19:43:04.8380415Z microcode : 0x5003901 2025-05-07T19:43:04.8380553Z cpu MHz : 2999.996 2025-05-07T19:43:04.8380633Z cache size : 36608 KB 2025-05-07T19:43:04.8380716Z physical id : 1 2025-05-07T19:43:04.8380799Z siblings : 48 2025-05-07T19:43:04.8380879Z core id : 3 2025-05-07T19:43:04.8380957Z cpu cores : 24 2025-05-07T19:43:04.8381039Z apicid : 71 2025-05-07T19:43:04.8381130Z initial apicid : 71 2025-05-07T19:43:04.8381208Z fpu : yes 2025-05-07T19:43:04.8381291Z fpu_exception : yes 2025-05-07T19:43:04.8381380Z cpuid level : 13 2025-05-07T19:43:04.8381457Z wp : yes 2025-05-07T19:43:04.8383595Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8383991Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8384072Z bogomips : 5999.99 2025-05-07T19:43:04.8384155Z clflush size : 64 2025-05-07T19:43:04.8384252Z cache_alignment : 64 2025-05-07T19:43:04.8384375Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8384514Z power management: 2025-05-07T19:43:04.8384519Z 2025-05-07T19:43:04.8384621Z processor : 76 2025-05-07T19:43:04.8384710Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8384791Z cpu family : 6 2025-05-07T19:43:04.8384864Z model : 85 2025-05-07T19:43:04.8385037Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8385115Z stepping : 7 2025-05-07T19:43:04.8385194Z microcode : 0x5003901 2025-05-07T19:43:04.8385286Z cpu MHz : 3542.924 2025-05-07T19:43:04.8385366Z cache size : 36608 KB 2025-05-07T19:43:04.8385443Z physical id : 1 2025-05-07T19:43:04.8385524Z siblings : 48 2025-05-07T19:43:04.8385615Z core id : 4 2025-05-07T19:43:04.8385692Z cpu cores : 24 2025-05-07T19:43:04.8385769Z apicid : 73 2025-05-07T19:43:04.8385851Z initial apicid : 73 2025-05-07T19:43:04.8385950Z fpu : yes 2025-05-07T19:43:04.8386033Z fpu_exception : yes 2025-05-07T19:43:04.8386107Z cpuid level : 13 2025-05-07T19:43:04.8386194Z wp : yes 2025-05-07T19:43:04.8388327Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8388714Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8388810Z bogomips : 5999.99 2025-05-07T19:43:04.8388986Z clflush size : 64 2025-05-07T19:43:04.8389071Z cache_alignment : 64 2025-05-07T19:43:04.8389219Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8389308Z power management: 2025-05-07T19:43:04.8389313Z 2025-05-07T19:43:04.8389390Z processor : 77 2025-05-07T19:43:04.8389493Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8389576Z cpu family : 6 2025-05-07T19:43:04.8389651Z model : 85 2025-05-07T19:43:04.8389803Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8389902Z stepping : 7 2025-05-07T19:43:04.8389985Z microcode : 0x5003901 2025-05-07T19:43:04.8390068Z cpu MHz : 3224.179 2025-05-07T19:43:04.8390149Z cache size : 36608 KB 2025-05-07T19:43:04.8390252Z physical id : 1 2025-05-07T19:43:04.8390335Z siblings : 48 2025-05-07T19:43:04.8390410Z core id : 5 2025-05-07T19:43:04.8390498Z cpu cores : 24 2025-05-07T19:43:04.8390576Z apicid : 75 2025-05-07T19:43:04.8390660Z initial apicid : 75 2025-05-07T19:43:04.8390733Z fpu : yes 2025-05-07T19:43:04.8390830Z fpu_exception : yes 2025-05-07T19:43:04.8390909Z cpuid level : 13 2025-05-07T19:43:04.8390984Z wp : yes 2025-05-07T19:43:04.8393131Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8393500Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8393578Z bogomips : 5999.99 2025-05-07T19:43:04.8393662Z clflush size : 64 2025-05-07T19:43:04.8393742Z cache_alignment : 64 2025-05-07T19:43:04.8393867Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8393954Z power management: 2025-05-07T19:43:04.8394004Z 2025-05-07T19:43:04.8394077Z processor : 78 2025-05-07T19:43:04.8394159Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8394233Z cpu family : 6 2025-05-07T19:43:04.8394311Z model : 85 2025-05-07T19:43:04.8394454Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8394529Z stepping : 7 2025-05-07T19:43:04.8394617Z microcode : 0x5003901 2025-05-07T19:43:04.8394691Z cpu MHz : 3243.989 2025-05-07T19:43:04.8394771Z cache size : 36608 KB 2025-05-07T19:43:04.8394848Z physical id : 1 2025-05-07T19:43:04.8394929Z siblings : 48 2025-05-07T19:43:04.8394999Z core id : 6 2025-05-07T19:43:04.8395073Z cpu cores : 24 2025-05-07T19:43:04.8395156Z apicid : 77 2025-05-07T19:43:04.8395229Z initial apicid : 77 2025-05-07T19:43:04.8395301Z fpu : yes 2025-05-07T19:43:04.8395378Z fpu_exception : yes 2025-05-07T19:43:04.8395460Z cpuid level : 13 2025-05-07T19:43:04.8395530Z wp : yes 2025-05-07T19:43:04.8397492Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8397861Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8397935Z bogomips : 5999.99 2025-05-07T19:43:04.8398009Z clflush size : 64 2025-05-07T19:43:04.8398106Z cache_alignment : 64 2025-05-07T19:43:04.8398266Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8398343Z power management: 2025-05-07T19:43:04.8398347Z 2025-05-07T19:43:04.8398437Z processor : 79 2025-05-07T19:43:04.8398521Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8398593Z cpu family : 6 2025-05-07T19:43:04.8398666Z model : 85 2025-05-07T19:43:04.8398819Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8398888Z stepping : 7 2025-05-07T19:43:04.8398967Z microcode : 0x5003901 2025-05-07T19:43:04.8399059Z cpu MHz : 3272.411 2025-05-07T19:43:04.8399132Z cache size : 36608 KB 2025-05-07T19:43:04.8399203Z physical id : 1 2025-05-07T19:43:04.8399274Z siblings : 48 2025-05-07T19:43:04.8399357Z core id : 7 2025-05-07T19:43:04.8399428Z cpu cores : 24 2025-05-07T19:43:04.8399499Z apicid : 79 2025-05-07T19:43:04.8399581Z initial apicid : 79 2025-05-07T19:43:04.8399648Z fpu : yes 2025-05-07T19:43:04.8399724Z fpu_exception : yes 2025-05-07T19:43:04.8399799Z cpuid level : 13 2025-05-07T19:43:04.8399886Z wp : yes 2025-05-07T19:43:04.8401850Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8402216Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8402293Z bogomips : 5999.99 2025-05-07T19:43:04.8402366Z clflush size : 64 2025-05-07T19:43:04.8402448Z cache_alignment : 64 2025-05-07T19:43:04.8402580Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8402657Z power management: 2025-05-07T19:43:04.8402661Z 2025-05-07T19:43:04.8402737Z processor : 80 2025-05-07T19:43:04.8402829Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8402949Z cpu family : 6 2025-05-07T19:43:04.8403016Z model : 85 2025-05-07T19:43:04.8403168Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8403247Z stepping : 7 2025-05-07T19:43:04.8403324Z microcode : 0x5003901 2025-05-07T19:43:04.8403401Z cpu MHz : 3201.155 2025-05-07T19:43:04.8403488Z cache size : 36608 KB 2025-05-07T19:43:04.8403565Z physical id : 1 2025-05-07T19:43:04.8403641Z siblings : 48 2025-05-07T19:43:04.8403708Z core id : 8 2025-05-07T19:43:04.8403785Z cpu cores : 24 2025-05-07T19:43:04.8403856Z apicid : 81 2025-05-07T19:43:04.8403935Z initial apicid : 81 2025-05-07T19:43:04.8404013Z fpu : yes 2025-05-07T19:43:04.8404091Z fpu_exception : yes 2025-05-07T19:43:04.8404163Z cpuid level : 13 2025-05-07T19:43:04.8404236Z wp : yes 2025-05-07T19:43:04.8406226Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8406584Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8406673Z bogomips : 5999.99 2025-05-07T19:43:04.8406747Z clflush size : 64 2025-05-07T19:43:04.8406826Z cache_alignment : 64 2025-05-07T19:43:04.8406943Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8407029Z power management: 2025-05-07T19:43:04.8407078Z 2025-05-07T19:43:04.8407157Z processor : 81 2025-05-07T19:43:04.8407237Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8407314Z cpu family : 6 2025-05-07T19:43:04.8407387Z model : 85 2025-05-07T19:43:04.8407536Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8407614Z stepping : 7 2025-05-07T19:43:04.8407701Z microcode : 0x5003901 2025-05-07T19:43:04.8407772Z cpu MHz : 3232.106 2025-05-07T19:43:04.8407852Z cache size : 36608 KB 2025-05-07T19:43:04.8407943Z physical id : 1 2025-05-07T19:43:04.8408016Z siblings : 48 2025-05-07T19:43:04.8408088Z core id : 9 2025-05-07T19:43:04.8408160Z cpu cores : 24 2025-05-07T19:43:04.8408237Z apicid : 83 2025-05-07T19:43:04.8408313Z initial apicid : 83 2025-05-07T19:43:04.8408381Z fpu : yes 2025-05-07T19:43:04.8408456Z fpu_exception : yes 2025-05-07T19:43:04.8408535Z cpuid level : 13 2025-05-07T19:43:04.8408603Z wp : yes 2025-05-07T19:43:04.8410572Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8410939Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8411013Z bogomips : 5999.99 2025-05-07T19:43:04.8411095Z clflush size : 64 2025-05-07T19:43:04.8411170Z cache_alignment : 64 2025-05-07T19:43:04.8411288Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8411366Z power management: 2025-05-07T19:43:04.8411370Z 2025-05-07T19:43:04.8411461Z processor : 82 2025-05-07T19:43:04.8411539Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8411609Z cpu family : 6 2025-05-07T19:43:04.8411687Z model : 85 2025-05-07T19:43:04.8411888Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8411960Z stepping : 7 2025-05-07T19:43:04.8412035Z microcode : 0x5003901 2025-05-07T19:43:04.8412122Z cpu MHz : 3221.044 2025-05-07T19:43:04.8412194Z cache size : 36608 KB 2025-05-07T19:43:04.8412268Z physical id : 1 2025-05-07T19:43:04.8412355Z siblings : 48 2025-05-07T19:43:04.8412427Z core id : 10 2025-05-07T19:43:04.8412498Z cpu cores : 24 2025-05-07T19:43:04.8412572Z apicid : 85 2025-05-07T19:43:04.8412662Z initial apicid : 85 2025-05-07T19:43:04.8412730Z fpu : yes 2025-05-07T19:43:04.8412811Z fpu_exception : yes 2025-05-07T19:43:04.8412889Z cpuid level : 13 2025-05-07T19:43:04.8412959Z wp : yes 2025-05-07T19:43:04.8414936Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8415301Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8415379Z bogomips : 5999.99 2025-05-07T19:43:04.8415460Z clflush size : 64 2025-05-07T19:43:04.8415536Z cache_alignment : 64 2025-05-07T19:43:04.8415663Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8415740Z power management: 2025-05-07T19:43:04.8415745Z 2025-05-07T19:43:04.8415815Z processor : 83 2025-05-07T19:43:04.8415947Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8416023Z cpu family : 6 2025-05-07T19:43:04.8416091Z model : 85 2025-05-07T19:43:04.8416242Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8416321Z stepping : 7 2025-05-07T19:43:04.8416399Z microcode : 0x5003901 2025-05-07T19:43:04.8416472Z cpu MHz : 3212.721 2025-05-07T19:43:04.8416553Z cache size : 36608 KB 2025-05-07T19:43:04.8416623Z physical id : 1 2025-05-07T19:43:04.8416701Z siblings : 48 2025-05-07T19:43:04.8416792Z core id : 11 2025-05-07T19:43:04.8416865Z cpu cores : 24 2025-05-07T19:43:04.8416937Z apicid : 87 2025-05-07T19:43:04.8417030Z initial apicid : 87 2025-05-07T19:43:04.8417110Z fpu : yes 2025-05-07T19:43:04.8417194Z fpu_exception : yes 2025-05-07T19:43:04.8417267Z cpuid level : 13 2025-05-07T19:43:04.8417352Z wp : yes 2025-05-07T19:43:04.8419408Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8419962Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8420054Z bogomips : 5999.99 2025-05-07T19:43:04.8420139Z clflush size : 64 2025-05-07T19:43:04.8420226Z cache_alignment : 64 2025-05-07T19:43:04.8420360Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8420440Z power management: 2025-05-07T19:43:04.8420444Z 2025-05-07T19:43:04.8420523Z processor : 84 2025-05-07T19:43:04.8420624Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8420706Z cpu family : 6 2025-05-07T19:43:04.8420783Z model : 85 2025-05-07T19:43:04.8420943Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8421034Z stepping : 7 2025-05-07T19:43:04.8421168Z microcode : 0x5003901 2025-05-07T19:43:04.8421246Z cpu MHz : 3242.658 2025-05-07T19:43:04.8421343Z cache size : 36608 KB 2025-05-07T19:43:04.8421432Z physical id : 1 2025-05-07T19:43:04.8421514Z siblings : 48 2025-05-07T19:43:04.8421593Z core id : 12 2025-05-07T19:43:04.8421689Z cpu cores : 24 2025-05-07T19:43:04.8421767Z apicid : 89 2025-05-07T19:43:04.8421849Z initial apicid : 89 2025-05-07T19:43:04.8421926Z fpu : yes 2025-05-07T19:43:04.8422787Z fpu_exception : yes 2025-05-07T19:43:04.8422875Z cpuid level : 13 2025-05-07T19:43:04.8422953Z wp : yes 2025-05-07T19:43:04.8425109Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8425505Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8425588Z bogomips : 5999.99 2025-05-07T19:43:04.8425687Z clflush size : 64 2025-05-07T19:43:04.8425779Z cache_alignment : 64 2025-05-07T19:43:04.8425906Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8426004Z power management: 2025-05-07T19:43:04.8426009Z 2025-05-07T19:43:04.8426093Z processor : 85 2025-05-07T19:43:04.8426187Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8426282Z cpu family : 6 2025-05-07T19:43:04.8426363Z model : 85 2025-05-07T19:43:04.8426632Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8426715Z stepping : 7 2025-05-07T19:43:04.8426805Z microcode : 0x5003901 2025-05-07T19:43:04.8426883Z cpu MHz : 2999.996 2025-05-07T19:43:04.8426963Z cache size : 36608 KB 2025-05-07T19:43:04.8427055Z physical id : 1 2025-05-07T19:43:04.8427130Z siblings : 48 2025-05-07T19:43:04.8427213Z core id : 13 2025-05-07T19:43:04.8427290Z cpu cores : 24 2025-05-07T19:43:04.8427387Z apicid : 91 2025-05-07T19:43:04.8427473Z initial apicid : 91 2025-05-07T19:43:04.8427546Z fpu : yes 2025-05-07T19:43:04.8427628Z fpu_exception : yes 2025-05-07T19:43:04.8427731Z cpuid level : 13 2025-05-07T19:43:04.8427811Z wp : yes 2025-05-07T19:43:04.8429946Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8430360Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8430447Z bogomips : 5999.99 2025-05-07T19:43:04.8430523Z clflush size : 64 2025-05-07T19:43:04.8430629Z cache_alignment : 64 2025-05-07T19:43:04.8430765Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8430848Z power management: 2025-05-07T19:43:04.8430853Z 2025-05-07T19:43:04.8430947Z processor : 86 2025-05-07T19:43:04.8431041Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8431123Z cpu family : 6 2025-05-07T19:43:04.8431195Z model : 85 2025-05-07T19:43:04.8431368Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8431454Z stepping : 7 2025-05-07T19:43:04.8431537Z microcode : 0x5003901 2025-05-07T19:43:04.8431633Z cpu MHz : 3322.738 2025-05-07T19:43:04.8431809Z cache size : 36608 KB 2025-05-07T19:43:04.8431889Z physical id : 1 2025-05-07T19:43:04.8431965Z siblings : 48 2025-05-07T19:43:04.8432059Z core id : 14 2025-05-07T19:43:04.8432151Z cpu cores : 24 2025-05-07T19:43:04.8432227Z apicid : 93 2025-05-07T19:43:04.8432320Z initial apicid : 93 2025-05-07T19:43:04.8432402Z fpu : yes 2025-05-07T19:43:04.8432486Z fpu_exception : yes 2025-05-07T19:43:04.8432565Z cpuid level : 13 2025-05-07T19:43:04.8432662Z wp : yes 2025-05-07T19:43:04.8434985Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8435355Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8435435Z bogomips : 5999.99 2025-05-07T19:43:04.8435512Z clflush size : 64 2025-05-07T19:43:04.8435589Z cache_alignment : 64 2025-05-07T19:43:04.8435726Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8435814Z power management: 2025-05-07T19:43:04.8435818Z 2025-05-07T19:43:04.8435892Z processor : 87 2025-05-07T19:43:04.8435985Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8436059Z cpu family : 6 2025-05-07T19:43:04.8436134Z model : 85 2025-05-07T19:43:04.8436278Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8436420Z stepping : 7 2025-05-07T19:43:04.8436507Z microcode : 0x5003901 2025-05-07T19:43:04.8436579Z cpu MHz : 3134.553 2025-05-07T19:43:04.8436663Z cache size : 36608 KB 2025-05-07T19:43:04.8436743Z physical id : 1 2025-05-07T19:43:04.8436818Z siblings : 48 2025-05-07T19:43:04.8436889Z core id : 15 2025-05-07T19:43:04.8436980Z cpu cores : 24 2025-05-07T19:43:04.8437053Z apicid : 95 2025-05-07T19:43:04.8437128Z initial apicid : 95 2025-05-07T19:43:04.8437214Z fpu : yes 2025-05-07T19:43:04.8437299Z fpu_exception : yes 2025-05-07T19:43:04.8437377Z cpuid level : 13 2025-05-07T19:43:04.8437447Z wp : yes 2025-05-07T19:43:04.8439447Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8439812Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8439902Z bogomips : 5999.99 2025-05-07T19:43:04.8439978Z clflush size : 64 2025-05-07T19:43:04.8440057Z cache_alignment : 64 2025-05-07T19:43:04.8440174Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8440266Z power management: 2025-05-07T19:43:04.8440270Z 2025-05-07T19:43:04.8440351Z processor : 88 2025-05-07T19:43:04.8440437Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8440523Z cpu family : 6 2025-05-07T19:43:04.8440598Z model : 85 2025-05-07T19:43:04.8440746Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8440817Z stepping : 7 2025-05-07T19:43:04.8440917Z microcode : 0x5003901 2025-05-07T19:43:04.8440995Z cpu MHz : 2999.996 2025-05-07T19:43:04.8441071Z cache size : 36608 KB 2025-05-07T19:43:04.8441157Z physical id : 1 2025-05-07T19:43:04.8441282Z siblings : 48 2025-05-07T19:43:04.8441355Z core id : 16 2025-05-07T19:43:04.8441428Z cpu cores : 24 2025-05-07T19:43:04.8441512Z apicid : 97 2025-05-07T19:43:04.8441583Z initial apicid : 97 2025-05-07T19:43:04.8441650Z fpu : yes 2025-05-07T19:43:04.8441738Z fpu_exception : yes 2025-05-07T19:43:04.8441814Z cpuid level : 13 2025-05-07T19:43:04.8441885Z wp : yes 2025-05-07T19:43:04.8443873Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8444234Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8444316Z bogomips : 5999.99 2025-05-07T19:43:04.8444392Z clflush size : 64 2025-05-07T19:43:04.8444465Z cache_alignment : 64 2025-05-07T19:43:04.8444580Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8444659Z power management: 2025-05-07T19:43:04.8444663Z 2025-05-07T19:43:04.8444747Z processor : 89 2025-05-07T19:43:04.8444827Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8444897Z cpu family : 6 2025-05-07T19:43:04.8444978Z model : 85 2025-05-07T19:43:04.8445122Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8445196Z stepping : 7 2025-05-07T19:43:04.8445274Z microcode : 0x5003901 2025-05-07T19:43:04.8445427Z cpu MHz : 3233.555 2025-05-07T19:43:04.8445511Z cache size : 36608 KB 2025-05-07T19:43:04.8445586Z physical id : 1 2025-05-07T19:43:04.8445666Z siblings : 48 2025-05-07T19:43:04.8445744Z core id : 17 2025-05-07T19:43:04.8445820Z cpu cores : 24 2025-05-07T19:43:04.8445895Z apicid : 99 2025-05-07T19:43:04.8445994Z initial apicid : 99 2025-05-07T19:43:04.8446068Z fpu : yes 2025-05-07T19:43:04.8446146Z fpu_exception : yes 2025-05-07T19:43:04.8446244Z cpuid level : 13 2025-05-07T19:43:04.8446313Z wp : yes 2025-05-07T19:43:04.8448291Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8448652Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8448732Z bogomips : 5999.99 2025-05-07T19:43:04.8448801Z clflush size : 64 2025-05-07T19:43:04.8448895Z cache_alignment : 64 2025-05-07T19:43:04.8449018Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8449100Z power management: 2025-05-07T19:43:04.8449104Z 2025-05-07T19:43:04.8449176Z processor : 90 2025-05-07T19:43:04.8449274Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8449352Z cpu family : 6 2025-05-07T19:43:04.8449423Z model : 85 2025-05-07T19:43:04.8449580Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8449653Z stepping : 7 2025-05-07T19:43:04.8449728Z microcode : 0x5003901 2025-05-07T19:43:04.8449800Z cpu MHz : 3413.039 2025-05-07T19:43:04.8449883Z cache size : 36608 KB 2025-05-07T19:43:04.8449954Z physical id : 1 2025-05-07T19:43:04.8450022Z siblings : 48 2025-05-07T19:43:04.8450103Z core id : 18 2025-05-07T19:43:04.8450225Z cpu cores : 24 2025-05-07T19:43:04.8450302Z apicid : 101 2025-05-07T19:43:04.8450378Z initial apicid : 101 2025-05-07T19:43:04.8450464Z fpu : yes 2025-05-07T19:43:04.8450543Z fpu_exception : yes 2025-05-07T19:43:04.8450617Z cpuid level : 13 2025-05-07T19:43:04.8450686Z wp : yes 2025-05-07T19:43:04.8452659Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8453025Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8453109Z bogomips : 5999.99 2025-05-07T19:43:04.8453183Z clflush size : 64 2025-05-07T19:43:04.8453259Z cache_alignment : 64 2025-05-07T19:43:04.8453390Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8453467Z power management: 2025-05-07T19:43:04.8453471Z 2025-05-07T19:43:04.8453548Z processor : 91 2025-05-07T19:43:04.8453627Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8453714Z cpu family : 6 2025-05-07T19:43:04.8453785Z model : 85 2025-05-07T19:43:04.8453930Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8454017Z stepping : 7 2025-05-07T19:43:04.8454089Z microcode : 0x5003901 2025-05-07T19:43:04.8454167Z cpu MHz : 3263.427 2025-05-07T19:43:04.8454240Z cache size : 36608 KB 2025-05-07T19:43:04.8454365Z physical id : 1 2025-05-07T19:43:04.8454435Z siblings : 48 2025-05-07T19:43:04.8454505Z core id : 19 2025-05-07T19:43:04.8454576Z cpu cores : 24 2025-05-07T19:43:04.8454668Z apicid : 103 2025-05-07T19:43:04.8454748Z initial apicid : 103 2025-05-07T19:43:04.8454818Z fpu : yes 2025-05-07T19:43:04.8454908Z fpu_exception : yes 2025-05-07T19:43:04.8454985Z cpuid level : 13 2025-05-07T19:43:04.8455054Z wp : yes 2025-05-07T19:43:04.8457041Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8457399Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8457476Z bogomips : 5999.99 2025-05-07T19:43:04.8457562Z clflush size : 64 2025-05-07T19:43:04.8457645Z cache_alignment : 64 2025-05-07T19:43:04.8457763Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8457838Z power management: 2025-05-07T19:43:04.8457850Z 2025-05-07T19:43:04.8457921Z processor : 92 2025-05-07T19:43:04.8458000Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8458070Z cpu family : 6 2025-05-07T19:43:04.8458148Z model : 85 2025-05-07T19:43:04.8458298Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8458371Z stepping : 7 2025-05-07T19:43:04.8458456Z microcode : 0x5003901 2025-05-07T19:43:04.8458532Z cpu MHz : 3235.124 2025-05-07T19:43:04.8458618Z cache size : 36608 KB 2025-05-07T19:43:04.8458695Z physical id : 1 2025-05-07T19:43:04.8458788Z siblings : 48 2025-05-07T19:43:04.8458858Z core id : 20 2025-05-07T19:43:04.8458929Z cpu cores : 24 2025-05-07T19:43:04.8458998Z apicid : 105 2025-05-07T19:43:04.8459083Z initial apicid : 105 2025-05-07T19:43:04.8459282Z fpu : yes 2025-05-07T19:43:04.8459363Z fpu_exception : yes 2025-05-07T19:43:04.8459446Z cpuid level : 13 2025-05-07T19:43:04.8459517Z wp : yes 2025-05-07T19:43:04.8461803Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8462197Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8462280Z bogomips : 5999.99 2025-05-07T19:43:04.8462360Z clflush size : 64 2025-05-07T19:43:04.8462450Z cache_alignment : 64 2025-05-07T19:43:04.8462575Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8462654Z power management: 2025-05-07T19:43:04.8462658Z 2025-05-07T19:43:04.8462739Z processor : 93 2025-05-07T19:43:04.8462822Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8462894Z cpu family : 6 2025-05-07T19:43:04.8462964Z model : 85 2025-05-07T19:43:04.8463124Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8463200Z stepping : 7 2025-05-07T19:43:04.8463278Z microcode : 0x5003901 2025-05-07T19:43:04.8463352Z cpu MHz : 3193.489 2025-05-07T19:43:04.8463441Z cache size : 36608 KB 2025-05-07T19:43:04.8463517Z physical id : 1 2025-05-07T19:43:04.8463592Z siblings : 48 2025-05-07T19:43:04.8463674Z core id : 21 2025-05-07T19:43:04.8463814Z cpu cores : 24 2025-05-07T19:43:04.8463907Z apicid : 107 2025-05-07T19:43:04.8464008Z initial apicid : 107 2025-05-07T19:43:04.8464122Z fpu : yes 2025-05-07T19:43:04.8464225Z fpu_exception : yes 2025-05-07T19:43:04.8464315Z cpuid level : 13 2025-05-07T19:43:04.8464435Z wp : yes 2025-05-07T19:43:04.8466579Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8466983Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8467106Z bogomips : 5999.99 2025-05-07T19:43:04.8467201Z clflush size : 64 2025-05-07T19:43:04.8467302Z cache_alignment : 64 2025-05-07T19:43:04.8467463Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8467560Z power management: 2025-05-07T19:43:04.8467565Z 2025-05-07T19:43:04.8467659Z processor : 94 2025-05-07T19:43:04.8467766Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8467886Z cpu family : 6 2025-05-07T19:43:04.8467977Z model : 85 2025-05-07T19:43:04.8468149Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8468265Z stepping : 7 2025-05-07T19:43:04.8468356Z microcode : 0x5003901 2025-05-07T19:43:04.8468451Z cpu MHz : 3767.243 2025-05-07T19:43:04.8468546Z cache size : 36608 KB 2025-05-07T19:43:04.8468666Z physical id : 1 2025-05-07T19:43:04.8468759Z siblings : 48 2025-05-07T19:43:04.8468844Z core id : 22 2025-05-07T19:43:04.8468956Z cpu cores : 24 2025-05-07T19:43:04.8469053Z apicid : 109 2025-05-07T19:43:04.8469155Z initial apicid : 109 2025-05-07T19:43:04.8469241Z fpu : yes 2025-05-07T19:43:04.8469363Z fpu_exception : yes 2025-05-07T19:43:04.8469497Z cpuid level : 13 2025-05-07T19:43:04.8469584Z wp : yes 2025-05-07T19:43:04.8471755Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8472244Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8472337Z bogomips : 5999.99 2025-05-07T19:43:04.8472456Z clflush size : 64 2025-05-07T19:43:04.8472548Z cache_alignment : 64 2025-05-07T19:43:04.8472685Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8472797Z power management: 2025-05-07T19:43:04.8472801Z 2025-05-07T19:43:04.8472891Z processor : 95 2025-05-07T19:43:04.8472987Z vendor_id : GenuineIntel 2025-05-07T19:43:04.8473073Z cpu family : 6 2025-05-07T19:43:04.8473184Z model : 85 2025-05-07T19:43:04.8473343Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.8473432Z stepping : 7 2025-05-07T19:43:04.8473544Z microcode : 0x5003901 2025-05-07T19:43:04.8473631Z cpu MHz : 2999.996 2025-05-07T19:43:04.8473719Z cache size : 36608 KB 2025-05-07T19:43:04.8473806Z physical id : 1 2025-05-07T19:43:04.8473915Z siblings : 48 2025-05-07T19:43:04.8473998Z core id : 23 2025-05-07T19:43:04.8474083Z cpu cores : 24 2025-05-07T19:43:04.8474189Z apicid : 111 2025-05-07T19:43:04.8474613Z initial apicid : 111 2025-05-07T19:43:04.8474698Z fpu : yes 2025-05-07T19:43:04.8474788Z fpu_exception : yes 2025-05-07T19:43:04.8474900Z cpuid level : 13 2025-05-07T19:43:04.8474990Z wp : yes 2025-05-07T19:43:04.8476990Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.8477386Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.8477480Z bogomips : 5999.99 2025-05-07T19:43:04.8477571Z clflush size : 64 2025-05-07T19:43:04.8477693Z cache_alignment : 64 2025-05-07T19:43:04.8477826Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.8477910Z power management: 2025-05-07T19:43:04.8477914Z 2025-05-07T19:43:04.8477917Z 2025-05-07T19:43:04.8478039Z ################################################################################ 2025-05-07T19:43:04.8478136Z [INFO] Print PCI info ... 2025-05-07T19:43:04.8478218Z + lspci -v 2025-05-07T19:43:04.8478222Z 2025-05-07T19:43:04.8478452Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2025-05-07T19:43:04.8478568Z Subsystem: Amazon.com, Inc. Device 1237 2025-05-07T19:43:04.8478687Z Flags: bus master, medium devsel, latency 0 2025-05-07T19:43:04.8478692Z 2025-05-07T19:43:04.8478895Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2025-05-07T19:43:04.8478982Z Physical Slot: 1 2025-05-07T19:43:04.8479094Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:04.8479099Z 2025-05-07T19:43:04.8479360Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2025-05-07T19:43:04.8479498Z Physical Slot: 1 2025-05-07T19:43:04.8479626Z Flags: bus master, fast devsel, latency 0, IRQ 9 2025-05-07T19:43:04.8479630Z 2025-05-07T19:43:04.8479906Z 00:03.0 VGA compatible controller: Amazon.com, Inc. Device 1111 (prog-if 00 [VGA controller]) 2025-05-07T19:43:04.8479984Z Physical Slot: 3 2025-05-07T19:43:04.8480096Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:04.8480250Z Memory at c0000000 (32-bit, prefetchable) [size=4M] 2025-05-07T19:43:04.8480370Z Expansion ROM at 000c0000 [disabled] [size=128K] 2025-05-07T19:43:04.8480374Z 2025-05-07T19:43:04.8480672Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller (prog-if 02 [NVM Express]) 2025-05-07T19:43:04.8480781Z Subsystem: Amazon.com, Inc. Device 0000 2025-05-07T19:43:04.8480886Z Physical Slot: 4 2025-05-07T19:43:04.8481010Z Flags: bus master, fast devsel, latency 0, IRQ 11 2025-05-07T19:43:04.8481160Z Memory at c0514000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:04.8481264Z Capabilities: 2025-05-07T19:43:04.8481346Z Kernel driver in use: nvme 2025-05-07T19:43:04.8481350Z 2025-05-07T19:43:04.8481565Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2025-05-07T19:43:04.8481655Z Physical Slot: 5 2025-05-07T19:43:04.8481756Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:04.8481897Z Memory at c0510000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:04.8482019Z Memory at c0400000 (32-bit, prefetchable) [size=1M] 2025-05-07T19:43:04.8482172Z Memory at c0500000 (32-bit, non-prefetchable) [size=64K] 2025-05-07T19:43:04.8482275Z Capabilities: 2025-05-07T19:43:04.8482371Z Kernel driver in use: ena 2025-05-07T19:43:04.8482375Z 2025-05-07T19:43:04.8482379Z 2025-05-07T19:43:04.8482549Z ################################################################################ 2025-05-07T19:43:04.8482654Z [INFO] Print Linux distribution info ... 2025-05-07T19:43:04.8482740Z + uname -a 2025-05-07T19:43:04.8482748Z 2025-05-07T19:43:04.8483133Z Linux 116e1204f840 6.1.130-139.222.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux 2025-05-07T19:43:04.8483138Z 2025-05-07T19:43:04.8483217Z + uname -m 2025-05-07T19:43:04.8483221Z 2025-05-07T19:43:04.8483286Z x86_64 2025-05-07T19:43:04.8483291Z 2025-05-07T19:43:04.8483384Z + cat /proc/version 2025-05-07T19:43:04.8483388Z 2025-05-07T19:43:04.8483948Z Linux version 6.1.130-139.222.amzn2023.x86_64 (mockbuild@ip-10-0-55-76) (gcc (GCC) 11.5.0 20240719 (Red Hat 11.5.0-5), GNU ld version 2.39-6.amzn2023.0.11) #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 2025-05-07T19:43:04.8483953Z 2025-05-07T19:43:04.8484029Z + cat /etc/os-release 2025-05-07T19:43:04.8484033Z 2025-05-07T19:43:04.8484130Z NAME="Amazon Linux" 2025-05-07T19:43:04.8484208Z VERSION="2023" 2025-05-07T19:43:04.8484288Z ID="amzn" 2025-05-07T19:43:04.8484395Z ID_LIKE="fedora" 2025-05-07T19:43:04.8484468Z VERSION_ID="2023" 2025-05-07T19:43:04.8484567Z PLATFORM_ID="platform:al2023" 2025-05-07T19:43:04.8484679Z PRETTY_NAME="Amazon Linux 2023.7.20250428" 2025-05-07T19:43:04.8484767Z ANSI_COLOR="0;33" 2025-05-07T19:43:04.8484889Z CPE_NAME="cpe:2.3:o:amazon:amazon_linux:2023" 2025-05-07T19:43:04.8485070Z HOME_URL="https://aws.amazon.com/linux/amazon-linux-2023/" 2025-05-07T19:43:04.8485249Z DOCUMENTATION_URL="https://docs.aws.amazon.com/linux/" 2025-05-07T19:43:04.8485405Z SUPPORT_URL="https://aws.amazon.com/premiumsupport/" 2025-05-07T19:43:04.8485588Z BUG_REPORT_URL="https://github.com/amazonlinux/amazon-linux-2023" 2025-05-07T19:43:04.8485677Z VENDOR_NAME="AWS" 2025-05-07T19:43:04.8485803Z VENDOR_URL="https://aws.amazon.com/" 2025-05-07T19:43:04.8485894Z SUPPORT_END="2029-06-30" 2025-05-07T19:43:04.8485898Z 2025-05-07T19:43:04.8520795Z ##[group]Run . $PRELUDE; print_gpu_info 2025-05-07T19:43:04.8520964Z . $PRELUDE; print_gpu_info 2025-05-07T19:43:04.8521250Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:04.8521327Z env: 2025-05-07T19:43:04.8521516Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:04.8521615Z BUILD_ENV: build_binary 2025-05-07T19:43:04.8521699Z BUILD_TARGET: default 2025-05-07T19:43:04.8521777Z BUILD_VARIANT: cuda 2025-05-07T19:43:04.8521869Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:04.8522112Z ##[endgroup] 2025-05-07T19:43:05.2481115Z ################################################################################ 2025-05-07T19:43:05.2482234Z [INFO] Printing general display info ... 2025-05-07T19:43:05.2504201Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:05.3452851Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:05.3459654Z /usr/bin/sudo 2025-05-07T19:43:05.3469945Z which: no apt-get in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:05.3474817Z /usr/bin/yum 2025-05-07T19:43:05.3476531Z [INSTALL] Updating system repositories ... 2025-05-07T19:43:05.3502080Z [EXEC] [ATTEMPT 0/3] + sudo yum update -y 2025-05-07T19:43:05.5667641Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:47 2025. 2025-05-07T19:43:05.6632005Z Dependencies resolved. 2025-05-07T19:43:05.6849810Z Nothing to do. 2025-05-07T19:43:05.6851028Z Complete! 2025-05-07T19:43:05.7532803Z [INSTALL] Installing system package(s): hostname lshw ... 2025-05-07T19:43:05.7559316Z [EXEC] [ATTEMPT 0/3] + sudo yum install -y hostname lshw 2025-05-07T19:43:05.9806238Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:47 2025. 2025-05-07T19:43:06.0323725Z Dependencies resolved. 2025-05-07T19:43:06.0489597Z ================================================================================ 2025-05-07T19:43:06.0491069Z Package Arch Version Repository Size 2025-05-07T19:43:06.0492306Z ================================================================================ 2025-05-07T19:43:06.0493214Z Installing: 2025-05-07T19:43:06.0494162Z hostname x86_64 3.23-4.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:43:06.0494642Z lshw x86_64 B.02.19.2-7.amzn2023.0.3 amazonlinux 319 k 2025-05-07T19:43:06.0494944Z 2025-05-07T19:43:06.0495043Z Transaction Summary 2025-05-07T19:43:06.0495327Z ================================================================================ 2025-05-07T19:43:06.0495638Z Install 2 Packages 2025-05-07T19:43:06.0495778Z 2025-05-07T19:43:06.0495908Z Total download size: 347 k 2025-05-07T19:43:06.0496171Z Installed size: 883 k 2025-05-07T19:43:06.0496449Z Downloading Packages: 2025-05-07T19:43:06.3457915Z (1/2): hostname-3.23-4.amzn2023.0.3.x86_64.rpm 1.6 MB/s | 28 kB 00:00 2025-05-07T19:43:06.3565829Z (2/2): lshw-B.02.19.2-7.amzn2023.0.3.x86_64.rpm 11 MB/s | 319 kB 00:00 2025-05-07T19:43:06.3573275Z -------------------------------------------------------------------------------- 2025-05-07T19:43:06.3576231Z Total 1.1 MB/s | 347 kB 00:00 2025-05-07T19:43:06.3813307Z Running transaction check 2025-05-07T19:43:06.3864151Z Transaction check succeeded. 2025-05-07T19:43:06.3865026Z Running transaction test 2025-05-07T19:43:06.4022584Z Transaction test succeeded. 2025-05-07T19:43:06.4023910Z Running transaction 2025-05-07T19:43:06.4309711Z Preparing : 1/1 2025-05-07T19:43:06.4389887Z Installing : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:06.4427804Z Installing : hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:07.4659604Z Running scriptlet: hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:07.4660698Z Verifying : hostname-3.23-4.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:07.5028753Z Verifying : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:07.5029806Z 2025-05-07T19:43:07.5030059Z Installed: 2025-05-07T19:43:07.5031055Z hostname-3.23-4.amzn2023.0.3.x86_64 lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2025-05-07T19:43:07.5032604Z 2025-05-07T19:43:07.5032848Z Complete! 2025-05-07T19:43:07.5410211Z + hostname 2025-05-07T19:43:07.5410540Z 2025-05-07T19:43:07.5419417Z 116e1204f840 2025-05-07T19:43:07.5419982Z 2025-05-07T19:43:07.5420282Z + sudo lshw -C display 2025-05-07T19:43:07.5420747Z 2025-05-07T19:43:07.7391731Z *-display UNCLAIMED 2025-05-07T19:43:07.7392664Z description: VGA compatible controller 2025-05-07T19:43:07.7393465Z product: Amazon.com, Inc. 2025-05-07T19:43:07.7393917Z vendor: Amazon.com, Inc. 2025-05-07T19:43:07.7394226Z physical id: 3 2025-05-07T19:43:07.7394480Z bus info: pci@0000:00:03.0 2025-05-07T19:43:07.7394780Z version: 00 2025-05-07T19:43:07.7395031Z width: 32 bits 2025-05-07T19:43:07.7395403Z clock: 33MHz 2025-05-07T19:43:07.7395656Z capabilities: vga_controller bus_master 2025-05-07T19:43:07.7396000Z configuration: latency=0 2025-05-07T19:43:07.7396329Z resources: memory:c0000000-c03fffff memory:c0000-dffff 2025-05-07T19:43:07.7416754Z 2025-05-07T19:43:07.7417338Z ################################################################################ 2025-05-07T19:43:07.7418445Z [INFO] Printing NVIDIA GPU info ... 2025-05-07T19:43:07.7529781Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:07.7551372Z which: no nvidia-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:07.7552801Z [CHECK] nvidia-smi not found 2025-05-07T19:43:07.7553670Z ################################################################################ 2025-05-07T19:43:07.7554652Z [INFO] Printing AMD GPU info ... 2025-05-07T19:43:07.7662235Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:07.7687534Z which: no rocminfo in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:07.7688043Z [CHECK] rocminfo not found 2025-05-07T19:43:07.7703488Z which: no rocm-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:07.7704063Z [CHECK] rocm-smi not found 2025-05-07T19:43:07.7782319Z ##[group]Run . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:07.7782841Z . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:07.7783455Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:07.7783838Z env: 2025-05-07T19:43:07.7784088Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:07.7784439Z BUILD_ENV: build_binary 2025-05-07T19:43:07.7784706Z BUILD_TARGET: default 2025-05-07T19:43:07.7784983Z BUILD_VARIANT: cuda 2025-05-07T19:43:07.7785265Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:07.7785534Z ##[endgroup] 2025-05-07T19:43:08.2153310Z ################################################################################ 2025-05-07T19:43:08.2154370Z # Setup Miniconda 2025-05-07T19:43:08.2155006Z # 2025-05-07T19:43:08.2179877Z # [2025-05-07T19:43:08.217Z] + setup_miniconda /github/home/miniconda 2025-05-07T19:43:08.2180355Z ################################################################################ 2025-05-07T19:43:08.2180754Z 2025-05-07T19:43:08.2198565Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:08.3054730Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:08.3055298Z + mkdir -p /github/home/miniconda 2025-05-07T19:43:08.3055503Z 2025-05-07T19:43:08.3065818Z 2025-05-07T19:43:08.3066165Z [SETUP] Downloading the Miniconda installer ... 2025-05-07T19:43:08.3090155Z [EXEC] [ATTEMPT 0/3] + wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O miniconda.sh 2025-05-07T19:43:09.4776482Z [SETUP] Installing Miniconda ... 2025-05-07T19:43:09.4777597Z + bash miniconda.sh -b -p /github/home/miniconda -u 2025-05-07T19:43:09.4778349Z 2025-05-07T19:43:09.4924432Z PREFIX=/github/home/miniconda 2025-05-07T19:43:09.8531131Z Unpacking payload ... 2025-05-07T19:43:10.3344034Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:10.9986569Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:12.8639336Z 2025-05-07T19:43:12.8640317Z Installing base environment... 2025-05-07T19:43:12.8640962Z 2025-05-07T19:43:13.8579079Z Preparing transaction: ...working... done 2025-05-07T19:43:16.7270153Z Executing transaction: ...working... done 2025-05-07T19:43:17.2788433Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:17.3494825Z installation finished. 2025-05-07T19:43:17.3501337Z 2025-05-07T19:43:17.3501469Z + rm -f miniconda.sh 2025-05-07T19:43:17.3501642Z 2025-05-07T19:43:17.3679358Z 2025-05-07T19:43:17.3679792Z [SETUP] Reloading the bash configuration ... 2025-05-07T19:43:17.3680190Z + /github/home/miniconda/bin/conda init bash 2025-05-07T19:43:17.3680460Z 2025-05-07T19:43:17.7373292Z no change /github/home/miniconda/condabin/conda 2025-05-07T19:43:17.7373817Z no change /github/home/miniconda/bin/conda 2025-05-07T19:43:17.7374217Z no change /github/home/miniconda/bin/conda-env 2025-05-07T19:43:17.7374613Z no change /github/home/miniconda/bin/activate 2025-05-07T19:43:17.7374989Z no change /github/home/miniconda/bin/deactivate 2025-05-07T19:43:17.7375428Z no change /github/home/miniconda/etc/profile.d/conda.sh 2025-05-07T19:43:17.7375888Z no change /github/home/miniconda/etc/fish/conf.d/conda.fish 2025-05-07T19:43:17.7376361Z no change /github/home/miniconda/shell/condabin/Conda.psm1 2025-05-07T19:43:17.7376942Z no change /github/home/miniconda/shell/condabin/conda-hook.ps1 2025-05-07T19:43:17.7377488Z no change /github/home/miniconda/lib/python3.13/site-packages/xontrib/conda.xsh 2025-05-07T19:43:17.7378461Z no change /github/home/miniconda/etc/profile.d/conda.csh 2025-05-07T19:43:17.7378837Z modified /github/home/.bashrc 2025-05-07T19:43:17.7379022Z 2025-05-07T19:43:17.7379387Z ==> For changes to take effect, close and re-open your current shell. <== 2025-05-07T19:43:17.7379878Z 2025-05-07T19:43:17.7902538Z 2025-05-07T19:43:17.7902912Z + . /github/home/.bashrc 2025-05-07T19:43:17.7903108Z 2025-05-07T19:43:18.5861832Z 2025-05-07T19:43:18.5862780Z [SETUP] Installing libmamba-solver (required since Anaconda 2024.02-1) and libarchive ... 2025-05-07T19:43:18.5893014Z [EXEC] [ATTEMPT 0/3] + conda install --solver=classic -c conda-forge --override-channels -y conda-libmamba-solver libmamba libmambapy libarchive 2025-05-07T19:43:30.4188021Z Collecting package metadata (current_repodata.json): - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:43:31.8616337Z Solving environment: | / - \ | / - \ | / - done 2025-05-07T19:43:31.9507691Z 2025-05-07T19:43:31.9508650Z ## Package Plan ## 2025-05-07T19:43:31.9508886Z 2025-05-07T19:43:31.9509031Z environment location: /github/home/miniconda 2025-05-07T19:43:31.9509324Z 2025-05-07T19:43:31.9509430Z added / updated specs: 2025-05-07T19:43:31.9509709Z - conda-libmamba-solver 2025-05-07T19:43:31.9509994Z - libarchive 2025-05-07T19:43:31.9510211Z - libmamba 2025-05-07T19:43:31.9510432Z - libmambapy 2025-05-07T19:43:31.9510561Z 2025-05-07T19:43:31.9510567Z 2025-05-07T19:43:31.9510708Z The following packages will be downloaded: 2025-05-07T19:43:31.9510932Z 2025-05-07T19:43:31.9511049Z package | build 2025-05-07T19:43:31.9511850Z ---------------------------|----------------- 2025-05-07T19:43:31.9512320Z ca-certificates-2025.4.26 | hbd8a1cb_0 149 KB conda-forge 2025-05-07T19:43:31.9513256Z certifi-2025.4.26 | pyhd8ed1ab_0 154 KB conda-forge 2025-05-07T19:43:31.9513716Z conda-25.3.1 | py313h78bf25f_1 1.1 MB conda-forge 2025-05-07T19:43:31.9514211Z conda-libmamba-solver-25.4.0| pyhd8ed1ab_0 41 KB conda-forge 2025-05-07T19:43:31.9514765Z ------------------------------------------------------------ 2025-05-07T19:43:31.9515248Z Total: 1.4 MB 2025-05-07T19:43:31.9515559Z 2025-05-07T19:43:31.9515676Z The following packages will be UPDATED: 2025-05-07T19:43:31.9515889Z 2025-05-07T19:43:31.9521758Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:43:31.9522858Z conda pkgs/main::conda-25.3.1-py313h06a4308~ --> conda-forge::conda-25.3.1-py313h78bf25f_1 2025-05-07T19:43:31.9523323Z 2025-05-07T19:43:31.9523555Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:43:31.9523897Z 2025-05-07T19:43:31.9524330Z certifi pkgs/main/linux-64::certifi-2025.4.26~ --> conda-forge/noarch::certifi-2025.4.26-pyhd8ed1ab_0 2025-05-07T19:43:31.9525369Z conda-libmamba-so~ pkgs/main::conda-libmamba-solver-25.4~ --> conda-forge::conda-libmamba-solver-25.4.0-pyhd8ed1ab_0 2025-05-07T19:43:31.9525910Z 2025-05-07T19:43:31.9525914Z 2025-05-07T19:43:31.9525918Z 2025-05-07T19:43:31.9526068Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:31.9526441Z conda-25.3.1 | 1.1 MB | | 0% 2025-05-07T19:43:31.9526691Z 2025-05-07T19:43:31.9527127Z certifi-2025.4.26 | 154 KB | | 0%  2025-05-07T19:43:31.9527380Z 2025-05-07T19:43:31.9527384Z 2025-05-07T19:43:31.9527638Z ca-certificates-2025 | 149 KB | | 0%  2025-05-07T19:43:31.9527916Z 2025-05-07T19:43:31.9527919Z 2025-05-07T19:43:31.9528187Z 2025-05-07T19:43:32.0038807Z conda-libmamba-solve | 41 KB | | 0%  2025-05-07T19:43:32.0039139Z 2025-05-07T19:43:32.0039144Z 2025-05-07T19:43:32.0039147Z 2025-05-07T19:43:32.0052091Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:32.0052428Z 2025-05-07T19:43:32.0052814Z 2025-05-07T19:43:32.0171328Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:32.0171636Z 2025-05-07T19:43:32.0171640Z 2025-05-07T19:43:32.0200241Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:32.0200600Z 2025-05-07T19:43:32.0200685Z 2025-05-07T19:43:32.0200691Z 2025-05-07T19:43:32.0248351Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:32.0249295Z 2025-05-07T19:43:32.0319477Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:32.0319768Z 2025-05-07T19:43:32.0503411Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:32.1429118Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:32.1430200Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:32.1433064Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:32.1433726Z 2025-05-07T19:43:32.1433942Z 2025-05-07T19:43:32.1434216Z  2025-05-07T19:43:32.1434503Z 2025-05-07T19:43:32.1434507Z 2025-05-07T19:43:32.1434700Z  2025-05-07T19:43:32.1434936Z 2025-05-07T19:43:32.1434940Z 2025-05-07T19:43:32.1434943Z 2025-05-07T19:43:32.1435171Z  done 2025-05-07T19:43:32.2448093Z Preparing transaction: | done 2025-05-07T19:43:32.3454073Z Verifying transaction: - done 2025-05-07T19:43:33.6484992Z Executing transaction: | / - \ | / - \ | / - \ | done 2025-05-07T19:43:35.2182281Z [SETUP] Updating Miniconda base packages ... 2025-05-07T19:43:35.2214664Z [EXEC] [ATTEMPT 0/3] + conda update -n base -c defaults --update-deps -y conda 2025-05-07T19:43:35.9262312Z Channels: 2025-05-07T19:43:35.9262589Z - defaults 2025-05-07T19:43:35.9262849Z Platform: linux-64 2025-05-07T19:43:36.9993904Z Collecting package metadata (repodata.json): - \ | / - \ done 2025-05-07T19:43:37.1291890Z Solving environment: / - Channels: 2025-05-07T19:43:37.1292806Z - defaults 2025-05-07T19:43:37.1293450Z Platform: linux-64 2025-05-07T19:43:37.4074565Z Collecting package metadata (repodata.json): | / - \ done 2025-05-07T19:43:37.6204460Z Solving environment: / - \ | done 2025-05-07T19:43:37.7156750Z done 2025-05-07T19:43:37.7799953Z 2025-05-07T19:43:37.7800500Z ## Package Plan ## 2025-05-07T19:43:37.7800965Z 2025-05-07T19:43:37.7801386Z environment location: /github/home/miniconda 2025-05-07T19:43:37.7802134Z 2025-05-07T19:43:37.7802409Z added / updated specs: 2025-05-07T19:43:37.7803163Z - conda 2025-05-07T19:43:37.7803503Z 2025-05-07T19:43:37.7803547Z 2025-05-07T19:43:37.7803895Z The following packages will be downloaded: 2025-05-07T19:43:37.7804578Z 2025-05-07T19:43:37.7804904Z package | build 2025-05-07T19:43:37.7805852Z ---------------------------|----------------- 2025-05-07T19:43:37.7806841Z pip-25.1 | pyhc872135_2 1.3 MB 2025-05-07T19:43:37.7807978Z tzdata-2025b | h04d1e81_0 116 KB 2025-05-07T19:43:37.7809072Z ------------------------------------------------------------ 2025-05-07T19:43:37.7810053Z Total: 1.4 MB 2025-05-07T19:43:37.7810673Z 2025-05-07T19:43:37.7810987Z The following packages will be UPDATED: 2025-05-07T19:43:37.7811627Z 2025-05-07T19:43:37.7812600Z pip pkgs/main/linux-64::pip-25.0-py313h06~ --> pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:37.7813486Z tzdata 2025a-h04d1e81_0 --> 2025b-h04d1e81_0 2025-05-07T19:43:37.7813742Z 2025-05-07T19:43:37.7813746Z 2025-05-07T19:43:37.7813750Z 2025-05-07T19:43:37.7813888Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:37.7814269Z pip-25.1 | 1.3 MB | | 0% 2025-05-07T19:43:37.7814490Z 2025-05-07T19:43:37.8202874Z tzdata-2025b | 116 KB | | 0%  2025-05-07T19:43:37.8508627Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:37.8509422Z 2025-05-07T19:43:37.9805722Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:37.9806872Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:38.0124718Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:38.0125247Z 2025-05-07T19:43:38.0125620Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:38.0125907Z 2025-05-07T19:43:38.0126129Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:38.0126485Z 2025-05-07T19:43:38.0126693Z 2025-05-07T19:43:38.0126869Z  done 2025-05-07T19:43:38.1141042Z Preparing transaction: - done 2025-05-07T19:43:38.2145367Z Verifying transaction: | done 2025-05-07T19:43:40.2182771Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:40.7668099Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:43:40.7670246Z + conda clean --packages --tarball -y 2025-05-07T19:43:40.7670463Z 2025-05-07T19:43:41.2132101Z Will remove 99 (117.8 MB) tarball(s). 2025-05-07T19:43:41.2133072Z Will remove 11 (16.0 MB) package(s). 2025-05-07T19:43:41.2696072Z 2025-05-07T19:43:41.2700054Z + conda clean --all -y 2025-05-07T19:43:41.2700361Z 2025-05-07T19:43:41.7220704Z There are no unused tarball(s) to remove. 2025-05-07T19:43:41.7221532Z Will remove 1 index cache(s). 2025-05-07T19:43:41.7221879Z There are no unused package(s) to remove. 2025-05-07T19:43:41.7222476Z There are no tempfile(s) to remove. 2025-05-07T19:43:41.7222834Z There are no logfile(s) to remove. 2025-05-07T19:43:41.7767671Z 2025-05-07T19:43:41.7768255Z + conda info 2025-05-07T19:43:41.7768460Z 2025-05-07T19:43:42.3405630Z 2025-05-07T19:43:42.3406238Z active environment : base 2025-05-07T19:43:42.3406926Z active env location : /github/home/miniconda 2025-05-07T19:43:42.3407311Z shell level : 1 2025-05-07T19:43:42.3407629Z user config file : /github/home/.condarc 2025-05-07T19:43:42.3408139Z populated config files : /github/home/miniconda/.condarc 2025-05-07T19:43:42.3408538Z conda version : 25.3.1 2025-05-07T19:43:42.3408875Z conda-build version : not installed 2025-05-07T19:43:42.3409193Z python version : 3.13.2.final.0 2025-05-07T19:43:42.3409523Z solver : libmamba (default) 2025-05-07T19:43:42.3409890Z virtual packages : __archspec=1=cascadelake 2025-05-07T19:43:42.3410241Z __conda=25.3.1=0 2025-05-07T19:43:42.3410526Z __glibc=2.34=0 2025-05-07T19:43:42.3410848Z __linux=6.1.130=0 2025-05-07T19:43:42.3411136Z __unix=0=0 2025-05-07T19:43:42.3411502Z base environment : /github/home/miniconda (writable) 2025-05-07T19:43:42.3411935Z conda av data dir : /github/home/miniconda/etc/conda 2025-05-07T19:43:42.3412283Z conda av metadata url : None 2025-05-07T19:43:42.3412691Z channel URLs : https://repo.anaconda.com/pkgs/main/linux-64 2025-05-07T19:43:42.3413123Z https://repo.anaconda.com/pkgs/main/noarch 2025-05-07T19:43:42.3413539Z https://repo.anaconda.com/pkgs/r/linux-64 2025-05-07T19:43:42.3413923Z https://repo.anaconda.com/pkgs/r/noarch 2025-05-07T19:43:42.3414333Z package cache : /github/home/miniconda/pkgs 2025-05-07T19:43:42.3414972Z /github/home/.conda/pkgs 2025-05-07T19:43:42.3415362Z envs directories : /github/home/miniconda/envs 2025-05-07T19:43:42.3415738Z /github/home/.conda/envs 2025-05-07T19:43:42.3416054Z platform : linux-64 2025-05-07T19:43:42.3416941Z user-agent : conda/25.3.1 requests/2.32.3 CPython/3.13.2 Linux/6.1.130-139.222.amzn2023.x86_64 amzn/2023.7.20250428 glibc/2.34 solver/libmamba conda-libmamba-solver/25.4.0 libmambapy/2.0.5 aau/0.7.0 c/. s/. e/. 2025-05-07T19:43:42.3417787Z UID:GID : 0:0 2025-05-07T19:43:42.3418084Z netrc file : None 2025-05-07T19:43:42.3418359Z offline mode : False 2025-05-07T19:43:42.3418563Z 2025-05-07T19:43:42.3986927Z 2025-05-07T19:43:42.3987540Z [SETUP] Exporting Miniconda variables ... 2025-05-07T19:43:42.3988679Z [SETUP] Saving Miniconda variables to /__w/_temp/_runner_file_commands/add_path_9371bc97-346a-4a99-ba5c-1b656315b997 ... 2025-05-07T19:43:42.3989411Z [SETUP] Successfully set up Miniconda at /github/home/miniconda 2025-05-07T19:43:42.4159357Z ##[group]Run . $PRELUDE; create_conda_environment $BUILD_ENV 3.10 2025-05-07T19:43:42.4159886Z . $PRELUDE; create_conda_environment $BUILD_ENV 3.10 2025-05-07T19:43:42.4160590Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:42.4160909Z env: 2025-05-07T19:43:42.4161135Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:42.4161417Z BUILD_ENV: build_binary 2025-05-07T19:43:42.4161661Z BUILD_TARGET: default 2025-05-07T19:43:42.4161874Z BUILD_VARIANT: cuda 2025-05-07T19:43:42.4162106Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:42.4162341Z ##[endgroup] 2025-05-07T19:43:42.8347592Z ################################################################################ 2025-05-07T19:43:42.8348001Z # Create Conda Environment 2025-05-07T19:43:42.8348247Z # 2025-05-07T19:43:42.8366105Z # [2025-05-07T19:43:42.835Z] + create_conda_environment build_binary 3.10 2025-05-07T19:43:42.8367916Z ################################################################################ 2025-05-07T19:43:42.8368586Z 2025-05-07T19:43:42.8390491Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:42.9253173Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:42.9253821Z [SETUP] Listing existing Conda environments ... 2025-05-07T19:43:42.9254157Z + conda info --envs 2025-05-07T19:43:42.9254315Z 2025-05-07T19:43:43.4931147Z 2025-05-07T19:43:43.4931687Z # conda environments: 2025-05-07T19:43:43.4931955Z # 2025-05-07T19:43:43.4932208Z base /github/home/miniconda 2025-05-07T19:43:43.4932433Z 2025-05-07T19:43:43.5526091Z 2025-05-07T19:43:43.5526448Z [SETUP] Deleting the prefix directory if it exists ... 2025-05-07T19:43:45.2094549Z + rm -rf /github/home/miniconda/envs/build_binary 2025-05-07T19:43:45.2094941Z 2025-05-07T19:43:45.2118476Z 2025-05-07T19:43:45.2134405Z [SETUP] Creating new Conda environment (Python 3.10) ... 2025-05-07T19:43:45.2164025Z [EXEC] [ATTEMPT 0/3] + conda create -y -n build_binary python=3.10 2025-05-07T19:43:45.7983318Z Channels: 2025-05-07T19:43:45.7983622Z - defaults 2025-05-07T19:43:45.7983843Z Platform: linux-64 2025-05-07T19:43:47.1797401Z Collecting package metadata (repodata.json): - \ | / - \ | / - done 2025-05-07T19:43:47.2804571Z Solving environment: | done 2025-05-07T19:43:47.3097994Z 2025-05-07T19:43:47.3098426Z ## Package Plan ## 2025-05-07T19:43:47.3098629Z 2025-05-07T19:43:47.3098848Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:43:47.3099162Z 2025-05-07T19:43:47.3099273Z added / updated specs: 2025-05-07T19:43:47.3099660Z - python=3.10 2025-05-07T19:43:47.3099811Z 2025-05-07T19:43:47.3099815Z 2025-05-07T19:43:47.3099935Z The following packages will be downloaded: 2025-05-07T19:43:47.3100156Z 2025-05-07T19:43:47.3100271Z package | build 2025-05-07T19:43:47.3100642Z ---------------------------|----------------- 2025-05-07T19:43:47.3101031Z _libgcc_mutex-0.1 | main 3 KB 2025-05-07T19:43:47.3101434Z _openmp_mutex-5.1 | 1_gnu 21 KB 2025-05-07T19:43:47.3101881Z ca-certificates-2025.2.25 | h06a4308_0 129 KB 2025-05-07T19:43:47.3102295Z python-3.10.16 | he870216_1 26.9 MB 2025-05-07T19:43:47.3102712Z setuptools-78.1.1 | py310h06a4308_0 1.7 MB 2025-05-07T19:43:47.3103111Z wheel-0.45.1 | py310h06a4308_0 115 KB 2025-05-07T19:43:47.3103492Z ------------------------------------------------------------ 2025-05-07T19:43:47.3103847Z Total: 28.8 MB 2025-05-07T19:43:47.3104063Z 2025-05-07T19:43:47.3104192Z The following NEW packages will be INSTALLED: 2025-05-07T19:43:47.3104515Z 2025-05-07T19:43:47.3104791Z _libgcc_mutex pkgs/main/linux-64::_libgcc_mutex-0.1-main 2025-05-07T19:43:47.3105290Z _openmp_mutex pkgs/main/linux-64::_openmp_mutex-5.1-1_gnu 2025-05-07T19:43:47.3106041Z bzip2 pkgs/main/linux-64::bzip2-1.0.8-h5eee18b_6 2025-05-07T19:43:47.3106558Z ca-certificates pkgs/main/linux-64::ca-certificates-2025.2.25-h06a4308_0 2025-05-07T19:43:47.3107127Z ld_impl_linux-64 pkgs/main/linux-64::ld_impl_linux-64-2.40-h12ee557_0 2025-05-07T19:43:47.3107619Z libffi pkgs/main/linux-64::libffi-3.4.4-h6a678d5_1 2025-05-07T19:43:47.3108060Z libgcc-ng pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1 2025-05-07T19:43:47.3108527Z libgomp pkgs/main/linux-64::libgomp-11.2.0-h1234567_1 2025-05-07T19:43:47.3109009Z libstdcxx-ng pkgs/main/linux-64::libstdcxx-ng-11.2.0-h1234567_1 2025-05-07T19:43:47.3109476Z libuuid pkgs/main/linux-64::libuuid-1.41.5-h5eee18b_0 2025-05-07T19:43:47.3109920Z ncurses pkgs/main/linux-64::ncurses-6.4-h6a678d5_0 2025-05-07T19:43:47.3111652Z openssl pkgs/main/linux-64::openssl-3.0.16-h5eee18b_0 2025-05-07T19:43:47.3112095Z pip pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:47.3112618Z python pkgs/main/linux-64::python-3.10.16-he870216_1 2025-05-07T19:43:47.3113149Z readline pkgs/main/linux-64::readline-8.2-h5eee18b_0 2025-05-07T19:43:47.3113653Z setuptools pkgs/main/linux-64::setuptools-78.1.1-py310h06a4308_0 2025-05-07T19:43:47.3114135Z sqlite pkgs/main/linux-64::sqlite-3.45.3-h5eee18b_0 2025-05-07T19:43:47.3114544Z tk pkgs/main/linux-64::tk-8.6.14-h39e8969_0 2025-05-07T19:43:47.3114947Z tzdata pkgs/main/noarch::tzdata-2025b-h04d1e81_0 2025-05-07T19:43:47.3115374Z wheel pkgs/main/linux-64::wheel-0.45.1-py310h06a4308_0 2025-05-07T19:43:47.3115785Z xz pkgs/main/linux-64::xz-5.6.4-h5eee18b_1 2025-05-07T19:43:47.3116158Z zlib pkgs/main/linux-64::zlib-1.2.13-h5eee18b_1 2025-05-07T19:43:47.3116431Z 2025-05-07T19:43:47.3116435Z 2025-05-07T19:43:47.3116439Z 2025-05-07T19:43:47.3116588Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:47.3116983Z python-3.10.16 | 26.9 MB | | 0% 2025-05-07T19:43:47.3117218Z 2025-05-07T19:43:47.3117557Z setuptools-78.1.1 | 1.7 MB | | 0%  2025-05-07T19:43:47.3117825Z 2025-05-07T19:43:47.3117829Z 2025-05-07T19:43:47.3118058Z ca-certificates-2025 | 129 KB | | 0%  2025-05-07T19:43:47.3118328Z 2025-05-07T19:43:47.3118331Z 2025-05-07T19:43:47.3118335Z 2025-05-07T19:43:47.3129715Z wheel-0.45.1 | 115 KB | | 0%  2025-05-07T19:43:47.3129966Z 2025-05-07T19:43:47.3129970Z 2025-05-07T19:43:47.3129974Z 2025-05-07T19:43:47.3130005Z 2025-05-07T19:43:47.3174937Z _openmp_mutex-5.1 | 21 KB | | 0%  2025-05-07T19:43:47.3175256Z 2025-05-07T19:43:47.3175261Z 2025-05-07T19:43:47.3175280Z 2025-05-07T19:43:47.3175284Z 2025-05-07T19:43:47.3175288Z 2025-05-07T19:43:47.3559319Z _libgcc_mutex-0.1 | 3 KB | | 0%  2025-05-07T19:43:47.3559718Z 2025-05-07T19:43:47.3559725Z 2025-05-07T19:43:47.3559729Z 2025-05-07T19:43:47.3559735Z 2025-05-07T19:43:47.3649210Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:47.3650108Z 2025-05-07T19:43:47.3650123Z 2025-05-07T19:43:47.3650136Z 2025-05-07T19:43:47.3670917Z wheel-0.45.1 | 115 KB | ########## | 100%  2025-05-07T19:43:47.3671260Z 2025-05-07T19:43:47.3671266Z 2025-05-07T19:43:47.3762977Z ca-certificates-2025 | 129 KB | ########## | 100%  2025-05-07T19:43:47.3763349Z 2025-05-07T19:43:47.3763355Z 2025-05-07T19:43:47.3763361Z 2025-05-07T19:43:47.3763367Z 2025-05-07T19:43:47.3763373Z 2025-05-07T19:43:47.3997450Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:47.3997784Z 2025-05-07T19:43:47.3997816Z 2025-05-07T19:43:47.4106004Z ca-certificates-2025 | 129 KB | ########## | 100%  2025-05-07T19:43:47.4125256Z python-3.10.16 | 26.9 MB | 6 | 7% 2025-05-07T19:43:47.4125602Z 2025-05-07T19:43:47.4125606Z 2025-05-07T19:43:47.4125610Z 2025-05-07T19:43:47.4128057Z 2025-05-07T19:43:47.4144854Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:47.4145177Z 2025-05-07T19:43:47.4145182Z 2025-05-07T19:43:47.4145186Z 2025-05-07T19:43:47.4145191Z 2025-05-07T19:43:47.4145207Z 2025-05-07T19:43:47.4252340Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:47.4252721Z 2025-05-07T19:43:47.4252957Z setuptools-78.1.1 | 1.7 MB | ########## | 100%  2025-05-07T19:43:47.4253247Z 2025-05-07T19:43:47.4386370Z setuptools-78.1.1 | 1.7 MB | ########## | 100%  2025-05-07T19:43:47.4387216Z 2025-05-07T19:43:47.4387231Z 2025-05-07T19:43:47.4387242Z 2025-05-07T19:43:47.4387902Z wheel-0.45.1 | 115 KB | ########## | 100%  2025-05-07T19:43:47.4388720Z 2025-05-07T19:43:47.4388724Z 2025-05-07T19:43:47.4388727Z 2025-05-07T19:43:47.5107638Z wheel-0.45.1 | 115 KB | ########## | 100%  2025-05-07T19:43:47.6474686Z python-3.10.16 | 26.9 MB | ######## | 80% 2025-05-07T19:43:47.6681249Z python-3.10.16 | 26.9 MB | ########## | 100% 2025-05-07T19:43:47.6682041Z 2025-05-07T19:43:48.1561534Z setuptools-78.1.1 | 1.7 MB | ########## | 100%  2025-05-07T19:43:48.1566878Z python-3.10.16 | 26.9 MB | ########## | 100% 2025-05-07T19:43:48.1567947Z 2025-05-07T19:43:48.1568546Z 2025-05-07T19:43:48.1569057Z  2025-05-07T19:43:48.1569696Z 2025-05-07T19:43:48.1569708Z 2025-05-07T19:43:48.1570187Z  2025-05-07T19:43:48.1570800Z 2025-05-07T19:43:48.1570811Z 2025-05-07T19:43:48.1570822Z 2025-05-07T19:43:48.1571356Z  2025-05-07T19:43:48.1571825Z 2025-05-07T19:43:48.1571829Z 2025-05-07T19:43:48.1571841Z 2025-05-07T19:43:48.1571845Z 2025-05-07T19:43:48.1572033Z  2025-05-07T19:43:48.1572271Z 2025-05-07T19:43:48.1572275Z 2025-05-07T19:43:48.1572278Z 2025-05-07T19:43:48.1572281Z 2025-05-07T19:43:48.1572284Z 2025-05-07T19:43:48.1572473Z  done 2025-05-07T19:43:48.3632085Z Preparing transaction: - \ done 2025-05-07T19:43:49.5264165Z Verifying transaction: / - \ | / - \ | / - \ done 2025-05-07T19:43:51.6427968Z Executing transaction: / - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:51.6475938Z # 2025-05-07T19:43:51.6476335Z # To activate this environment, use 2025-05-07T19:43:51.6477690Z # 2025-05-07T19:43:51.6478126Z # $ conda activate build_binary 2025-05-07T19:43:51.6478521Z # 2025-05-07T19:43:51.6478742Z # To deactivate an active environment, use 2025-05-07T19:43:51.6479089Z # 2025-05-07T19:43:51.6479294Z # $ conda deactivate 2025-05-07T19:43:51.6479459Z 2025-05-07T19:43:51.7487422Z [SETUP] Upgrading PIP to latest ... 2025-05-07T19:43:51.7518593Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --upgrade pip 2025-05-07T19:43:54.5795279Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:43:54.5797274Z Requirement already satisfied: pip in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (25.1) 2025-05-07T19:43:54.5797901Z Collecting pip 2025-05-07T19:43:54.5798252Z Downloading pip-25.1.1-py3-none-any.whl.metadata (3.6 kB) 2025-05-07T19:43:54.5798685Z Downloading pip-25.1.1-py3-none-any.whl (1.8 MB) 2025-05-07T19:43:54.5799893Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.8/1.8 MB 86.1 MB/s eta 0:00:00 2025-05-07T19:43:54.5800256Z Installing collected packages: pip 2025-05-07T19:43:54.5800561Z Attempting uninstall: pip 2025-05-07T19:43:54.5800832Z Found existing installation: pip 25.1 2025-05-07T19:43:54.5801073Z 2025-05-07T19:43:54.5801171Z Uninstalling pip-25.1: 2025-05-07T19:43:54.5801432Z Successfully uninstalled pip-25.1 2025-05-07T19:43:54.5801748Z Successfully installed pip-25.1.1 2025-05-07T19:43:54.5801933Z 2025-05-07T19:43:54.6393307Z [SETUP] Upgrading pyOpenSSL ... 2025-05-07T19:43:54.6417685Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y pyOpenSSL>22.1.0 2025-05-07T19:43:55.3061588Z Channels: 2025-05-07T19:43:55.3062249Z - conda-forge 2025-05-07T19:43:55.3062509Z Platform: linux-64 2025-05-07T19:44:04.9061386Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:44:06.7331054Z Solving environment: | / - \ | done 2025-05-07T19:44:06.7781611Z 2025-05-07T19:44:06.7782411Z ## Package Plan ## 2025-05-07T19:44:06.7782658Z 2025-05-07T19:44:06.7782884Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:06.7783230Z 2025-05-07T19:44:06.7783366Z added / updated specs: 2025-05-07T19:44:06.7783643Z - pyopenssl[version='>22.1.0'] 2025-05-07T19:44:06.7783846Z 2025-05-07T19:44:06.7783854Z 2025-05-07T19:44:06.7783997Z The following packages will be downloaded: 2025-05-07T19:44:06.7784241Z 2025-05-07T19:44:06.7784362Z package | build 2025-05-07T19:44:06.7784750Z ---------------------------|----------------- 2025-05-07T19:44:06.7785129Z cffi-1.17.1 | py310h8deb56e_0 238 KB conda-forge 2025-05-07T19:44:06.7785644Z cryptography-44.0.3 | py310h6c63255_0 1.5 MB conda-forge 2025-05-07T19:44:06.7786134Z libgcc-15.1.0 | h767d61c_2 810 KB conda-forge 2025-05-07T19:44:06.7786562Z libgcc-ng-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:44:06.7787000Z libgomp-15.1.0 | h767d61c_2 442 KB conda-forge 2025-05-07T19:44:06.7787420Z openssl-3.5.0 | h7b32b05_1 3.0 MB conda-forge 2025-05-07T19:44:06.7787866Z pycparser-2.22 | pyh29332c3_1 108 KB conda-forge 2025-05-07T19:44:06.7788329Z pyopenssl-25.0.0 | pyhd8ed1ab_0 120 KB conda-forge 2025-05-07T19:44:06.7788766Z python_abi-3.10 | 2_cp310 4 KB conda-forge 2025-05-07T19:44:06.7789246Z typing-extensions-4.13.2 | h0e9735f_0 88 KB conda-forge 2025-05-07T19:44:06.7789744Z typing_extensions-4.13.2 | pyh29332c3_0 51 KB conda-forge 2025-05-07T19:44:06.7790199Z ------------------------------------------------------------ 2025-05-07T19:44:06.7790548Z Total: 6.3 MB 2025-05-07T19:44:06.7790780Z 2025-05-07T19:44:06.7790907Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:06.7791138Z 2025-05-07T19:44:06.7791364Z cffi conda-forge/linux-64::cffi-1.17.1-py310h8deb56e_0 2025-05-07T19:44:06.7791879Z cryptography conda-forge/linux-64::cryptography-44.0.3-py310h6c63255_0 2025-05-07T19:44:06.7792406Z libgcc conda-forge/linux-64::libgcc-15.1.0-h767d61c_2 2025-05-07T19:44:06.7792904Z pycparser conda-forge/noarch::pycparser-2.22-pyh29332c3_1 2025-05-07T19:44:06.7793395Z pyopenssl conda-forge/noarch::pyopenssl-25.0.0-pyhd8ed1ab_0 2025-05-07T19:44:06.7793886Z python_abi conda-forge/linux-64::python_abi-3.10-2_cp310 2025-05-07T19:44:06.7796762Z typing-extensions conda-forge/noarch::typing-extensions-4.13.2-h0e9735f_0 2025-05-07T19:44:06.7797417Z typing_extensions conda-forge/noarch::typing_extensions-4.13.2-pyh29332c3_0 2025-05-07T19:44:06.7798037Z 2025-05-07T19:44:06.7798156Z The following packages will be UPDATED: 2025-05-07T19:44:06.7798384Z 2025-05-07T19:44:06.7798803Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:44:06.7799631Z libgcc-ng pkgs/main::libgcc-ng-11.2.0-h1234567_1 --> conda-forge::libgcc-ng-15.1.0-h69a702a_2 2025-05-07T19:44:06.7800314Z libgomp pkgs/main::libgomp-11.2.0-h1234567_1 --> conda-forge::libgomp-15.1.0-h767d61c_2 2025-05-07T19:44:06.7800988Z openssl pkgs/main::openssl-3.0.16-h5eee18b_0 --> conda-forge::openssl-3.5.0-h7b32b05_1 2025-05-07T19:44:06.7801374Z 2025-05-07T19:44:06.7801378Z 2025-05-07T19:44:06.7801382Z 2025-05-07T19:44:06.7801543Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:06.7801923Z openssl-3.5.0 | 3.0 MB | | 0% 2025-05-07T19:44:06.7802319Z 2025-05-07T19:44:06.7802767Z cryptography-44.0.3 | 1.5 MB | | 0%  2025-05-07T19:44:06.7803028Z 2025-05-07T19:44:06.7803032Z 2025-05-07T19:44:06.7805852Z libgcc-15.1.0 | 810 KB | | 0%  2025-05-07T19:44:06.7806116Z 2025-05-07T19:44:06.7806120Z 2025-05-07T19:44:06.7806124Z 2025-05-07T19:44:06.7821471Z libgomp-15.1.0 | 442 KB | | 0%  2025-05-07T19:44:06.7821828Z 2025-05-07T19:44:06.7821836Z 2025-05-07T19:44:06.7821839Z 2025-05-07T19:44:06.7821846Z 2025-05-07T19:44:06.7835218Z cffi-1.17.1 | 238 KB | | 0%  2025-05-07T19:44:06.7835756Z 2025-05-07T19:44:06.7835863Z 2025-05-07T19:44:06.7835869Z 2025-05-07T19:44:06.7835901Z 2025-05-07T19:44:06.7835964Z 2025-05-07T19:44:06.7836312Z pyopenssl-25.0.0 | 120 KB | | 0%  2025-05-07T19:44:06.7836657Z 2025-05-07T19:44:06.7836683Z 2025-05-07T19:44:06.7836720Z 2025-05-07T19:44:06.7836726Z 2025-05-07T19:44:06.7836731Z 2025-05-07T19:44:06.7836736Z 2025-05-07T19:44:06.7837029Z pycparser-2.22 | 108 KB | | 0%  2025-05-07T19:44:06.7837326Z 2025-05-07T19:44:06.7837329Z 2025-05-07T19:44:06.7837363Z 2025-05-07T19:44:06.7837366Z 2025-05-07T19:44:06.7837369Z 2025-05-07T19:44:06.7837373Z 2025-05-07T19:44:06.7837383Z 2025-05-07T19:44:06.7838544Z typing-extensions-4. | 88 KB | | 0%  2025-05-07T19:44:06.7838969Z 2025-05-07T19:44:06.7838974Z 2025-05-07T19:44:06.7838979Z 2025-05-07T19:44:06.7838983Z 2025-05-07T19:44:06.7838987Z 2025-05-07T19:44:06.7838990Z 2025-05-07T19:44:06.7838993Z 2025-05-07T19:44:06.7839012Z 2025-05-07T19:44:06.7839525Z typing_extensions-4. | 51 KB | | 0%  2025-05-07T19:44:06.7839882Z 2025-05-07T19:44:06.7839888Z 2025-05-07T19:44:06.7839893Z 2025-05-07T19:44:06.7839897Z 2025-05-07T19:44:06.7839932Z 2025-05-07T19:44:06.7839936Z 2025-05-07T19:44:06.7839947Z 2025-05-07T19:44:06.7839950Z 2025-05-07T19:44:06.7839954Z 2025-05-07T19:44:06.7840755Z libgcc-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:06.7841092Z 2025-05-07T19:44:06.7841110Z 2025-05-07T19:44:06.7841114Z 2025-05-07T19:44:06.7841117Z 2025-05-07T19:44:06.7841121Z 2025-05-07T19:44:06.7841125Z 2025-05-07T19:44:06.7841128Z 2025-05-07T19:44:06.7841131Z 2025-05-07T19:44:06.7841135Z 2025-05-07T19:44:06.7841138Z 2025-05-07T19:44:06.8697058Z python_abi-3.10 | 4 KB | | 0%  2025-05-07T19:44:06.8697404Z 2025-05-07T19:44:06.8697409Z 2025-05-07T19:44:06.8697413Z 2025-05-07T19:44:06.8762243Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:06.8762589Z 2025-05-07T19:44:06.8762594Z 2025-05-07T19:44:06.8762598Z 2025-05-07T19:44:06.8762601Z 2025-05-07T19:44:06.8800048Z cffi-1.17.1 | 238 KB | ########## | 100%  2025-05-07T19:44:06.8800406Z 2025-05-07T19:44:06.8800411Z 2025-05-07T19:44:06.8908439Z libgcc-15.1.0 | 810 KB | 5 | 6%  2025-05-07T19:44:06.8908769Z 2025-05-07T19:44:06.9005138Z cryptography-44.0.3 | 1.5 MB | 1 | 1%  2025-05-07T19:44:06.9005455Z 2025-05-07T19:44:06.9005460Z 2025-05-07T19:44:06.9068223Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:06.9068557Z 2025-05-07T19:44:06.9068562Z 2025-05-07T19:44:06.9068566Z 2025-05-07T19:44:06.9068569Z 2025-05-07T19:44:06.9068573Z 2025-05-07T19:44:06.9069112Z 2025-05-07T19:44:06.9118083Z pycparser-2.22 | 108 KB | #4 | 15%  2025-05-07T19:44:06.9118442Z 2025-05-07T19:44:06.9118447Z 2025-05-07T19:44:06.9118450Z 2025-05-07T19:44:06.9118454Z 2025-05-07T19:44:06.9118457Z 2025-05-07T19:44:06.9118461Z 2025-05-07T19:44:06.9145572Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:06.9146323Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:06.9221366Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:06.9221703Z 2025-05-07T19:44:06.9221708Z 2025-05-07T19:44:06.9221712Z 2025-05-07T19:44:06.9221715Z 2025-05-07T19:44:06.9221719Z 2025-05-07T19:44:06.9253099Z pyopenssl-25.0.0 | 120 KB | #3 | 13%  2025-05-07T19:44:06.9254051Z 2025-05-07T19:44:06.9254065Z 2025-05-07T19:44:06.9254075Z 2025-05-07T19:44:06.9254086Z 2025-05-07T19:44:06.9254097Z 2025-05-07T19:44:06.9329276Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:06.9330181Z 2025-05-07T19:44:06.9361327Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:06.9362185Z 2025-05-07T19:44:06.9362198Z 2025-05-07T19:44:06.9362210Z 2025-05-07T19:44:06.9362870Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:06.9363521Z 2025-05-07T19:44:06.9363524Z 2025-05-07T19:44:06.9363528Z 2025-05-07T19:44:06.9409395Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:06.9410271Z 2025-05-07T19:44:06.9410284Z 2025-05-07T19:44:06.9410325Z 2025-05-07T19:44:06.9410336Z 2025-05-07T19:44:06.9410988Z cffi-1.17.1 | 238 KB | ########## | 100%  2025-05-07T19:44:06.9411718Z 2025-05-07T19:44:06.9411730Z 2025-05-07T19:44:06.9411741Z 2025-05-07T19:44:06.9411752Z 2025-05-07T19:44:06.9458794Z cffi-1.17.1 | 238 KB | ########## | 100%  2025-05-07T19:44:06.9459841Z 2025-05-07T19:44:06.9459856Z 2025-05-07T19:44:06.9459901Z 2025-05-07T19:44:06.9459912Z 2025-05-07T19:44:06.9459922Z 2025-05-07T19:44:06.9459933Z 2025-05-07T19:44:06.9459943Z 2025-05-07T19:44:06.9467526Z typing-extensions-4. | 88 KB | #8 | 18%  2025-05-07T19:44:06.9468478Z 2025-05-07T19:44:06.9468490Z 2025-05-07T19:44:06.9468501Z 2025-05-07T19:44:06.9468511Z 2025-05-07T19:44:06.9468521Z 2025-05-07T19:44:06.9468531Z 2025-05-07T19:44:06.9468573Z 2025-05-07T19:44:06.9468584Z 2025-05-07T19:44:06.9490774Z typing_extensions-4. | 51 KB | ###1 | 31%  2025-05-07T19:44:06.9491660Z 2025-05-07T19:44:06.9491664Z 2025-05-07T19:44:06.9491669Z 2025-05-07T19:44:06.9491672Z 2025-05-07T19:44:06.9491675Z 2025-05-07T19:44:06.9491679Z 2025-05-07T19:44:06.9491683Z 2025-05-07T19:44:06.9491686Z 2025-05-07T19:44:06.9491995Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:06.9492303Z 2025-05-07T19:44:06.9492307Z 2025-05-07T19:44:06.9492311Z 2025-05-07T19:44:06.9492314Z 2025-05-07T19:44:06.9492317Z 2025-05-07T19:44:06.9492321Z 2025-05-07T19:44:06.9492446Z 2025-05-07T19:44:06.9714855Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:06.9715349Z 2025-05-07T19:44:06.9715355Z 2025-05-07T19:44:06.9715359Z 2025-05-07T19:44:06.9715366Z 2025-05-07T19:44:06.9715369Z 2025-05-07T19:44:06.9715376Z 2025-05-07T19:44:06.9715420Z 2025-05-07T19:44:06.9715425Z 2025-05-07T19:44:06.9715431Z 2025-05-07T19:44:06.9728286Z libgcc-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:06.9728604Z 2025-05-07T19:44:06.9728607Z 2025-05-07T19:44:06.9728627Z 2025-05-07T19:44:06.9728630Z 2025-05-07T19:44:06.9728634Z 2025-05-07T19:44:06.9728637Z 2025-05-07T19:44:06.9728641Z 2025-05-07T19:44:06.9728644Z 2025-05-07T19:44:06.9728648Z 2025-05-07T19:44:06.9738953Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:06.9739454Z 2025-05-07T19:44:06.9739642Z 2025-05-07T19:44:06.9935817Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:06.9936113Z 2025-05-07T19:44:06.9936258Z 2025-05-07T19:44:06.9936264Z 2025-05-07T19:44:06.9936276Z 2025-05-07T19:44:06.9936305Z 2025-05-07T19:44:07.0105417Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:07.0105737Z 2025-05-07T19:44:07.0105745Z 2025-05-07T19:44:07.0106019Z 2025-05-07T19:44:07.0106025Z 2025-05-07T19:44:07.0106029Z 2025-05-07T19:44:07.0106043Z 2025-05-07T19:44:07.0106372Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:07.0106656Z 2025-05-07T19:44:07.0106660Z 2025-05-07T19:44:07.0106663Z 2025-05-07T19:44:07.0106666Z 2025-05-07T19:44:07.0106670Z 2025-05-07T19:44:07.0106673Z 2025-05-07T19:44:07.0245402Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:07.0245999Z 2025-05-07T19:44:07.0246200Z 2025-05-07T19:44:07.0246207Z 2025-05-07T19:44:07.0246214Z 2025-05-07T19:44:07.0246221Z 2025-05-07T19:44:07.0246229Z 2025-05-07T19:44:07.0246236Z 2025-05-07T19:44:07.0246242Z 2025-05-07T19:44:07.0375915Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:07.0376434Z 2025-05-07T19:44:07.0376442Z 2025-05-07T19:44:07.0376447Z 2025-05-07T19:44:07.0376454Z 2025-05-07T19:44:07.0376459Z 2025-05-07T19:44:07.0376466Z 2025-05-07T19:44:07.0376618Z 2025-05-07T19:44:07.0703436Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:07.0704455Z 2025-05-07T19:44:07.0704543Z 2025-05-07T19:44:07.0704556Z 2025-05-07T19:44:07.0704567Z 2025-05-07T19:44:07.0704578Z 2025-05-07T19:44:07.0704588Z 2025-05-07T19:44:07.0704599Z 2025-05-07T19:44:07.0704610Z 2025-05-07T19:44:07.0704620Z 2025-05-07T19:44:07.0785949Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:07.0786330Z 2025-05-07T19:44:07.0786335Z 2025-05-07T19:44:07.0786338Z 2025-05-07T19:44:07.0786342Z 2025-05-07T19:44:07.0786345Z 2025-05-07T19:44:07.0786349Z 2025-05-07T19:44:07.0786352Z 2025-05-07T19:44:07.0786356Z 2025-05-07T19:44:07.0786359Z 2025-05-07T19:44:07.0786363Z 2025-05-07T19:44:07.0793719Z python_abi-3.10 | 4 KB | ########## | 100%  2025-05-07T19:44:07.0794656Z 2025-05-07T19:44:07.0794671Z 2025-05-07T19:44:07.0794681Z 2025-05-07T19:44:07.0794727Z 2025-05-07T19:44:07.0794738Z 2025-05-07T19:44:07.0794770Z 2025-05-07T19:44:07.0794780Z 2025-05-07T19:44:07.0794790Z 2025-05-07T19:44:07.0794816Z 2025-05-07T19:44:07.0795173Z 2025-05-07T19:44:07.0884204Z python_abi-3.10 | 4 KB | ########## | 100%  2025-05-07T19:44:07.0884835Z 2025-05-07T19:44:07.0884839Z 2025-05-07T19:44:07.0884843Z 2025-05-07T19:44:07.0884847Z 2025-05-07T19:44:07.0884850Z 2025-05-07T19:44:07.0884853Z 2025-05-07T19:44:07.0884857Z 2025-05-07T19:44:07.0884860Z 2025-05-07T19:44:07.0884864Z 2025-05-07T19:44:07.0884867Z 2025-05-07T19:44:07.1011274Z python_abi-3.10 | 4 KB | ########## | 100%  2025-05-07T19:44:07.1344641Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:07.1345816Z 2025-05-07T19:44:07.1346189Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:07.1346477Z 2025-05-07T19:44:07.1353422Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:07.1353858Z 2025-05-07T19:44:07.1354079Z 2025-05-07T19:44:07.1354564Z  2025-05-07T19:44:07.1354787Z 2025-05-07T19:44:07.1354791Z 2025-05-07T19:44:07.1355032Z  2025-05-07T19:44:07.1355339Z 2025-05-07T19:44:07.1355344Z 2025-05-07T19:44:07.1355348Z 2025-05-07T19:44:07.1355532Z  2025-05-07T19:44:07.1355759Z 2025-05-07T19:44:07.1355763Z 2025-05-07T19:44:07.1355766Z 2025-05-07T19:44:07.1355769Z 2025-05-07T19:44:07.1355982Z  2025-05-07T19:44:07.1356208Z 2025-05-07T19:44:07.1356213Z 2025-05-07T19:44:07.1356217Z 2025-05-07T19:44:07.1356220Z 2025-05-07T19:44:07.1356224Z 2025-05-07T19:44:07.1356439Z  2025-05-07T19:44:07.1357980Z 2025-05-07T19:44:07.1357984Z 2025-05-07T19:44:07.1357988Z 2025-05-07T19:44:07.1357991Z 2025-05-07T19:44:07.1357995Z 2025-05-07T19:44:07.1358007Z 2025-05-07T19:44:07.1358229Z  2025-05-07T19:44:07.1358489Z 2025-05-07T19:44:07.1358493Z 2025-05-07T19:44:07.1358496Z 2025-05-07T19:44:07.1358499Z 2025-05-07T19:44:07.1358503Z 2025-05-07T19:44:07.1358506Z 2025-05-07T19:44:07.1358510Z 2025-05-07T19:44:07.1358703Z  2025-05-07T19:44:07.1358973Z 2025-05-07T19:44:07.1358977Z 2025-05-07T19:44:07.1358980Z 2025-05-07T19:44:07.1358984Z 2025-05-07T19:44:07.1358987Z 2025-05-07T19:44:07.1358991Z 2025-05-07T19:44:07.1358994Z 2025-05-07T19:44:07.1358997Z 2025-05-07T19:44:07.1359192Z  2025-05-07T19:44:07.1359434Z 2025-05-07T19:44:07.1359463Z 2025-05-07T19:44:07.1359466Z 2025-05-07T19:44:07.1359474Z 2025-05-07T19:44:07.1359478Z 2025-05-07T19:44:07.1359481Z 2025-05-07T19:44:07.1359484Z 2025-05-07T19:44:07.1359488Z 2025-05-07T19:44:07.1359495Z 2025-05-07T19:44:07.1359694Z  2025-05-07T19:44:07.1359929Z 2025-05-07T19:44:07.1359933Z 2025-05-07T19:44:07.1359962Z 2025-05-07T19:44:07.1359965Z 2025-05-07T19:44:07.1359969Z 2025-05-07T19:44:07.1359972Z 2025-05-07T19:44:07.1359975Z 2025-05-07T19:44:07.1359979Z 2025-05-07T19:44:07.1359982Z 2025-05-07T19:44:07.1359985Z 2025-05-07T19:44:07.1360208Z  done 2025-05-07T19:44:07.2366821Z Preparing transaction: - done 2025-05-07T19:44:07.4376642Z Verifying transaction: | / done 2025-05-07T19:44:08.8404575Z Executing transaction: \ | / - \ | / - \ | / - \ | done 2025-05-07T19:44:08.9683398Z [SETUP] Testing pyOpenSSL import ... 2025-05-07T19:44:10.6760496Z [CHECK] Python (sub-)package 'OpenSSL' found ... 2025-05-07T19:44:10.6778983Z [SETUP] Installing libxcrypt ... 2025-05-07T19:44:10.6811583Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y libxcrypt 2025-05-07T19:44:11.3421427Z Channels: 2025-05-07T19:44:11.3422440Z - conda-forge 2025-05-07T19:44:11.3423133Z Platform: linux-64 2025-05-07T19:44:14.3474548Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:14.7676188Z Solving environment: \ done 2025-05-07T19:44:14.8154112Z 2025-05-07T19:44:14.8154572Z ## Package Plan ## 2025-05-07T19:44:14.8154974Z 2025-05-07T19:44:14.8155281Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:14.8155641Z 2025-05-07T19:44:14.8155777Z added / updated specs: 2025-05-07T19:44:14.8156058Z - libxcrypt 2025-05-07T19:44:14.8156224Z 2025-05-07T19:44:14.8156228Z 2025-05-07T19:44:14.8156360Z The following packages will be downloaded: 2025-05-07T19:44:14.8156615Z 2025-05-07T19:44:14.8156765Z package | build 2025-05-07T19:44:14.8157391Z ---------------------------|----------------- 2025-05-07T19:44:14.8157834Z libxcrypt-4.4.36 | hd590300_1 98 KB conda-forge 2025-05-07T19:44:14.8158385Z ------------------------------------------------------------ 2025-05-07T19:44:14.8158770Z Total: 98 KB 2025-05-07T19:44:14.8158994Z 2025-05-07T19:44:14.8159153Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:14.8159388Z 2025-05-07T19:44:14.8159642Z libxcrypt conda-forge/linux-64::libxcrypt-4.4.36-hd590300_1 2025-05-07T19:44:14.8159945Z 2025-05-07T19:44:14.8159974Z 2025-05-07T19:44:14.8159977Z 2025-05-07T19:44:14.8160129Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:14.9633784Z libxcrypt-4.4.36 | 98 KB | | 0% 2025-05-07T19:44:14.9655005Z libxcrypt-4.4.36 | 98 KB | #6 | 16% 2025-05-07T19:44:14.9754573Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:14.9755210Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:14.9755606Z 2025-05-07T19:44:14.9755909Z done 2025-05-07T19:44:15.0763534Z Preparing transaction: / done 2025-05-07T19:44:15.1771288Z Verifying transaction: \ done 2025-05-07T19:44:15.2782730Z Executing transaction: / done 2025-05-07T19:44:18.5744004Z [SETUP] Copying over ... 2025-05-07T19:44:18.5744925Z + cp /github/home/miniconda/envs/build_binary/include/crypt.h /github/home/miniconda/envs/build_binary/include/python3.10/crypt.h 2025-05-07T19:44:18.5745542Z 2025-05-07T19:44:18.5781695Z 2025-05-07T19:44:20.1877471Z [SETUP] Installed Python version: Python 3.10.16 2025-05-07T19:44:20.1878034Z [SETUP] Successfully created Conda environment: build_binary 2025-05-07T19:44:20.1958461Z ##[group]Run . $PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:20.1958986Z . $PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:20.1959628Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:20.1959986Z env: 2025-05-07T19:44:20.1960222Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:20.1960564Z BUILD_ENV: build_binary 2025-05-07T19:44:20.1960824Z BUILD_TARGET: default 2025-05-07T19:44:20.1961103Z BUILD_VARIANT: cuda 2025-05-07T19:44:20.1961362Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:44:20.1961654Z ##[endgroup] 2025-05-07T19:44:20.6361351Z ################################################################################ 2025-05-07T19:44:20.6362426Z # Install C/C++ Compilers 2025-05-07T19:44:20.6363164Z # 2025-05-07T19:44:20.6378157Z # [2025-05-07T19:44:20.637Z] + install_cxx_compiler build_binary clang 2025-05-07T19:44:20.6378729Z ################################################################################ 2025-05-07T19:44:20.6378964Z 2025-05-07T19:44:20.6397094Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:20.7261428Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:20.7271002Z [INSTALL] Installing GLIBC (architecture = 64) ... 2025-05-07T19:44:20.7294128Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y sysroot_linux-64=2.17 2025-05-07T19:44:21.3980717Z Channels: 2025-05-07T19:44:21.3980973Z - conda-forge 2025-05-07T19:44:21.3981742Z Platform: linux-64 2025-05-07T19:44:24.4441906Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:24.8704699Z Solving environment: \ | done 2025-05-07T19:44:24.9186912Z 2025-05-07T19:44:24.9187301Z ## Package Plan ## 2025-05-07T19:44:24.9187510Z 2025-05-07T19:44:24.9187735Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:24.9188086Z 2025-05-07T19:44:24.9188229Z added / updated specs: 2025-05-07T19:44:24.9188534Z - sysroot_linux-64=2.17 2025-05-07T19:44:24.9188722Z 2025-05-07T19:44:24.9188763Z 2025-05-07T19:44:24.9188903Z The following packages will be downloaded: 2025-05-07T19:44:24.9189137Z 2025-05-07T19:44:24.9189263Z package | build 2025-05-07T19:44:24.9189643Z ---------------------------|----------------- 2025-05-07T19:44:24.9190125Z kernel-headers_linux-64-3.10.0| he073ed8_18 921 KB conda-forge 2025-05-07T19:44:24.9190658Z sysroot_linux-64-2.17 | h0157908_18 14.5 MB conda-forge 2025-05-07T19:44:24.9191131Z ------------------------------------------------------------ 2025-05-07T19:44:24.9191507Z Total: 15.4 MB 2025-05-07T19:44:24.9191976Z 2025-05-07T19:44:24.9192099Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:24.9192322Z 2025-05-07T19:44:24.9192621Z kernel-headers_li~ conda-forge/noarch::kernel-headers_linux-64-3.10.0-he073ed8_18 2025-05-07T19:44:24.9193206Z sysroot_linux-64 conda-forge/noarch::sysroot_linux-64-2.17-h0157908_18 2025-05-07T19:44:24.9193853Z 2025-05-07T19:44:24.9193856Z 2025-05-07T19:44:24.9193880Z 2025-05-07T19:44:24.9194024Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:24.9194401Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:24.9194651Z 2025-05-07T19:44:25.1225029Z kernel-headers_linux | 921 KB | | 0%  2025-05-07T19:44:25.1961021Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:25.1961351Z 2025-05-07T19:44:25.2233171Z kernel-headers_linux | 921 KB | 1 | 2%  2025-05-07T19:44:25.2836710Z sysroot_linux-64-2.1 | 14.5 MB | #####3 | 53% 2025-05-07T19:44:25.2837288Z 2025-05-07T19:44:25.3278191Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:25.3982438Z sysroot_linux-64-2.1 | 14.5 MB | #########5 | 96% 2025-05-07T19:44:25.5071234Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:25.5071586Z 2025-05-07T19:44:25.5073434Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:25.5073730Z 2025-05-07T19:44:25.9086584Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:25.9087279Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:25.9087666Z 2025-05-07T19:44:25.9087882Z 2025-05-07T19:44:25.9088532Z  done 2025-05-07T19:44:26.0099243Z Preparing transaction: - done 2025-05-07T19:44:26.2108816Z Verifying transaction: | / done 2025-05-07T19:44:26.3118196Z Executing transaction: \ done 2025-05-07T19:44:26.3972174Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:44:28.0331458Z [CHECK] CONDA_PREFIX is not set. 2025-05-07T19:44:28.0332187Z [CHECK] libstdc++.so.6 found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libstdc++.so.6 2025-05-07T19:44:28.0348117Z [INSTALL] Installing GCC (11.4.0, 64) through Conda ... 2025-05-07T19:44:28.0382788Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y gxx_linux-64=11.4.0 2025-05-07T19:44:28.7229249Z Channels: 2025-05-07T19:44:28.7229631Z - conda-forge 2025-05-07T19:44:28.7229906Z Platform: linux-64 2025-05-07T19:44:31.8725110Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:33.0056311Z Solving environment: \ | / done 2025-05-07T19:44:33.0582796Z 2025-05-07T19:44:33.0583746Z ## Package Plan ## 2025-05-07T19:44:33.0584004Z 2025-05-07T19:44:33.0584244Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:33.0584582Z 2025-05-07T19:44:33.0584703Z added / updated specs: 2025-05-07T19:44:33.0584992Z - gxx_linux-64=11.4.0 2025-05-07T19:44:33.0585171Z 2025-05-07T19:44:33.0585176Z 2025-05-07T19:44:33.0585313Z The following packages will be downloaded: 2025-05-07T19:44:33.0585546Z 2025-05-07T19:44:33.0585704Z package | build 2025-05-07T19:44:33.0586045Z ---------------------------|----------------- 2025-05-07T19:44:33.0586499Z binutils_impl_linux-64-2.40| ha1999f0_7 6.0 MB conda-forge 2025-05-07T19:44:33.0586997Z binutils_linux-64-2.40 | hb3c18ed_4 28 KB conda-forge 2025-05-07T19:44:33.0587492Z gcc_impl_linux-64-11.4.0 | h00c12a0_13 53.0 MB conda-forge 2025-05-07T19:44:33.0587947Z gcc_linux-64-11.4.0 | ha077dfb_4 31 KB conda-forge 2025-05-07T19:44:33.0588418Z gxx_impl_linux-64-11.4.0 | h634f3ee_13 11.2 MB conda-forge 2025-05-07T19:44:33.0588870Z gxx_linux-64-11.4.0 | h35bfe5d_4 29 KB conda-forge 2025-05-07T19:44:33.0589332Z ld_impl_linux-64-2.40 | hf3520f5_7 691 KB conda-forge 2025-05-07T19:44:33.0589828Z libgcc-devel_linux-64-11.4.0| h8f596e0_113 2.3 MB conda-forge 2025-05-07T19:44:33.0590330Z libsanitizer-11.4.0 | h5763a12_13 3.5 MB conda-forge 2025-05-07T19:44:33.0590797Z libstdcxx-15.1.0 | h8f9b012_2 3.7 MB conda-forge 2025-05-07T19:44:33.0591627Z libstdcxx-devel_linux-64-11.4.0| h8f596e0_113 11.1 MB conda-forge 2025-05-07T19:44:33.0592145Z libstdcxx-ng-15.1.0 | h4852527_2 34 KB conda-forge 2025-05-07T19:44:33.0592578Z ------------------------------------------------------------ 2025-05-07T19:44:33.0592921Z Total: 91.6 MB 2025-05-07T19:44:33.0593138Z 2025-05-07T19:44:33.0593281Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:33.0593510Z 2025-05-07T19:44:33.0593821Z binutils_impl_lin~ conda-forge/linux-64::binutils_impl_linux-64-2.40-ha1999f0_7 2025-05-07T19:44:33.0594431Z binutils_linux-64 conda-forge/linux-64::binutils_linux-64-2.40-hb3c18ed_4 2025-05-07T19:44:33.0595171Z gcc_impl_linux-64 conda-forge/linux-64::gcc_impl_linux-64-11.4.0-h00c12a0_13 2025-05-07T19:44:33.0595738Z gcc_linux-64 conda-forge/linux-64::gcc_linux-64-11.4.0-ha077dfb_4 2025-05-07T19:44:33.0596292Z gxx_impl_linux-64 conda-forge/linux-64::gxx_impl_linux-64-11.4.0-h634f3ee_13 2025-05-07T19:44:33.0596816Z gxx_linux-64 conda-forge/linux-64::gxx_linux-64-11.4.0-h35bfe5d_4 2025-05-07T19:44:33.0597383Z libgcc-devel_linu~ conda-forge/noarch::libgcc-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:33.0597985Z libsanitizer conda-forge/linux-64::libsanitizer-11.4.0-h5763a12_13 2025-05-07T19:44:33.0598556Z libstdcxx conda-forge/linux-64::libstdcxx-15.1.0-h8f9b012_2 2025-05-07T19:44:33.0599177Z libstdcxx-devel_l~ conda-forge/noarch::libstdcxx-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:33.0599574Z 2025-05-07T19:44:33.0599745Z The following packages will be UPDATED: 2025-05-07T19:44:33.0599978Z 2025-05-07T19:44:33.0600325Z ld_impl_linux-64 pkgs/main::ld_impl_linux-64-2.40-h12e~ --> conda-forge::ld_impl_linux-64-2.40-hf3520f5_7 2025-05-07T19:44:33.0601153Z libstdcxx-ng pkgs/main::libstdcxx-ng-11.2.0-h12345~ --> conda-forge::libstdcxx-ng-15.1.0-h4852527_2 2025-05-07T19:44:33.0601611Z 2025-05-07T19:44:33.0601615Z 2025-05-07T19:44:33.0601619Z 2025-05-07T19:44:33.0601816Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:33.0602234Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:33.0602519Z 2025-05-07T19:44:33.0602961Z gxx_impl_linux-64-11 | 11.2 MB | | 0%  2025-05-07T19:44:33.0603228Z 2025-05-07T19:44:33.0603232Z 2025-05-07T19:44:33.0603476Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:33.0603793Z 2025-05-07T19:44:33.0603798Z 2025-05-07T19:44:33.0603809Z 2025-05-07T19:44:33.0604058Z binutils_impl_linux- | 6.0 MB | | 0%  2025-05-07T19:44:33.0604382Z 2025-05-07T19:44:33.0604385Z 2025-05-07T19:44:33.0604389Z 2025-05-07T19:44:33.0609159Z 2025-05-07T19:44:33.0616239Z libstdcxx-15.1.0 | 3.7 MB | | 0%  2025-05-07T19:44:33.0616594Z 2025-05-07T19:44:33.0616644Z 2025-05-07T19:44:33.0616676Z 2025-05-07T19:44:33.0616691Z 2025-05-07T19:44:33.0626055Z 2025-05-07T19:44:33.0627203Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:33.0627514Z 2025-05-07T19:44:33.0627527Z 2025-05-07T19:44:33.0627530Z 2025-05-07T19:44:33.0627534Z 2025-05-07T19:44:33.0627571Z 2025-05-07T19:44:33.0627574Z 2025-05-07T19:44:33.0630475Z libgcc-devel_linux-6 | 2.3 MB | | 0%  2025-05-07T19:44:33.0630791Z 2025-05-07T19:44:33.0630795Z 2025-05-07T19:44:33.0630798Z 2025-05-07T19:44:33.0630807Z 2025-05-07T19:44:33.0630811Z 2025-05-07T19:44:33.0630814Z 2025-05-07T19:44:33.0630848Z 2025-05-07T19:44:33.0631586Z ld_impl_linux-64-2.4 | 691 KB | | 0%  2025-05-07T19:44:33.0631886Z 2025-05-07T19:44:33.0631897Z 2025-05-07T19:44:33.0631901Z 2025-05-07T19:44:33.0631904Z 2025-05-07T19:44:33.0631914Z 2025-05-07T19:44:33.0631918Z 2025-05-07T19:44:33.0631922Z 2025-05-07T19:44:33.0633392Z 2025-05-07T19:44:33.0633697Z libstdcxx-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:33.0634011Z 2025-05-07T19:44:33.0634015Z 2025-05-07T19:44:33.0634020Z 2025-05-07T19:44:33.0634025Z 2025-05-07T19:44:33.0634029Z 2025-05-07T19:44:33.0634032Z 2025-05-07T19:44:33.0634036Z 2025-05-07T19:44:33.0634039Z 2025-05-07T19:44:33.0634074Z 2025-05-07T19:44:33.0634345Z gcc_linux-64-11.4.0 | 31 KB | | 0%  2025-05-07T19:44:33.0634647Z 2025-05-07T19:44:33.0634650Z 2025-05-07T19:44:33.0634654Z 2025-05-07T19:44:33.0634657Z 2025-05-07T19:44:33.0634660Z 2025-05-07T19:44:33.0634664Z 2025-05-07T19:44:33.0634668Z 2025-05-07T19:44:33.0634701Z 2025-05-07T19:44:33.0634704Z 2025-05-07T19:44:33.0634708Z 2025-05-07T19:44:33.0634982Z gxx_linux-64-11.4.0 | 29 KB | | 0%  2025-05-07T19:44:33.0635394Z 2025-05-07T19:44:33.0635399Z 2025-05-07T19:44:33.0635411Z 2025-05-07T19:44:33.0635415Z 2025-05-07T19:44:33.0635425Z 2025-05-07T19:44:33.0635455Z 2025-05-07T19:44:33.0635459Z 2025-05-07T19:44:33.0635462Z 2025-05-07T19:44:33.0635466Z 2025-05-07T19:44:33.0635469Z 2025-05-07T19:44:33.0635472Z 2025-05-07T19:44:33.2269708Z binutils_linux-64-2. | 28 KB | | 0%  2025-05-07T19:44:33.2270051Z 2025-05-07T19:44:33.2270056Z 2025-05-07T19:44:33.2270061Z 2025-05-07T19:44:33.2323899Z 2025-05-07T19:44:33.3668003Z libstdcxx-15.1.0 | 3.7 MB | 3 | 3%  2025-05-07T19:44:33.3668367Z 2025-05-07T19:44:33.3668374Z 2025-05-07T19:44:33.3668380Z 2025-05-07T19:44:33.3668384Z 2025-05-07T19:44:33.4162101Z libstdcxx-15.1.0 | 3.7 MB | 5 | 5%  2025-05-07T19:44:33.4162519Z 2025-05-07T19:44:33.4162524Z 2025-05-07T19:44:33.4162529Z 2025-05-07T19:44:33.4364759Z binutils_impl_linux- | 6.0 MB | | 0%  2025-05-07T19:44:33.4365102Z 2025-05-07T19:44:33.4365106Z 2025-05-07T19:44:33.4443506Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:33.4452188Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:33.4452509Z 2025-05-07T19:44:33.4683806Z gxx_impl_linux-64-11 | 11.2 MB | | 0%  2025-05-07T19:44:33.4684642Z 2025-05-07T19:44:33.4684656Z 2025-05-07T19:44:33.4684668Z 2025-05-07T19:44:33.4684681Z 2025-05-07T19:44:33.4685495Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:33.4685777Z 2025-05-07T19:44:33.4685783Z 2025-05-07T19:44:33.4685788Z 2025-05-07T19:44:33.4685792Z 2025-05-07T19:44:33.4964186Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:33.4965106Z 2025-05-07T19:44:33.4965123Z 2025-05-07T19:44:33.4965138Z 2025-05-07T19:44:33.5172591Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:33.5172954Z 2025-05-07T19:44:33.5173006Z 2025-05-07T19:44:33.5173011Z 2025-05-07T19:44:33.5173017Z 2025-05-07T19:44:33.5173022Z 2025-05-07T19:44:33.5323852Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:33.5324441Z 2025-05-07T19:44:33.5324463Z 2025-05-07T19:44:33.5324468Z 2025-05-07T19:44:33.5324472Z 2025-05-07T19:44:33.5324477Z 2025-05-07T19:44:33.5324481Z 2025-05-07T19:44:33.5400649Z libgcc-devel_linux-6 | 2.3 MB | | 1%  2025-05-07T19:44:33.5401683Z 2025-05-07T19:44:33.5401697Z 2025-05-07T19:44:33.5453042Z libstdcxx-devel_linu | 11.1 MB | ######2 | 63%  2025-05-07T19:44:33.5454298Z gcc_impl_linux-64-11 | 53.0 MB | #6 | 17% 2025-05-07T19:44:33.5454710Z 2025-05-07T19:44:33.6024130Z gxx_impl_linux-64-11 | 11.2 MB | ####6 | 47%  2025-05-07T19:44:33.6024447Z 2025-05-07T19:44:33.6024451Z 2025-05-07T19:44:33.6024457Z 2025-05-07T19:44:33.6024461Z 2025-05-07T19:44:33.6024482Z 2025-05-07T19:44:33.6024486Z 2025-05-07T19:44:33.6244542Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:33.6245364Z 2025-05-07T19:44:33.6245369Z 2025-05-07T19:44:33.6245372Z 2025-05-07T19:44:33.6245376Z 2025-05-07T19:44:33.6245380Z 2025-05-07T19:44:33.6245698Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:33.6245998Z 2025-05-07T19:44:33.6246001Z 2025-05-07T19:44:33.6246005Z 2025-05-07T19:44:33.6246008Z 2025-05-07T19:44:33.6246012Z 2025-05-07T19:44:33.6375717Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:33.6376098Z 2025-05-07T19:44:33.6376121Z 2025-05-07T19:44:33.6376128Z 2025-05-07T19:44:33.6376135Z 2025-05-07T19:44:33.6461318Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:33.6628246Z gcc_impl_linux-64-11 | 53.0 MB | ##9 | 30% 2025-05-07T19:44:33.6629061Z 2025-05-07T19:44:33.6629075Z 2025-05-07T19:44:33.6629086Z 2025-05-07T19:44:33.6629097Z 2025-05-07T19:44:33.6629108Z 2025-05-07T19:44:33.6629624Z 2025-05-07T19:44:33.6629638Z 2025-05-07T19:44:33.6764097Z ld_impl_linux-64-2.4 | 691 KB | 2 | 2%  2025-05-07T19:44:33.6764640Z 2025-05-07T19:44:33.6764663Z 2025-05-07T19:44:33.6764667Z 2025-05-07T19:44:33.6764671Z 2025-05-07T19:44:33.6764674Z 2025-05-07T19:44:33.6764677Z 2025-05-07T19:44:33.6764681Z 2025-05-07T19:44:33.6764684Z 2025-05-07T19:44:33.6774744Z libstdcxx-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:33.6775103Z 2025-05-07T19:44:33.6775126Z 2025-05-07T19:44:33.6775130Z 2025-05-07T19:44:33.6775134Z 2025-05-07T19:44:33.6775137Z 2025-05-07T19:44:33.6775141Z 2025-05-07T19:44:33.6775145Z 2025-05-07T19:44:33.6775149Z 2025-05-07T19:44:33.6808388Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:33.6809365Z 2025-05-07T19:44:33.6809401Z 2025-05-07T19:44:33.6809413Z 2025-05-07T19:44:33.6809423Z 2025-05-07T19:44:33.6809434Z 2025-05-07T19:44:33.6809477Z 2025-05-07T19:44:33.6809488Z 2025-05-07T19:44:33.7062695Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:33.7063068Z 2025-05-07T19:44:33.7063072Z 2025-05-07T19:44:33.7063093Z 2025-05-07T19:44:33.7063097Z 2025-05-07T19:44:33.7063100Z 2025-05-07T19:44:33.7063104Z 2025-05-07T19:44:33.7063384Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:33.7063691Z 2025-05-07T19:44:33.7063694Z 2025-05-07T19:44:33.7063698Z 2025-05-07T19:44:33.7063702Z 2025-05-07T19:44:33.7063705Z 2025-05-07T19:44:33.7063710Z 2025-05-07T19:44:33.7140888Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:33.7141874Z 2025-05-07T19:44:33.7141889Z 2025-05-07T19:44:33.7141900Z 2025-05-07T19:44:33.7141912Z 2025-05-07T19:44:33.7141922Z 2025-05-07T19:44:33.7141932Z 2025-05-07T19:44:33.7141942Z 2025-05-07T19:44:33.7141953Z 2025-05-07T19:44:33.7141983Z 2025-05-07T19:44:33.7141995Z 2025-05-07T19:44:33.7157267Z gxx_linux-64-11.4.0 | 29 KB | #####5 | 55%  2025-05-07T19:44:33.7158245Z 2025-05-07T19:44:33.7158259Z 2025-05-07T19:44:33.7158270Z 2025-05-07T19:44:33.7158281Z 2025-05-07T19:44:33.7158292Z 2025-05-07T19:44:33.7158324Z 2025-05-07T19:44:33.7158335Z 2025-05-07T19:44:33.7158345Z 2025-05-07T19:44:33.7158356Z 2025-05-07T19:44:33.7158367Z 2025-05-07T19:44:33.7188428Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:33.7189385Z 2025-05-07T19:44:33.7189400Z 2025-05-07T19:44:33.7190114Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:33.7190920Z 2025-05-07T19:44:33.7190931Z 2025-05-07T19:44:33.7252403Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:33.7253302Z 2025-05-07T19:44:33.7253945Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:33.7254353Z 2025-05-07T19:44:33.7266099Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:33.7266865Z 2025-05-07T19:44:33.7266876Z 2025-05-07T19:44:33.7267310Z 2025-05-07T19:44:33.7267321Z 2025-05-07T19:44:33.7267332Z 2025-05-07T19:44:33.7267342Z 2025-05-07T19:44:33.7267352Z 2025-05-07T19:44:33.7267362Z 2025-05-07T19:44:33.7268771Z 2025-05-07T19:44:33.7275869Z gcc_linux-64-11.4.0 | 31 KB | #####2 | 52%  2025-05-07T19:44:33.7276841Z 2025-05-07T19:44:33.7276845Z 2025-05-07T19:44:33.7276849Z 2025-05-07T19:44:33.7276854Z 2025-05-07T19:44:33.7276857Z 2025-05-07T19:44:33.7276861Z 2025-05-07T19:44:33.7276864Z 2025-05-07T19:44:33.7276867Z 2025-05-07T19:44:33.7276871Z 2025-05-07T19:44:33.7349023Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:33.7349969Z 2025-05-07T19:44:33.7349984Z 2025-05-07T19:44:33.7349995Z 2025-05-07T19:44:33.7350006Z 2025-05-07T19:44:33.7350016Z 2025-05-07T19:44:33.7350026Z 2025-05-07T19:44:33.7350036Z 2025-05-07T19:44:33.7350046Z 2025-05-07T19:44:33.7519536Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:33.7520560Z 2025-05-07T19:44:33.7520575Z 2025-05-07T19:44:33.7520586Z 2025-05-07T19:44:33.7520596Z 2025-05-07T19:44:33.7520607Z 2025-05-07T19:44:33.7520616Z 2025-05-07T19:44:33.7520651Z 2025-05-07T19:44:33.7520661Z 2025-05-07T19:44:33.7520671Z 2025-05-07T19:44:33.7520681Z 2025-05-07T19:44:33.7520692Z 2025-05-07T19:44:33.7521555Z binutils_linux-64-2. | 28 KB | #####6 | 56%  2025-05-07T19:44:33.7522848Z 2025-05-07T19:44:33.7522859Z 2025-05-07T19:44:33.7522869Z 2025-05-07T19:44:33.7522880Z 2025-05-07T19:44:33.7522890Z 2025-05-07T19:44:33.7522925Z 2025-05-07T19:44:33.7522935Z 2025-05-07T19:44:33.7522945Z 2025-05-07T19:44:33.7522955Z 2025-05-07T19:44:33.7522964Z 2025-05-07T19:44:33.7522990Z 2025-05-07T19:44:33.7613242Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:33.7614241Z 2025-05-07T19:44:33.7614313Z 2025-05-07T19:44:33.7614326Z 2025-05-07T19:44:33.7614336Z 2025-05-07T19:44:33.7614361Z 2025-05-07T19:44:33.7614373Z 2025-05-07T19:44:33.7614383Z 2025-05-07T19:44:33.7830362Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:33.7831305Z 2025-05-07T19:44:33.7831344Z 2025-05-07T19:44:33.7831355Z 2025-05-07T19:44:33.7831367Z 2025-05-07T19:44:33.7831377Z 2025-05-07T19:44:33.7831387Z 2025-05-07T19:44:33.7831397Z 2025-05-07T19:44:33.7831408Z 2025-05-07T19:44:33.7831418Z 2025-05-07T19:44:33.7831429Z 2025-05-07T19:44:33.7846229Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:33.7847222Z 2025-05-07T19:44:33.7847238Z 2025-05-07T19:44:33.7847249Z 2025-05-07T19:44:33.7847259Z 2025-05-07T19:44:33.7847270Z 2025-05-07T19:44:33.8548975Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:33.8549942Z 2025-05-07T19:44:33.8549957Z 2025-05-07T19:44:33.8550036Z 2025-05-07T19:44:33.8550753Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:33.8551588Z 2025-05-07T19:44:33.8551599Z 2025-05-07T19:44:33.8551609Z 2025-05-07T19:44:33.8783834Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:33.8784505Z 2025-05-07T19:44:33.8784510Z 2025-05-07T19:44:33.8784514Z 2025-05-07T19:44:33.8784518Z 2025-05-07T19:44:33.8784521Z 2025-05-07T19:44:33.8784525Z 2025-05-07T19:44:33.8784528Z 2025-05-07T19:44:33.8784532Z 2025-05-07T19:44:33.8784535Z 2025-05-07T19:44:33.8784830Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:33.8785125Z 2025-05-07T19:44:33.8785129Z 2025-05-07T19:44:33.8785133Z 2025-05-07T19:44:33.8785137Z 2025-05-07T19:44:33.8785140Z 2025-05-07T19:44:33.8785144Z 2025-05-07T19:44:33.8785147Z 2025-05-07T19:44:33.8785151Z 2025-05-07T19:44:33.8785155Z 2025-05-07T19:44:33.8982533Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:33.8982886Z 2025-05-07T19:44:33.8982890Z 2025-05-07T19:44:33.8983173Z 2025-05-07T19:44:33.8983177Z 2025-05-07T19:44:33.8983181Z 2025-05-07T19:44:33.8983184Z 2025-05-07T19:44:33.8983188Z 2025-05-07T19:44:33.8983191Z 2025-05-07T19:44:33.8983195Z 2025-05-07T19:44:33.8983198Z 2025-05-07T19:44:33.8983201Z 2025-05-07T19:44:33.8983544Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:33.8983862Z 2025-05-07T19:44:33.8983866Z 2025-05-07T19:44:33.8983869Z 2025-05-07T19:44:33.8983873Z 2025-05-07T19:44:33.8983876Z 2025-05-07T19:44:33.8983879Z 2025-05-07T19:44:33.8983883Z 2025-05-07T19:44:33.8983886Z 2025-05-07T19:44:33.8983890Z 2025-05-07T19:44:33.8983893Z 2025-05-07T19:44:33.8983928Z 2025-05-07T19:44:33.8993896Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:33.9879854Z gcc_impl_linux-64-11 | 53.0 MB | #### | 41% 2025-05-07T19:44:33.9881173Z 2025-05-07T19:44:33.9998607Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:34.1458559Z gcc_impl_linux-64-11 | 53.0 MB | #####2 | 53% 2025-05-07T19:44:34.1494099Z gcc_impl_linux-64-11 | 53.0 MB | ######2 | 62% 2025-05-07T19:44:34.1494729Z 2025-05-07T19:44:34.1494748Z 2025-05-07T19:44:34.2501803Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:34.3504083Z gcc_impl_linux-64-11 | 53.0 MB | ########5 | 86% 2025-05-07T19:44:34.4812175Z gcc_impl_linux-64-11 | 53.0 MB | #########9 | 100% 2025-05-07T19:44:35.0226700Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:35.0230881Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:35.0231932Z 2025-05-07T19:44:35.0232545Z 2025-05-07T19:44:35.0233272Z  2025-05-07T19:44:35.0233883Z 2025-05-07T19:44:35.0233896Z 2025-05-07T19:44:35.0234422Z  2025-05-07T19:44:35.0235092Z 2025-05-07T19:44:35.0235129Z 2025-05-07T19:44:35.0235141Z 2025-05-07T19:44:35.0235631Z  2025-05-07T19:44:35.0236254Z 2025-05-07T19:44:35.0236265Z 2025-05-07T19:44:35.0236276Z 2025-05-07T19:44:35.0236286Z 2025-05-07T19:44:35.0236807Z  2025-05-07T19:44:35.0237445Z 2025-05-07T19:44:35.0237456Z 2025-05-07T19:44:35.0237466Z 2025-05-07T19:44:35.0237605Z 2025-05-07T19:44:35.0237610Z 2025-05-07T19:44:35.0237796Z  2025-05-07T19:44:35.0238039Z 2025-05-07T19:44:35.0238042Z 2025-05-07T19:44:35.0238046Z 2025-05-07T19:44:35.0238049Z 2025-05-07T19:44:35.0238052Z 2025-05-07T19:44:35.0238056Z 2025-05-07T19:44:35.0238282Z  2025-05-07T19:44:35.0238525Z 2025-05-07T19:44:35.0238537Z 2025-05-07T19:44:35.0238541Z 2025-05-07T19:44:35.0238544Z 2025-05-07T19:44:35.0238551Z 2025-05-07T19:44:35.0238556Z 2025-05-07T19:44:35.0238559Z 2025-05-07T19:44:35.0238746Z  2025-05-07T19:44:35.0238989Z 2025-05-07T19:44:35.0238992Z 2025-05-07T19:44:35.0238996Z 2025-05-07T19:44:35.0238999Z 2025-05-07T19:44:35.0239003Z 2025-05-07T19:44:35.0239006Z 2025-05-07T19:44:35.0239009Z 2025-05-07T19:44:35.0239012Z 2025-05-07T19:44:35.0239204Z  2025-05-07T19:44:35.0239452Z 2025-05-07T19:44:35.0239456Z 2025-05-07T19:44:35.0239460Z 2025-05-07T19:44:35.0239463Z 2025-05-07T19:44:35.0239466Z 2025-05-07T19:44:35.0239470Z 2025-05-07T19:44:35.0239473Z 2025-05-07T19:44:35.0239477Z 2025-05-07T19:44:35.0239480Z 2025-05-07T19:44:35.0239681Z  2025-05-07T19:44:35.0239918Z 2025-05-07T19:44:35.0239940Z 2025-05-07T19:44:35.0239943Z 2025-05-07T19:44:35.0239947Z 2025-05-07T19:44:35.0240220Z 2025-05-07T19:44:35.0240224Z 2025-05-07T19:44:35.0240228Z 2025-05-07T19:44:35.0240232Z 2025-05-07T19:44:35.0240235Z 2025-05-07T19:44:35.0240239Z 2025-05-07T19:44:35.0240444Z  2025-05-07T19:44:35.0240802Z 2025-05-07T19:44:35.0240826Z 2025-05-07T19:44:35.0240830Z 2025-05-07T19:44:35.0240833Z 2025-05-07T19:44:35.0240837Z 2025-05-07T19:44:35.0240840Z 2025-05-07T19:44:35.0240843Z 2025-05-07T19:44:35.0240847Z 2025-05-07T19:44:35.0240850Z 2025-05-07T19:44:35.0240853Z 2025-05-07T19:44:35.0240857Z 2025-05-07T19:44:35.0241069Z  done 2025-05-07T19:44:35.1244952Z Preparing transaction: \ done 2025-05-07T19:44:35.8263214Z Verifying transaction: / - \ | / - \ done 2025-05-07T19:44:35.9275590Z Executing transaction: / done 2025-05-07T19:44:36.0200362Z [INSTALL] Setting the C/C++ compiler symlinks ... 2025-05-07T19:44:39.7654868Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:39.7655539Z 2025-05-07T19:44:39.7672271Z 2025-05-07T19:44:39.7694938Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:39.7695630Z 2025-05-07T19:44:39.7709801Z 2025-05-07T19:44:39.7729310Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:39.7729946Z 2025-05-07T19:44:39.7742893Z 2025-05-07T19:44:39.7763327Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:39.7763975Z 2025-05-07T19:44:39.7782099Z 2025-05-07T19:44:39.7794017Z [INSTALL] Installing Clang (16.0.6, 64) and relevant libraries through Conda ... 2025-05-07T19:44:39.7820220Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y clangxx=16.0.6 libcxx llvm-openmp=16.0.6 compiler-rt=16.0.6 2025-05-07T19:44:40.4968966Z Channels: 2025-05-07T19:44:40.4969628Z - conda-forge 2025-05-07T19:44:40.4970313Z Platform: linux-64 2025-05-07T19:44:43.5717035Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:44.9149256Z Solving environment: \ | / done 2025-05-07T19:44:44.9697022Z 2025-05-07T19:44:44.9697796Z ## Package Plan ## 2025-05-07T19:44:44.9698256Z 2025-05-07T19:44:44.9698978Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:44.9700115Z 2025-05-07T19:44:44.9700397Z added / updated specs: 2025-05-07T19:44:44.9701159Z - clangxx=16.0.6 2025-05-07T19:44:44.9701832Z - compiler-rt=16.0.6 2025-05-07T19:44:44.9702532Z - libcxx 2025-05-07T19:44:44.9703120Z - llvm-openmp=16.0.6 2025-05-07T19:44:44.9703351Z 2025-05-07T19:44:44.9703356Z 2025-05-07T19:44:44.9703487Z The following packages will be downloaded: 2025-05-07T19:44:44.9703742Z 2025-05-07T19:44:44.9703889Z package | build 2025-05-07T19:44:44.9704232Z ---------------------------|----------------- 2025-05-07T19:44:44.9704787Z clang-16.0.6 |default_h9e3a008_14 110 KB conda-forge 2025-05-07T19:44:44.9705256Z clang-16-16.0.6 |default_hb5137d0_14 780 KB conda-forge 2025-05-07T19:44:44.9705751Z clangxx-16.0.6 |default_ha78316a_14 110 KB conda-forge 2025-05-07T19:44:44.9706247Z compiler-rt-16.0.6 | h00ab1b0_2 107 KB conda-forge 2025-05-07T19:44:44.9706923Z compiler-rt_linux-64-16.0.6| h00ab1b0_2 36.0 MB conda-forge 2025-05-07T19:44:44.9707459Z icu-73.2 | h59595ed_0 11.5 MB conda-forge 2025-05-07T19:44:44.9707947Z libclang-cpp16-16.0.6 |default_hb5137d0_14 17.3 MB conda-forge 2025-05-07T19:44:44.9708471Z libcxx-19.1.7 | h2713693_1 1000 KB conda-forge 2025-05-07T19:44:44.9709212Z libcxxabi-19.1.7 | hd85fd95_1 158 KB conda-forge 2025-05-07T19:44:44.9709703Z libiconv-1.18 | h4ce23a2_1 696 KB conda-forge 2025-05-07T19:44:44.9710187Z libllvm16-16.0.6 | hb3ce162_3 33.7 MB conda-forge 2025-05-07T19:44:44.9710640Z libxml2-2.12.7 | hc051c1a_1 688 KB conda-forge 2025-05-07T19:44:44.9711113Z libzlib-1.2.13 | h4ab18f5_6 60 KB conda-forge 2025-05-07T19:44:44.9711574Z llvm-openmp-16.0.6 | h4dfa4b3_0 39.9 MB conda-forge 2025-05-07T19:44:44.9712048Z zlib-1.2.13 | h4ab18f5_6 91 KB conda-forge 2025-05-07T19:44:44.9712490Z zstd-1.5.6 | ha6fb4c9_0 542 KB conda-forge 2025-05-07T19:44:44.9713031Z ------------------------------------------------------------ 2025-05-07T19:44:44.9713564Z Total: 142.6 MB 2025-05-07T19:44:44.9713794Z 2025-05-07T19:44:44.9714116Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:44.9714387Z 2025-05-07T19:44:44.9714645Z clang conda-forge/linux-64::clang-16.0.6-default_h9e3a008_14 2025-05-07T19:44:44.9715169Z clang-16 conda-forge/linux-64::clang-16-16.0.6-default_hb5137d0_14 2025-05-07T19:44:44.9715768Z clangxx conda-forge/linux-64::clangxx-16.0.6-default_ha78316a_14 2025-05-07T19:44:44.9716330Z compiler-rt conda-forge/linux-64::compiler-rt-16.0.6-h00ab1b0_2 2025-05-07T19:44:44.9716907Z compiler-rt_linux~ conda-forge/noarch::compiler-rt_linux-64-16.0.6-h00ab1b0_2 2025-05-07T19:44:44.9717449Z icu conda-forge/linux-64::icu-73.2-h59595ed_0 2025-05-07T19:44:44.9718006Z libclang-cpp16 conda-forge/linux-64::libclang-cpp16-16.0.6-default_hb5137d0_14 2025-05-07T19:44:44.9718569Z libcxx conda-forge/linux-64::libcxx-19.1.7-h2713693_1 2025-05-07T19:44:44.9719085Z libcxxabi conda-forge/linux-64::libcxxabi-19.1.7-hd85fd95_1 2025-05-07T19:44:44.9719577Z libiconv conda-forge/linux-64::libiconv-1.18-h4ce23a2_1 2025-05-07T19:44:44.9720092Z libllvm16 conda-forge/linux-64::libllvm16-16.0.6-hb3ce162_3 2025-05-07T19:44:44.9720604Z libxml2 conda-forge/linux-64::libxml2-2.12.7-hc051c1a_1 2025-05-07T19:44:44.9721069Z libzlib conda-forge/linux-64::libzlib-1.2.13-h4ab18f5_6 2025-05-07T19:44:44.9721595Z llvm-openmp conda-forge/linux-64::llvm-openmp-16.0.6-h4dfa4b3_0 2025-05-07T19:44:44.9722407Z zstd conda-forge/linux-64::zstd-1.5.6-ha6fb4c9_0 2025-05-07T19:44:44.9722706Z 2025-05-07T19:44:44.9722833Z The following packages will be UPDATED: 2025-05-07T19:44:44.9723056Z 2025-05-07T19:44:44.9723352Z zlib pkgs/main::zlib-1.2.13-h5eee18b_1 --> conda-forge::zlib-1.2.13-h4ab18f5_6 2025-05-07T19:44:44.9723712Z 2025-05-07T19:44:44.9723725Z 2025-05-07T19:44:44.9723729Z 2025-05-07T19:44:44.9723884Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:44.9724320Z llvm-openmp-16.0.6 | 39.9 MB | | 0% 2025-05-07T19:44:44.9724575Z 2025-05-07T19:44:44.9724905Z compiler-rt_linux-64 | 36.0 MB | | 0%  2025-05-07T19:44:44.9725195Z 2025-05-07T19:44:44.9725199Z 2025-05-07T19:44:44.9725430Z libllvm16-16.0.6 | 33.7 MB | | 0%  2025-05-07T19:44:44.9725697Z 2025-05-07T19:44:44.9725701Z 2025-05-07T19:44:44.9725752Z 2025-05-07T19:44:44.9725999Z libclang-cpp16-16.0. | 17.3 MB | | 0%  2025-05-07T19:44:44.9726280Z 2025-05-07T19:44:44.9726284Z 2025-05-07T19:44:44.9726287Z 2025-05-07T19:44:44.9726290Z 2025-05-07T19:44:44.9742703Z icu-73.2 | 11.5 MB | | 0%  2025-05-07T19:44:44.9743521Z 2025-05-07T19:44:44.9743571Z 2025-05-07T19:44:44.9743584Z 2025-05-07T19:44:44.9743594Z 2025-05-07T19:44:44.9744004Z 2025-05-07T19:44:44.9744730Z libcxx-19.1.7 | 1000 KB | | 0%  2025-05-07T19:44:44.9745564Z 2025-05-07T19:44:44.9745575Z 2025-05-07T19:44:44.9745586Z 2025-05-07T19:44:44.9745596Z 2025-05-07T19:44:44.9745606Z 2025-05-07T19:44:44.9745616Z 2025-05-07T19:44:44.9746343Z clang-16-16.0.6 | 780 KB | | 0%  2025-05-07T19:44:44.9747173Z 2025-05-07T19:44:44.9747185Z 2025-05-07T19:44:44.9747196Z 2025-05-07T19:44:44.9747207Z 2025-05-07T19:44:44.9747217Z 2025-05-07T19:44:44.9747227Z 2025-05-07T19:44:44.9747237Z 2025-05-07T19:44:44.9747924Z libiconv-1.18 | 696 KB | | 0%  2025-05-07T19:44:44.9748771Z 2025-05-07T19:44:44.9748781Z 2025-05-07T19:44:44.9748791Z 2025-05-07T19:44:44.9748801Z 2025-05-07T19:44:44.9748811Z 2025-05-07T19:44:44.9748822Z 2025-05-07T19:44:44.9748832Z 2025-05-07T19:44:44.9749059Z 2025-05-07T19:44:44.9749812Z libxml2-2.12.7 | 688 KB | | 0%  2025-05-07T19:44:44.9750678Z 2025-05-07T19:44:44.9750689Z 2025-05-07T19:44:44.9750699Z 2025-05-07T19:44:44.9750709Z 2025-05-07T19:44:44.9750719Z 2025-05-07T19:44:44.9750729Z 2025-05-07T19:44:44.9750739Z 2025-05-07T19:44:44.9750749Z 2025-05-07T19:44:44.9750759Z 2025-05-07T19:44:44.9751441Z zstd-1.5.6 | 542 KB | | 0%  2025-05-07T19:44:44.9752241Z 2025-05-07T19:44:44.9752252Z 2025-05-07T19:44:44.9752262Z 2025-05-07T19:44:44.9752273Z 2025-05-07T19:44:44.9752284Z 2025-05-07T19:44:44.9752294Z 2025-05-07T19:44:44.9752305Z 2025-05-07T19:44:44.9752315Z 2025-05-07T19:44:44.9752325Z 2025-05-07T19:44:44.9752336Z 2025-05-07T19:44:44.9753694Z libcxxabi-19.1.7 | 158 KB | | 0%  2025-05-07T19:44:44.9754070Z 2025-05-07T19:44:44.9754074Z 2025-05-07T19:44:44.9754078Z 2025-05-07T19:44:44.9754085Z 2025-05-07T19:44:44.9754089Z 2025-05-07T19:44:44.9754092Z 2025-05-07T19:44:44.9754100Z 2025-05-07T19:44:44.9754103Z 2025-05-07T19:44:44.9754107Z 2025-05-07T19:44:44.9754110Z 2025-05-07T19:44:44.9754113Z 2025-05-07T19:44:44.9754393Z clang-16.0.6 | 110 KB | | 0%  2025-05-07T19:44:44.9754682Z 2025-05-07T19:44:44.9754685Z 2025-05-07T19:44:44.9754689Z 2025-05-07T19:44:44.9754692Z 2025-05-07T19:44:44.9754696Z 2025-05-07T19:44:44.9754699Z 2025-05-07T19:44:44.9754703Z 2025-05-07T19:44:44.9754706Z 2025-05-07T19:44:44.9754710Z 2025-05-07T19:44:44.9754713Z 2025-05-07T19:44:44.9754716Z 2025-05-07T19:44:44.9754720Z 2025-05-07T19:44:44.9755011Z clangxx-16.0.6 | 110 KB | | 0%  2025-05-07T19:44:44.9755307Z 2025-05-07T19:44:44.9755311Z 2025-05-07T19:44:44.9755314Z 2025-05-07T19:44:44.9755318Z 2025-05-07T19:44:44.9755321Z 2025-05-07T19:44:44.9755326Z 2025-05-07T19:44:44.9755332Z 2025-05-07T19:44:44.9755336Z 2025-05-07T19:44:44.9755339Z 2025-05-07T19:44:44.9755343Z 2025-05-07T19:44:44.9755350Z 2025-05-07T19:44:44.9755353Z 2025-05-07T19:44:44.9755385Z 2025-05-07T19:44:44.9755670Z compiler-rt-16.0.6 | 107 KB | | 0%  2025-05-07T19:44:44.9755991Z 2025-05-07T19:44:44.9755995Z 2025-05-07T19:44:44.9755998Z 2025-05-07T19:44:44.9756002Z 2025-05-07T19:44:44.9756005Z 2025-05-07T19:44:44.9756009Z 2025-05-07T19:44:44.9756012Z 2025-05-07T19:44:44.9756015Z 2025-05-07T19:44:44.9756019Z 2025-05-07T19:44:44.9756047Z 2025-05-07T19:44:44.9756050Z 2025-05-07T19:44:44.9756053Z 2025-05-07T19:44:44.9756057Z 2025-05-07T19:44:44.9756060Z 2025-05-07T19:44:44.9756323Z zlib-1.2.13 | 91 KB | | 0%  2025-05-07T19:44:44.9756613Z 2025-05-07T19:44:44.9756616Z 2025-05-07T19:44:44.9756620Z 2025-05-07T19:44:44.9756624Z 2025-05-07T19:44:44.9756627Z 2025-05-07T19:44:44.9756664Z 2025-05-07T19:44:44.9756667Z 2025-05-07T19:44:44.9756671Z 2025-05-07T19:44:44.9756675Z 2025-05-07T19:44:44.9756742Z 2025-05-07T19:44:44.9756746Z 2025-05-07T19:44:44.9756749Z 2025-05-07T19:44:44.9756752Z 2025-05-07T19:44:44.9756756Z 2025-05-07T19:44:44.9756759Z 2025-05-07T19:44:45.1288343Z libzlib-1.2.13 | 60 KB | | 0%  2025-05-07T19:44:45.1289316Z 2025-05-07T19:44:45.1289330Z 2025-05-07T19:44:45.1289373Z 2025-05-07T19:44:45.1305684Z 2025-05-07T19:44:45.2330449Z icu-73.2 | 11.5 MB | | 0%  2025-05-07T19:44:45.2330845Z 2025-05-07T19:44:45.2330850Z 2025-05-07T19:44:45.2330880Z 2025-05-07T19:44:45.2432712Z 2025-05-07T19:44:45.3267278Z icu-73.2 | 11.5 MB | | 0%  2025-05-07T19:44:45.3328398Z llvm-openmp-16.0.6 | 39.9 MB | | 0% 2025-05-07T19:44:45.3329233Z 2025-05-07T19:44:45.3329247Z 2025-05-07T19:44:45.3329258Z 2025-05-07T19:44:45.3329727Z 2025-05-07T19:44:45.3350966Z icu-73.2 | 11.5 MB | ###7 | 38%  2025-05-07T19:44:45.3351251Z 2025-05-07T19:44:45.3351255Z 2025-05-07T19:44:45.3351259Z 2025-05-07T19:44:45.3420039Z libclang-cpp16-16.0. | 17.3 MB | | 0%  2025-05-07T19:44:45.3421143Z 2025-05-07T19:44:45.3421149Z 2025-05-07T19:44:45.3435664Z libllvm16-16.0.6 | 33.7 MB | | 0%  2025-05-07T19:44:45.3436503Z 2025-05-07T19:44:45.4294096Z compiler-rt_linux-64 | 36.0 MB | | 0%  2025-05-07T19:44:45.4341024Z llvm-openmp-16.0.6 | 39.9 MB | #4 | 15% 2025-05-07T19:44:45.4341868Z 2025-05-07T19:44:45.4341884Z 2025-05-07T19:44:45.4341896Z 2025-05-07T19:44:45.4341906Z 2025-05-07T19:44:45.4420329Z icu-73.2 | 11.5 MB | ########8 | 89%  2025-05-07T19:44:45.4421165Z 2025-05-07T19:44:45.4421180Z 2025-05-07T19:44:45.4428716Z libllvm16-16.0.6 | 33.7 MB | ###4 | 35%  2025-05-07T19:44:45.4429562Z 2025-05-07T19:44:45.4429567Z 2025-05-07T19:44:45.4429571Z 2025-05-07T19:44:45.4476443Z libclang-cpp16-16.0. | 17.3 MB | ###3 | 34%  2025-05-07T19:44:45.4476775Z 2025-05-07T19:44:45.5294291Z compiler-rt_linux-64 | 36.0 MB | ###2 | 33%  2025-05-07T19:44:45.5359445Z llvm-openmp-16.0.6 | 39.9 MB | ##8 | 28% 2025-05-07T19:44:45.5360227Z 2025-05-07T19:44:45.5360232Z 2025-05-07T19:44:45.5360236Z 2025-05-07T19:44:45.5360239Z 2025-05-07T19:44:45.5430778Z icu-73.2 | 11.5 MB | ########## | 100%  2025-05-07T19:44:45.5431064Z 2025-05-07T19:44:45.5431069Z 2025-05-07T19:44:45.5431104Z 2025-05-07T19:44:45.5527414Z libclang-cpp16-16.0. | 17.3 MB | #########2 | 92%  2025-05-07T19:44:45.5527730Z 2025-05-07T19:44:45.5527735Z 2025-05-07T19:44:45.6041331Z libllvm16-16.0.6 | 33.7 MB | #####4 | 55%  2025-05-07T19:44:45.6041633Z 2025-05-07T19:44:45.6232907Z compiler-rt_linux-64 | 36.0 MB | #####1 | 51%  2025-05-07T19:44:45.6233284Z 2025-05-07T19:44:45.6233291Z 2025-05-07T19:44:45.6233297Z 2025-05-07T19:44:45.6233315Z 2025-05-07T19:44:45.6233368Z 2025-05-07T19:44:45.6293324Z libcxx-19.1.7 | 1000 KB | 1 | 2%  2025-05-07T19:44:45.6508100Z llvm-openmp-16.0.6 | 39.9 MB | #####2 | 52% 2025-05-07T19:44:45.6508924Z 2025-05-07T19:44:45.6508938Z 2025-05-07T19:44:45.6508949Z 2025-05-07T19:44:45.6508959Z 2025-05-07T19:44:45.6508970Z 2025-05-07T19:44:45.6524736Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:45.6525044Z 2025-05-07T19:44:45.6525049Z 2025-05-07T19:44:45.7049286Z libllvm16-16.0.6 | 33.7 MB | ########4 | 85%  2025-05-07T19:44:45.7049588Z 2025-05-07T19:44:45.7049593Z 2025-05-07T19:44:45.7049596Z 2025-05-07T19:44:45.7049600Z 2025-05-07T19:44:45.7049603Z 2025-05-07T19:44:45.7049607Z 2025-05-07T19:44:45.7251768Z clang-16-16.0.6 | 780 KB | 2 | 2%  2025-05-07T19:44:45.7252252Z 2025-05-07T19:44:45.7252365Z 2025-05-07T19:44:45.7252371Z 2025-05-07T19:44:45.7252614Z 2025-05-07T19:44:45.7252619Z 2025-05-07T19:44:45.7252625Z 2025-05-07T19:44:45.7297281Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:45.7375324Z llvm-openmp-16.0.6 | 39.9 MB | ####### | 70% 2025-05-07T19:44:45.7375645Z 2025-05-07T19:44:45.7430561Z compiler-rt_linux-64 | 36.0 MB | ######6 | 67%  2025-05-07T19:44:45.7432122Z 2025-05-07T19:44:45.7432141Z 2025-05-07T19:44:45.7432160Z 2025-05-07T19:44:45.7432180Z 2025-05-07T19:44:45.7432197Z 2025-05-07T19:44:45.7439273Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:45.7439769Z 2025-05-07T19:44:45.7439772Z 2025-05-07T19:44:45.7439776Z 2025-05-07T19:44:45.7439779Z 2025-05-07T19:44:45.7439790Z 2025-05-07T19:44:45.7869791Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:45.7871460Z 2025-05-07T19:44:45.7871477Z 2025-05-07T19:44:45.7871929Z 2025-05-07T19:44:45.8257353Z libclang-cpp16-16.0. | 17.3 MB | ########## | 100%  2025-05-07T19:44:45.8257883Z 2025-05-07T19:44:45.8257888Z 2025-05-07T19:44:45.8257891Z 2025-05-07T19:44:45.8257894Z 2025-05-07T19:44:45.8257898Z 2025-05-07T19:44:45.8257901Z 2025-05-07T19:44:45.8257905Z 2025-05-07T19:44:45.8330082Z libiconv-1.18 | 696 KB | 2 | 2%  2025-05-07T19:44:45.8330434Z 2025-05-07T19:44:45.8330439Z 2025-05-07T19:44:45.8330442Z 2025-05-07T19:44:45.8330446Z 2025-05-07T19:44:45.8330449Z 2025-05-07T19:44:45.8330452Z 2025-05-07T19:44:45.8330456Z 2025-05-07T19:44:45.8331059Z 2025-05-07T19:44:45.8378591Z libxml2-2.12.7 | 688 KB | 2 | 2%  2025-05-07T19:44:45.8378914Z 2025-05-07T19:44:45.8601572Z compiler-rt_linux-64 | 36.0 MB | ########1 | 81%  2025-05-07T19:44:45.8617681Z llvm-openmp-16.0.6 | 39.9 MB | ########6 | 87% 2025-05-07T19:44:45.8618514Z 2025-05-07T19:44:45.8618561Z 2025-05-07T19:44:45.8618602Z 2025-05-07T19:44:45.8618614Z 2025-05-07T19:44:45.8618641Z 2025-05-07T19:44:45.8618652Z 2025-05-07T19:44:45.8618662Z 2025-05-07T19:44:45.8733234Z libiconv-1.18 | 696 KB | ########## | 100%  2025-05-07T19:44:45.8733573Z 2025-05-07T19:44:45.8733579Z 2025-05-07T19:44:45.8733584Z 2025-05-07T19:44:45.8733589Z 2025-05-07T19:44:45.8733595Z 2025-05-07T19:44:45.8733600Z 2025-05-07T19:44:45.8733605Z 2025-05-07T19:44:45.8734941Z 2025-05-07T19:44:45.9268608Z libxml2-2.12.7 | 688 KB | ########## | 100%  2025-05-07T19:44:45.9268921Z 2025-05-07T19:44:45.9268938Z 2025-05-07T19:44:45.9268942Z 2025-05-07T19:44:45.9268946Z 2025-05-07T19:44:45.9268949Z 2025-05-07T19:44:45.9268953Z 2025-05-07T19:44:45.9268957Z 2025-05-07T19:44:45.9268960Z 2025-05-07T19:44:45.9268984Z 2025-05-07T19:44:45.9268987Z 2025-05-07T19:44:45.9340198Z libcxxabi-19.1.7 | 158 KB | # | 10%  2025-05-07T19:44:45.9340641Z 2025-05-07T19:44:45.9340782Z 2025-05-07T19:44:45.9340791Z 2025-05-07T19:44:45.9340813Z 2025-05-07T19:44:45.9340818Z 2025-05-07T19:44:45.9340823Z 2025-05-07T19:44:45.9340828Z 2025-05-07T19:44:45.9340832Z 2025-05-07T19:44:45.9340836Z 2025-05-07T19:44:45.9340841Z 2025-05-07T19:44:45.9368788Z libcxxabi-19.1.7 | 158 KB | ########## | 100%  2025-05-07T19:44:45.9369130Z 2025-05-07T19:44:45.9369134Z 2025-05-07T19:44:45.9369138Z 2025-05-07T19:44:45.9369141Z 2025-05-07T19:44:45.9369145Z 2025-05-07T19:44:45.9369149Z 2025-05-07T19:44:45.9369152Z 2025-05-07T19:44:45.9369155Z 2025-05-07T19:44:45.9369159Z 2025-05-07T19:44:45.9422509Z zstd-1.5.6 | 542 KB | 2 | 3%  2025-05-07T19:44:45.9422815Z 2025-05-07T19:44:45.9501760Z compiler-rt_linux-64 | 36.0 MB | #########5 | 96%  2025-05-07T19:44:45.9502106Z 2025-05-07T19:44:45.9502111Z 2025-05-07T19:44:45.9502115Z 2025-05-07T19:44:45.9502132Z 2025-05-07T19:44:45.9502136Z 2025-05-07T19:44:45.9502140Z 2025-05-07T19:44:45.9502143Z 2025-05-07T19:44:45.9502362Z 2025-05-07T19:44:45.9502365Z 2025-05-07T19:44:45.9753868Z zstd-1.5.6 | 542 KB | ########## | 100%  2025-05-07T19:44:45.9754206Z 2025-05-07T19:44:45.9754210Z 2025-05-07T19:44:45.9754214Z 2025-05-07T19:44:45.9754218Z 2025-05-07T19:44:45.9754222Z 2025-05-07T19:44:45.9754225Z 2025-05-07T19:44:45.9754229Z 2025-05-07T19:44:45.9754232Z 2025-05-07T19:44:45.9754236Z 2025-05-07T19:44:45.9754239Z 2025-05-07T19:44:45.9754242Z 2025-05-07T19:44:45.9783130Z clang-16.0.6 | 110 KB | #4 | 15%  2025-05-07T19:44:45.9783467Z 2025-05-07T19:44:45.9783484Z 2025-05-07T19:44:45.9783488Z 2025-05-07T19:44:45.9783492Z 2025-05-07T19:44:45.9783495Z 2025-05-07T19:44:45.9783498Z 2025-05-07T19:44:45.9783502Z 2025-05-07T19:44:45.9783505Z 2025-05-07T19:44:45.9783509Z 2025-05-07T19:44:45.9783512Z 2025-05-07T19:44:45.9783704Z 2025-05-07T19:44:46.0029427Z clang-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:46.0029770Z 2025-05-07T19:44:46.0029774Z 2025-05-07T19:44:46.0029778Z 2025-05-07T19:44:46.0029782Z 2025-05-07T19:44:46.0029785Z 2025-05-07T19:44:46.0029789Z 2025-05-07T19:44:46.0029792Z 2025-05-07T19:44:46.0029796Z 2025-05-07T19:44:46.0029799Z 2025-05-07T19:44:46.0029803Z 2025-05-07T19:44:46.0029806Z 2025-05-07T19:44:46.0029810Z 2025-05-07T19:44:46.0057254Z clangxx-16.0.6 | 110 KB | #4 | 15%  2025-05-07T19:44:46.0057593Z 2025-05-07T19:44:46.0057597Z 2025-05-07T19:44:46.0057601Z 2025-05-07T19:44:46.0057604Z 2025-05-07T19:44:46.0057608Z 2025-05-07T19:44:46.0057611Z 2025-05-07T19:44:46.0057615Z 2025-05-07T19:44:46.0057618Z 2025-05-07T19:44:46.0057642Z 2025-05-07T19:44:46.0057646Z 2025-05-07T19:44:46.0057649Z 2025-05-07T19:44:46.0057653Z 2025-05-07T19:44:46.0197280Z clangxx-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:46.0197762Z 2025-05-07T19:44:46.0197800Z 2025-05-07T19:44:46.0197806Z 2025-05-07T19:44:46.0197810Z 2025-05-07T19:44:46.0197818Z 2025-05-07T19:44:46.0197826Z 2025-05-07T19:44:46.0197832Z 2025-05-07T19:44:46.0197836Z 2025-05-07T19:44:46.0197841Z 2025-05-07T19:44:46.0197849Z 2025-05-07T19:44:46.0197853Z 2025-05-07T19:44:46.0197857Z 2025-05-07T19:44:46.0197862Z 2025-05-07T19:44:46.0219100Z compiler-rt-16.0.6 | 107 KB | #4 | 15%  2025-05-07T19:44:46.0219482Z 2025-05-07T19:44:46.0219537Z 2025-05-07T19:44:46.0219541Z 2025-05-07T19:44:46.0219545Z 2025-05-07T19:44:46.0219559Z 2025-05-07T19:44:46.0219563Z 2025-05-07T19:44:46.0219566Z 2025-05-07T19:44:46.0219570Z 2025-05-07T19:44:46.0219575Z 2025-05-07T19:44:46.0219579Z 2025-05-07T19:44:46.0219584Z 2025-05-07T19:44:46.0219589Z 2025-05-07T19:44:46.0219594Z 2025-05-07T19:44:46.0753764Z compiler-rt-16.0.6 | 107 KB | ########## | 100%  2025-05-07T19:44:46.0754142Z 2025-05-07T19:44:46.0754157Z 2025-05-07T19:44:46.0754163Z 2025-05-07T19:44:46.0754168Z 2025-05-07T19:44:46.0754173Z 2025-05-07T19:44:46.0754178Z 2025-05-07T19:44:46.0754182Z 2025-05-07T19:44:46.0754187Z 2025-05-07T19:44:46.0754192Z 2025-05-07T19:44:46.0754198Z 2025-05-07T19:44:46.0754204Z 2025-05-07T19:44:46.0754208Z 2025-05-07T19:44:46.0754213Z 2025-05-07T19:44:46.0754217Z 2025-05-07T19:44:46.0754244Z 2025-05-07T19:44:46.0758757Z libzlib-1.2.13 | 60 KB | ##6 | 27%  2025-05-07T19:44:46.0759067Z 2025-05-07T19:44:46.0759071Z 2025-05-07T19:44:46.0759075Z 2025-05-07T19:44:46.0759078Z 2025-05-07T19:44:46.0759081Z 2025-05-07T19:44:46.0759162Z 2025-05-07T19:44:46.0762457Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:46.0762741Z 2025-05-07T19:44:46.0762745Z 2025-05-07T19:44:46.0762748Z 2025-05-07T19:44:46.0762758Z 2025-05-07T19:44:46.0762761Z 2025-05-07T19:44:46.0763790Z 2025-05-07T19:44:46.0771770Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:46.0772052Z 2025-05-07T19:44:46.0772064Z 2025-05-07T19:44:46.0772068Z 2025-05-07T19:44:46.0772072Z 2025-05-07T19:44:46.0772075Z 2025-05-07T19:44:46.0772079Z 2025-05-07T19:44:46.0772082Z 2025-05-07T19:44:46.0772085Z 2025-05-07T19:44:46.0772089Z 2025-05-07T19:44:46.0772092Z 2025-05-07T19:44:46.0772118Z 2025-05-07T19:44:46.0772121Z 2025-05-07T19:44:46.0772125Z 2025-05-07T19:44:46.0772128Z 2025-05-07T19:44:46.0772131Z 2025-05-07T19:44:46.0831603Z libzlib-1.2.13 | 60 KB | ########## | 100%  2025-05-07T19:44:46.0831970Z 2025-05-07T19:44:46.0831975Z 2025-05-07T19:44:46.0832003Z 2025-05-07T19:44:46.0832007Z 2025-05-07T19:44:46.0832010Z 2025-05-07T19:44:46.0832014Z 2025-05-07T19:44:46.0832017Z 2025-05-07T19:44:46.0832020Z 2025-05-07T19:44:46.0832197Z 2025-05-07T19:44:46.0832202Z 2025-05-07T19:44:46.0832206Z 2025-05-07T19:44:46.0832209Z 2025-05-07T19:44:46.0832218Z 2025-05-07T19:44:46.0832222Z 2025-05-07T19:44:46.0858711Z zlib-1.2.13 | 91 KB | #7 | 18%  2025-05-07T19:44:46.0859062Z 2025-05-07T19:44:46.0859253Z 2025-05-07T19:44:46.0859261Z 2025-05-07T19:44:46.0859266Z 2025-05-07T19:44:46.0859308Z 2025-05-07T19:44:46.0859313Z 2025-05-07T19:44:46.0859318Z 2025-05-07T19:44:46.0859322Z 2025-05-07T19:44:46.0859327Z 2025-05-07T19:44:46.0859332Z 2025-05-07T19:44:46.0859335Z 2025-05-07T19:44:46.0859340Z 2025-05-07T19:44:46.0859344Z 2025-05-07T19:44:46.0859348Z 2025-05-07T19:44:46.1346264Z zlib-1.2.13 | 91 KB | ########## | 100%  2025-05-07T19:44:46.1346596Z 2025-05-07T19:44:46.1346622Z 2025-05-07T19:44:46.1346626Z 2025-05-07T19:44:46.1346629Z 2025-05-07T19:44:46.1346633Z 2025-05-07T19:44:46.1346636Z 2025-05-07T19:44:46.1346654Z 2025-05-07T19:44:46.1348891Z libiconv-1.18 | 696 KB | ########## | 100%  2025-05-07T19:44:46.1349200Z 2025-05-07T19:44:46.1349204Z 2025-05-07T19:44:46.1349208Z 2025-05-07T19:44:46.1349233Z 2025-05-07T19:44:46.1349237Z 2025-05-07T19:44:46.1349240Z 2025-05-07T19:44:46.1349248Z 2025-05-07T19:44:46.1588423Z libiconv-1.18 | 696 KB | ########## | 100%  2025-05-07T19:44:46.1588744Z 2025-05-07T19:44:46.1588749Z 2025-05-07T19:44:46.1588752Z 2025-05-07T19:44:46.1588756Z 2025-05-07T19:44:46.1852265Z icu-73.2 | 11.5 MB | ########## | 100%  2025-05-07T19:44:46.1852834Z 2025-05-07T19:44:46.1852863Z 2025-05-07T19:44:46.1908896Z libllvm16-16.0.6 | 33.7 MB | ########## | 100%  2025-05-07T19:44:46.1909189Z 2025-05-07T19:44:46.1909194Z 2025-05-07T19:44:46.1909197Z 2025-05-07T19:44:46.1909201Z 2025-05-07T19:44:46.1909204Z 2025-05-07T19:44:46.1909208Z 2025-05-07T19:44:46.1909211Z 2025-05-07T19:44:46.1909229Z 2025-05-07T19:44:46.1909233Z 2025-05-07T19:44:46.1909267Z 2025-05-07T19:44:46.1914301Z libcxxabi-19.1.7 | 158 KB | ########## | 100%  2025-05-07T19:44:46.1914615Z 2025-05-07T19:44:46.1914619Z 2025-05-07T19:44:46.1914622Z 2025-05-07T19:44:46.1914626Z 2025-05-07T19:44:46.1914629Z 2025-05-07T19:44:46.1914632Z 2025-05-07T19:44:46.1914636Z 2025-05-07T19:44:46.1914639Z 2025-05-07T19:44:46.1914643Z 2025-05-07T19:44:46.1915582Z 2025-05-07T19:44:46.1983422Z libcxxabi-19.1.7 | 158 KB | ########## | 100%  2025-05-07T19:44:46.1983750Z 2025-05-07T19:44:46.1983755Z 2025-05-07T19:44:46.1983759Z 2025-05-07T19:44:46.1983763Z 2025-05-07T19:44:46.1983766Z 2025-05-07T19:44:46.1983770Z 2025-05-07T19:44:46.1983773Z 2025-05-07T19:44:46.1983777Z 2025-05-07T19:44:46.1984066Z libxml2-2.12.7 | 688 KB | ########## | 100%  2025-05-07T19:44:46.1984356Z 2025-05-07T19:44:46.1984360Z 2025-05-07T19:44:46.1984377Z 2025-05-07T19:44:46.1984381Z 2025-05-07T19:44:46.1984384Z 2025-05-07T19:44:46.1984588Z 2025-05-07T19:44:46.1984591Z 2025-05-07T19:44:46.1984603Z 2025-05-07T19:44:46.2301665Z libxml2-2.12.7 | 688 KB | ########## | 100%  2025-05-07T19:44:46.2301996Z 2025-05-07T19:44:46.2302000Z 2025-05-07T19:44:46.2302004Z 2025-05-07T19:44:46.2302007Z 2025-05-07T19:44:46.2302011Z 2025-05-07T19:44:46.2302014Z 2025-05-07T19:44:46.2302018Z 2025-05-07T19:44:46.2302021Z 2025-05-07T19:44:46.2302047Z 2025-05-07T19:44:46.2302295Z zstd-1.5.6 | 542 KB | ########## | 100%  2025-05-07T19:44:46.2302571Z 2025-05-07T19:44:46.2302575Z 2025-05-07T19:44:46.2302578Z 2025-05-07T19:44:46.2302582Z 2025-05-07T19:44:46.2302585Z 2025-05-07T19:44:46.2302589Z 2025-05-07T19:44:46.2302592Z 2025-05-07T19:44:46.2302596Z 2025-05-07T19:44:46.2302599Z 2025-05-07T19:44:46.2871767Z zstd-1.5.6 | 542 KB | ########## | 100%  2025-05-07T19:44:46.2872125Z 2025-05-07T19:44:46.2872129Z 2025-05-07T19:44:46.2872132Z 2025-05-07T19:44:46.2872142Z 2025-05-07T19:44:46.2872146Z 2025-05-07T19:44:46.2872150Z 2025-05-07T19:44:46.2872153Z 2025-05-07T19:44:46.2872157Z 2025-05-07T19:44:46.2872160Z 2025-05-07T19:44:46.2872163Z 2025-05-07T19:44:46.2872167Z 2025-05-07T19:44:46.2873249Z clang-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:46.2873545Z 2025-05-07T19:44:46.2873548Z 2025-05-07T19:44:46.2873552Z 2025-05-07T19:44:46.2873555Z 2025-05-07T19:44:46.2873558Z 2025-05-07T19:44:46.2873562Z 2025-05-07T19:44:46.2873565Z 2025-05-07T19:44:46.2873568Z 2025-05-07T19:44:46.2873572Z 2025-05-07T19:44:46.2873575Z 2025-05-07T19:44:46.2873594Z 2025-05-07T19:44:46.3206728Z clang-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:46.3207054Z 2025-05-07T19:44:46.3207256Z 2025-05-07T19:44:46.3207265Z 2025-05-07T19:44:46.3207270Z 2025-05-07T19:44:46.3207291Z 2025-05-07T19:44:46.3207296Z 2025-05-07T19:44:46.3207300Z 2025-05-07T19:44:46.3207315Z 2025-05-07T19:44:46.3207320Z 2025-05-07T19:44:46.3207359Z 2025-05-07T19:44:46.3207364Z 2025-05-07T19:44:46.3207368Z 2025-05-07T19:44:46.3207835Z clangxx-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:46.3208161Z 2025-05-07T19:44:46.3208165Z 2025-05-07T19:44:46.3208168Z 2025-05-07T19:44:46.3208171Z 2025-05-07T19:44:46.3208175Z 2025-05-07T19:44:46.3208202Z 2025-05-07T19:44:46.3208205Z 2025-05-07T19:44:46.3208209Z 2025-05-07T19:44:46.3208212Z 2025-05-07T19:44:46.3208215Z 2025-05-07T19:44:46.3208219Z 2025-05-07T19:44:46.3208222Z 2025-05-07T19:44:46.3249458Z clangxx-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:46.3249979Z llvm-openmp-16.0.6 | 39.9 MB | ########## | 100% 2025-05-07T19:44:46.3250243Z 2025-05-07T19:44:46.3250248Z 2025-05-07T19:44:46.3250251Z 2025-05-07T19:44:46.3250268Z 2025-05-07T19:44:46.3250271Z 2025-05-07T19:44:46.3250275Z 2025-05-07T19:44:46.3250278Z 2025-05-07T19:44:46.3250287Z 2025-05-07T19:44:46.3250290Z 2025-05-07T19:44:46.3250294Z 2025-05-07T19:44:46.3250319Z 2025-05-07T19:44:46.3250323Z 2025-05-07T19:44:46.3250326Z 2025-05-07T19:44:46.3250638Z compiler-rt-16.0.6 | 107 KB | ########## | 100%  2025-05-07T19:44:46.3250964Z 2025-05-07T19:44:46.3250968Z 2025-05-07T19:44:46.3250971Z 2025-05-07T19:44:46.3250975Z 2025-05-07T19:44:46.3250978Z 2025-05-07T19:44:46.3250982Z 2025-05-07T19:44:46.3251010Z 2025-05-07T19:44:46.3251014Z 2025-05-07T19:44:46.3251017Z 2025-05-07T19:44:46.3251021Z 2025-05-07T19:44:46.3251024Z 2025-05-07T19:44:46.3251027Z 2025-05-07T19:44:46.3251031Z 2025-05-07T19:44:46.3374631Z compiler-rt-16.0.6 | 107 KB | ########## | 100%  2025-05-07T19:44:46.3375031Z 2025-05-07T19:44:46.3375036Z 2025-05-07T19:44:46.3375039Z 2025-05-07T19:44:46.3375055Z 2025-05-07T19:44:46.3375059Z 2025-05-07T19:44:46.3375063Z 2025-05-07T19:44:46.3375066Z 2025-05-07T19:44:46.3375233Z 2025-05-07T19:44:46.3375237Z 2025-05-07T19:44:46.3375240Z 2025-05-07T19:44:46.3375244Z 2025-05-07T19:44:46.3375248Z 2025-05-07T19:44:46.3375252Z 2025-05-07T19:44:46.3375255Z 2025-05-07T19:44:46.3375259Z 2025-05-07T19:44:46.3375577Z libzlib-1.2.13 | 60 KB | ########## | 100%  2025-05-07T19:44:46.3375912Z 2025-05-07T19:44:46.3375916Z 2025-05-07T19:44:46.3375919Z 2025-05-07T19:44:46.3375923Z 2025-05-07T19:44:46.3375926Z 2025-05-07T19:44:46.3375930Z 2025-05-07T19:44:46.3375933Z 2025-05-07T19:44:46.3375937Z 2025-05-07T19:44:46.3375940Z 2025-05-07T19:44:46.3375943Z 2025-05-07T19:44:46.3375947Z 2025-05-07T19:44:46.3375950Z 2025-05-07T19:44:46.3375954Z 2025-05-07T19:44:46.3375957Z 2025-05-07T19:44:46.3375961Z 2025-05-07T19:44:46.3446644Z libzlib-1.2.13 | 60 KB | ########## | 100%  2025-05-07T19:44:46.3447001Z 2025-05-07T19:44:46.3447006Z 2025-05-07T19:44:46.3447016Z 2025-05-07T19:44:46.3447020Z 2025-05-07T19:44:46.3447023Z 2025-05-07T19:44:46.3447027Z 2025-05-07T19:44:46.3447030Z 2025-05-07T19:44:46.3447034Z 2025-05-07T19:44:46.3447038Z 2025-05-07T19:44:46.3447064Z 2025-05-07T19:44:46.3447067Z 2025-05-07T19:44:46.3447071Z 2025-05-07T19:44:46.3447074Z 2025-05-07T19:44:46.3447078Z 2025-05-07T19:44:46.3447358Z zlib-1.2.13 | 91 KB | ########## | 100%  2025-05-07T19:44:46.3447647Z 2025-05-07T19:44:46.3447651Z 2025-05-07T19:44:46.3447654Z 2025-05-07T19:44:46.3447658Z 2025-05-07T19:44:46.3447661Z 2025-05-07T19:44:46.3447688Z 2025-05-07T19:44:46.3447691Z 2025-05-07T19:44:46.3447695Z 2025-05-07T19:44:46.3447698Z 2025-05-07T19:44:46.3447701Z 2025-05-07T19:44:46.3447705Z 2025-05-07T19:44:46.3447708Z 2025-05-07T19:44:46.3447711Z 2025-05-07T19:44:46.3447715Z 2025-05-07T19:44:46.3611126Z zlib-1.2.13 | 91 KB | ########## | 100%  2025-05-07T19:44:46.3611472Z 2025-05-07T19:44:46.4127554Z compiler-rt_linux-64 | 36.0 MB | ########## | 100%  2025-05-07T19:44:46.4128380Z 2025-05-07T19:44:46.4128393Z 2025-05-07T19:44:46.4128405Z 2025-05-07T19:44:46.8053097Z libclang-cpp16-16.0. | 17.3 MB | ########## | 100%  2025-05-07T19:44:46.8054029Z 2025-05-07T19:44:46.8078692Z compiler-rt_linux-64 | 36.0 MB | ########## | 100%  2025-05-07T19:44:46.8079536Z 2025-05-07T19:44:46.8079550Z 2025-05-07T19:44:46.8543415Z libllvm16-16.0.6 | 33.7 MB | ########## | 100%  2025-05-07T19:44:46.8550837Z llvm-openmp-16.0.6 | 39.9 MB | ########## | 100% 2025-05-07T19:44:46.8551693Z 2025-05-07T19:44:46.8552185Z 2025-05-07T19:44:46.8552513Z  2025-05-07T19:44:46.8552799Z 2025-05-07T19:44:46.8552804Z 2025-05-07T19:44:46.8553081Z  2025-05-07T19:44:46.8553394Z 2025-05-07T19:44:46.8553406Z 2025-05-07T19:44:46.8553410Z 2025-05-07T19:44:46.8553631Z  2025-05-07T19:44:46.8553859Z 2025-05-07T19:44:46.8553863Z 2025-05-07T19:44:46.8553867Z 2025-05-07T19:44:46.8553870Z 2025-05-07T19:44:46.8554056Z  2025-05-07T19:44:46.8554311Z 2025-05-07T19:44:46.8554316Z 2025-05-07T19:44:46.8554319Z 2025-05-07T19:44:46.8554322Z 2025-05-07T19:44:46.8554327Z 2025-05-07T19:44:46.8554511Z  2025-05-07T19:44:46.8554769Z 2025-05-07T19:44:46.8554772Z 2025-05-07T19:44:46.8554776Z 2025-05-07T19:44:46.8554779Z 2025-05-07T19:44:46.8554782Z 2025-05-07T19:44:46.8554786Z 2025-05-07T19:44:46.8554981Z  2025-05-07T19:44:46.8555216Z 2025-05-07T19:44:46.8555225Z 2025-05-07T19:44:46.8555229Z 2025-05-07T19:44:46.8555233Z 2025-05-07T19:44:46.8555487Z 2025-05-07T19:44:46.8555491Z 2025-05-07T19:44:46.8555495Z 2025-05-07T19:44:46.8555694Z  2025-05-07T19:44:46.8555927Z 2025-05-07T19:44:46.8555931Z 2025-05-07T19:44:46.8555935Z 2025-05-07T19:44:46.8555939Z 2025-05-07T19:44:46.8555943Z 2025-05-07T19:44:46.8555946Z 2025-05-07T19:44:46.8555949Z 2025-05-07T19:44:46.8555976Z 2025-05-07T19:44:46.8556171Z  2025-05-07T19:44:46.8556407Z 2025-05-07T19:44:46.8556411Z 2025-05-07T19:44:46.8556414Z 2025-05-07T19:44:46.8556418Z 2025-05-07T19:44:46.8556421Z 2025-05-07T19:44:46.8556425Z 2025-05-07T19:44:46.8556428Z 2025-05-07T19:44:46.8556431Z 2025-05-07T19:44:46.8556435Z 2025-05-07T19:44:46.8556660Z  2025-05-07T19:44:46.8556993Z 2025-05-07T19:44:46.8556997Z 2025-05-07T19:44:46.8557001Z 2025-05-07T19:44:46.8557004Z 2025-05-07T19:44:46.8557012Z 2025-05-07T19:44:46.8557015Z 2025-05-07T19:44:46.8557019Z 2025-05-07T19:44:46.8557022Z 2025-05-07T19:44:46.8557025Z 2025-05-07T19:44:46.8557029Z 2025-05-07T19:44:46.8557261Z  2025-05-07T19:44:46.8557502Z 2025-05-07T19:44:46.8557505Z 2025-05-07T19:44:46.8557509Z 2025-05-07T19:44:46.8557513Z 2025-05-07T19:44:46.8557517Z 2025-05-07T19:44:46.8557520Z 2025-05-07T19:44:46.8557524Z 2025-05-07T19:44:46.8557528Z 2025-05-07T19:44:46.8557531Z 2025-05-07T19:44:46.8557534Z 2025-05-07T19:44:46.8557561Z 2025-05-07T19:44:46.8557794Z  2025-05-07T19:44:46.8558037Z 2025-05-07T19:44:46.8558040Z 2025-05-07T19:44:46.8558044Z 2025-05-07T19:44:46.8558047Z 2025-05-07T19:44:46.8558077Z 2025-05-07T19:44:46.8558080Z 2025-05-07T19:44:46.8558087Z 2025-05-07T19:44:46.8558091Z 2025-05-07T19:44:46.8558094Z 2025-05-07T19:44:46.8558101Z 2025-05-07T19:44:46.8558105Z 2025-05-07T19:44:46.8558108Z 2025-05-07T19:44:46.8558316Z  2025-05-07T19:44:46.8558563Z 2025-05-07T19:44:46.8558566Z 2025-05-07T19:44:46.8558599Z 2025-05-07T19:44:46.8558602Z 2025-05-07T19:44:46.8558605Z 2025-05-07T19:44:46.8558609Z 2025-05-07T19:44:46.8558612Z 2025-05-07T19:44:46.8558615Z 2025-05-07T19:44:46.8558619Z 2025-05-07T19:44:46.8558622Z 2025-05-07T19:44:46.8558625Z 2025-05-07T19:44:46.8558629Z 2025-05-07T19:44:46.8558632Z 2025-05-07T19:44:46.8558841Z  2025-05-07T19:44:46.8559112Z 2025-05-07T19:44:46.8559115Z 2025-05-07T19:44:46.8559118Z 2025-05-07T19:44:46.8559122Z 2025-05-07T19:44:46.8559125Z 2025-05-07T19:44:46.8559128Z 2025-05-07T19:44:46.8559132Z 2025-05-07T19:44:46.8559139Z 2025-05-07T19:44:46.8559143Z 2025-05-07T19:44:46.8559146Z 2025-05-07T19:44:46.8559153Z 2025-05-07T19:44:46.8559156Z 2025-05-07T19:44:46.8559160Z 2025-05-07T19:44:46.8559163Z 2025-05-07T19:44:46.8559383Z  2025-05-07T19:44:46.8559656Z 2025-05-07T19:44:46.8559661Z 2025-05-07T19:44:46.8559664Z 2025-05-07T19:44:46.8559667Z 2025-05-07T19:44:46.8559671Z 2025-05-07T19:44:46.8559674Z 2025-05-07T19:44:46.8559678Z 2025-05-07T19:44:46.8559681Z 2025-05-07T19:44:46.8559684Z 2025-05-07T19:44:46.8559688Z 2025-05-07T19:44:46.8559691Z 2025-05-07T19:44:46.8559695Z 2025-05-07T19:44:46.8559698Z 2025-05-07T19:44:46.8559702Z 2025-05-07T19:44:46.8559705Z 2025-05-07T19:44:46.8559998Z  done 2025-05-07T19:44:46.9562294Z Preparing transaction: \ done 2025-05-07T19:44:47.0569978Z Verifying transaction: / done 2025-05-07T19:44:47.1583390Z Executing transaction: \ done 2025-05-07T19:44:47.2497000Z [INSTALL] Setting the C/C++ compiler symlinks ... 2025-05-07T19:44:50.9900919Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:50.9901482Z 2025-05-07T19:44:50.9913784Z 2025-05-07T19:44:50.9929783Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:50.9931280Z 2025-05-07T19:44:50.9943019Z 2025-05-07T19:44:50.9962514Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:50.9964032Z 2025-05-07T19:44:50.9975950Z 2025-05-07T19:44:50.9991649Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:50.9993172Z 2025-05-07T19:44:51.0005185Z 2025-05-07T19:44:51.0005385Z [INSTALL] Removing GCC package activation scripts ... 2025-05-07T19:44:52.8747598Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:44:52.8748184Z 2025-05-07T19:44:52.8748366Z total 28 2025-05-07T19:44:52.8748675Z drwxr-xr-x. 2 root root 134 May 7 19:44 . 2025-05-07T19:44:52.8749191Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:44:52.8749659Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:44:52.8750189Z -rw-r--r--. 2 root root 11630 Jun 10 2024 activate-gcc_linux-64.sh 2025-05-07T19:44:52.8750659Z -rw-r--r--. 2 root root 5190 Jun 10 2024 activate-gxx_linux-64.sh 2025-05-07T19:44:52.8751135Z -rw-r--r--. 2 root root 873 Jun 5 2024 libxml2_activate.sh 2025-05-07T19:44:52.8751420Z 2025-05-07T19:44:52.8751777Z + rm -rf /github/home/miniconda/envs/build_binary/etc/conda/activate.d/activate-gcc_linux-64.sh 2025-05-07T19:44:52.8752214Z 2025-05-07T19:44:52.8761312Z 2025-05-07T19:44:52.8761657Z + rm -rf /github/home/miniconda/envs/build_binary/etc/conda/activate.d/activate-gxx_linux-64.sh 2025-05-07T19:44:52.8762129Z 2025-05-07T19:44:52.8781762Z 2025-05-07T19:44:52.8782779Z + conda env config vars set -n build_binary CC= 2025-05-07T19:44:52.8783567Z 2025-05-07T19:44:53.3007366Z 2025-05-07T19:44:53.3007883Z + conda env config vars set -n build_binary CXX= 2025-05-07T19:44:53.3008153Z 2025-05-07T19:44:53.7184710Z 2025-05-07T19:44:53.7185674Z + conda run -n build_binary printenv CC 2025-05-07T19:44:53.7186381Z 2025-05-07T19:44:55.3058803Z 2025-05-07T19:44:55.3058888Z 2025-05-07T19:44:55.3857079Z 2025-05-07T19:44:55.3857710Z + conda run -n build_binary printenv CXX 2025-05-07T19:44:55.3858051Z 2025-05-07T19:44:56.9663313Z 2025-05-07T19:44:56.9663336Z 2025-05-07T19:44:57.0234739Z 2025-05-07T19:44:58.6590589Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/lib ... 2025-05-07T19:45:00.2428313Z ERROR conda.cli.main_run:execute(125): `conda run printenv LD_LIBRARY_PATH` failed. (See above for error) 2025-05-07T19:45:00.3023724Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib 2025-05-07T19:45:00.3025096Z 2025-05-07T19:45:00.7236922Z 2025-05-07T19:45:02.3107244Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:45:02.3108067Z 2025-05-07T19:45:02.3687957Z [CHECK] Binary cc found in PATH 2025-05-07T19:45:03.9483605Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:45:03.9484392Z 2025-05-07T19:45:04.0068329Z [CHECK] Binary gcc found in PATH 2025-05-07T19:45:05.5798480Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:45:05.5799266Z 2025-05-07T19:45:05.6378402Z [CHECK] Binary c++ found in PATH 2025-05-07T19:45:07.2188751Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:45:07.2189651Z 2025-05-07T19:45:07.2786172Z [CHECK] Binary g++ found in PATH 2025-05-07T19:45:07.2787411Z [INFO] Printing out all preprocessor defines in the C compiler ... 2025-05-07T19:45:07.2788686Z + conda run -n build_binary cc -dM -E - 2025-05-07T19:45:07.2789312Z 2025-05-07T19:45:08.8783553Z #define _LP64 1 2025-05-07T19:45:08.8783926Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:08.8784564Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:08.8784835Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:08.8785114Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:08.8785383Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:08.8785660Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:08.8785923Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:08.8786242Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:08.8786543Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:08.8786845Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:08.8787202Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:08.8787513Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:08.8787816Z #define __CHAR_BIT__ 8 2025-05-07T19:45:08.8788075Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:08.8788414Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:08.8790615Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:08.8790992Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:08.8791325Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:08.8791662Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:08.8791995Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:08.8792322Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:08.8792672Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:08.8792998Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:08.8793338Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:08.8793634Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:08.8793963Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:08.8794291Z #define __DBL_DIG__ 15 2025-05-07T19:45:08.8794576Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:45:08.8794918Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:45:08.8795188Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:08.8795483Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.8795756Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:08.8796037Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:08.8796317Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:08.8796622Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:08.8796950Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:08.8797254Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:08.8797534Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:08.8797877Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:08.8798183Z #define __ELF__ 1 2025-05-07T19:45:08.8798433Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:08.8798699Z #define __FLOAT128__ 1 2025-05-07T19:45:08.8798963Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:08.8799288Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:08.8799618Z #define __FLT16_DIG__ 3 2025-05-07T19:45:08.8800023Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:08.8800330Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:08.8800629Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:45:08.8800914Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.8801343Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:08.8801607Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:08.8801883Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:08.8802159Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:08.8802436Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:08.8802725Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:08.8802992Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:08.8803300Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:08.8803576Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:45:08.8803884Z #define __FLT_DIG__ 6 2025-05-07T19:45:08.8804127Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:08.8804434Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:45:08.8804697Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:08.8804978Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.8805256Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:08.8805512Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:08.8805789Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:08.8806161Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:08.8806456Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:08.8806725Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:08.8807007Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:08.8807275Z #define __FLT_RADIX__ 2 2025-05-07T19:45:08.8807522Z #define __FXSR__ 1 2025-05-07T19:45:08.8807752Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:08.8808057Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:08.8808380Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:08.8808692Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:08.8809022Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:08.8809318Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:08.8809638Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:08.8809938Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:08.8810345Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:08.8810655Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:08.8810984Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:08.8811302Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:08.8811624Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:08.8811950Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:08.8812281Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:08.8812635Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:08.8812965Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:08.8813284Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:08.8813540Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:08.8813826Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:45:08.8814081Z #define __GNUC__ 4 2025-05-07T19:45:08.8814510Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:08.8814796Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:08.8815056Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:08.8815335Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:08.8815590Z #define __INT16_MAX__ 32767 2025-05-07T19:45:08.8815870Z #define __INT16_TYPE__ short 2025-05-07T19:45:08.8816134Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:08.8816407Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:08.8816662Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:08.8816940Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:08.8817212Z #define __INT32_TYPE__ int 2025-05-07T19:45:08.8817485Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:08.8817752Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:08.8818029Z #define __INT64_FMTi__ "li" 2025-05-07T19:45:08.8818317Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:08.8818621Z #define __INT64_TYPE__ long int 2025-05-07T19:45:08.8818915Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:08.8819173Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:08.8819448Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:08.8819840Z #define __INT8_MAX__ 127 2025-05-07T19:45:08.8820119Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:08.8820399Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:08.8820690Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:08.8820953Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:45:08.8821246Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:08.8821577Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:08.8821853Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:08.8822329Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:08.8822597Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:08.8822893Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:08.8823199Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:08.8823492Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:45:08.8823755Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:08.8824049Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:08.8824326Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:08.8824620Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:08.8824919Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:08.8825194Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:08.8825484Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:08.8825938Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:08.8826249Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:08.8826518Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:08.8826812Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:08.8827095Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:08.8827417Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:08.8827751Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:45:08.8828065Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:08.8828355Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:45:08.8828630Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:08.8828924Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:08.8829203Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:08.8829518Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:08.8829788Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:08.8830203Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:08.8830488Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:08.8830792Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:08.8831099Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:08.8831378Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:08.8831674Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:08.8831952Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:08.8832274Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:08.8832552Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:08.8832842Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:08.8833123Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:08.8833442Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:08.8833774Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:08.8834084Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:08.8834377Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:08.8834773Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:08.8835066Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:08.8835338Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:08.8835653Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:08.8835913Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:08.8836181Z #define __INT_WIDTH__ 32 2025-05-07T19:45:08.8836426Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:08.8836752Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:08.8837087Z #define __LDBL_DIG__ 18 2025-05-07T19:45:08.8837379Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:08.8837721Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:45:08.8837984Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:08.8838270Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.8838541Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:08.8838821Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:08.8839091Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:08.8839400Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:08.8839732Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:08.8840034Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:08.8840339Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:08.8840671Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:08.8840939Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:08.8841208Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:08.8841540Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:08.8841830Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:08.8842088Z #define __LP64__ 1 2025-05-07T19:45:08.8842300Z #define __MMX__ 1 2025-05-07T19:45:08.8842538Z #define __NO_INLINE__ 1 2025-05-07T19:45:08.8842780Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:08.8843053Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:08.8843347Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:08.8843697Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:08.8844026Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:08.8844354Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:08.8844686Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:08.8845260Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:08.8845533Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:08.8845841Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:08.8846102Z #define __PIC__ 2 2025-05-07T19:45:08.8846341Z #define __PIE__ 2 2025-05-07T19:45:08.8846586Z #define __POINTER_WIDTH__ 64 2025-05-07T19:45:08.8846845Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:08.8847140Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:45:08.8847397Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:08.8847680Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:08.8847973Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:08.8848252Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:08.8848500Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:08.8848760Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:08.8848987Z #define __SEG_FS 1 2025-05-07T19:45:08.8849314Z #define __SEG_GS 1 2025-05-07T19:45:08.8849546Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:08.8849780Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:08.8850048Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:08.8850324Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:08.8850599Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:08.8850862Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:08.8851156Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:08.8851408Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:08.8851696Z #define __SIZEOF_INT__ 4 2025-05-07T19:45:08.8851953Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:08.8852261Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:08.8852555Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:08.8852806Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:08.8853105Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:08.8853367Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:08.8853643Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:08.8853897Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:08.8854178Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:08.8854435Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:08.8854722Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:08.8854961Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:08.8855210Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:08.8855460Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.8855767Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:45:08.8856061Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:08.8856291Z #define __SSE2_MATH__ 1 2025-05-07T19:45:08.8856525Z #define __SSE2__ 1 2025-05-07T19:45:08.8856733Z #define __SSE_MATH__ 1 2025-05-07T19:45:08.8856963Z #define __SSE__ 1 2025-05-07T19:45:08.8857172Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:08.8857425Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:08.8857657Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:08.8857935Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:08.8858195Z #define __STDC__ 1 2025-05-07T19:45:08.8858460Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:08.8858752Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:08.8859021Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:08.8859312Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:08.8859654Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:08.8859964Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:08.8860445Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:08.8860802Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:08.8861090Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:08.8861396Z #define __UINT32_FMTo__ "o" 2025-05-07T19:45:08.8861668Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:08.8861965Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:08.8862248Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:08.8862578Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:08.8862916Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:08.8863194Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:08.8863500Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:08.8863782Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:08.8864094Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:08.8864389Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.8864871Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:08.8865199Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:08.8865508Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:45:08.8865813Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:08.8866086Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:08.8866395Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:45:08.8866670Z #define __UINT8_MAX__ 255 2025-05-07T19:45:08.8866978Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:08.8867285Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:08.8867599Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:08.8867882Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:08.8868195Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:08.8868480Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:45:08.8868801Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.8869171Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:08.8869580Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:08.8869884Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:08.8870169Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:08.8870482Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:08.8870766Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:08.8871095Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.8871443Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:45:08.8872014Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:08.8872308Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:08.8872756Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:08.8873087Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:08.8873384Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:08.8873690Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:08.8873991Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:08.8874341Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:08.8874631Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:08.8874944Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:08.8875233Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:08.8875548Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:08.8875867Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:08.8876210Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:08.8876527Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:08.8876821Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:08.8877132Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:08.8877449Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.8877837Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:08.8878165Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:08.8878592Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:08.8878867Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:08.8879161Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:08.8879458Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:08.8879736Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:08.8880074Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:08.8880366Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:08.8880655Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:08.8880923Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:08.8881204Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:08.8881486Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:08.8881802Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:08.8882085Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:08.8882348Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:08.8882632Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:08.8882901Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:08.8883220Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:08.8883517Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:08.8883808Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:08.8884077Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:08.8884369Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:08.8884681Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.8885108Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:08.8885612Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:08.8885895Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:08.8886197Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:08.8886477Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:08.8886777Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:08.8887064Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:08.8887394Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:08.8888031Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:08.8888650Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:08.8888933Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:08.8889181Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:08.8889446Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:08.8889801Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:08.8890094Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:08.8890347Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:08.8890596Z #define __amd64 1 2025-05-07T19:45:08.8890806Z #define __amd64__ 1 2025-05-07T19:45:08.8891038Z #define __clang__ 1 2025-05-07T19:45:08.8891303Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:08.8891603Z #define __clang_major__ 16 2025-05-07T19:45:08.8891870Z #define __clang_minor__ 0 2025-05-07T19:45:08.8892121Z #define __clang_patchlevel__ 6 2025-05-07T19:45:08.8892729Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:08.8893380Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:08.8893731Z #define __code_model_small__ 1 2025-05-07T19:45:08.8893990Z #define __gnu_linux__ 1 2025-05-07T19:45:08.8894230Z #define __k8 1 2025-05-07T19:45:08.8894437Z #define __k8__ 1 2025-05-07T19:45:08.8894662Z #define __linux 1 2025-05-07T19:45:08.8894887Z #define __linux__ 1 2025-05-07T19:45:08.8895100Z #define __llvm__ 1 2025-05-07T19:45:08.8895327Z #define __pic__ 2 2025-05-07T19:45:08.8895536Z #define __pie__ 2 2025-05-07T19:45:08.8895811Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:45:08.8896187Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:08.8896525Z #define __tune_k8__ 1 2025-05-07T19:45:08.8896751Z #define __unix 1 2025-05-07T19:45:08.8896977Z #define __unix__ 1 2025-05-07T19:45:08.8897188Z #define __x86_64 1 2025-05-07T19:45:08.8897414Z #define __x86_64__ 1 2025-05-07T19:45:08.8897649Z #define linux 1 2025-05-07T19:45:08.8897854Z #define unix 1 2025-05-07T19:45:08.8897978Z 2025-05-07T19:45:08.9347644Z 2025-05-07T19:45:08.9348951Z [INFO] Printing out all preprocessor defines in the C++ compiler ... 2025-05-07T19:45:08.9349517Z + conda run -n build_binary c++ -dM -E -x c++ - 2025-05-07T19:45:08.9349809Z 2025-05-07T19:45:10.5494712Z #define _GNU_SOURCE 1 2025-05-07T19:45:10.5495484Z #define _LP64 1 2025-05-07T19:45:10.5495801Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:10.5496234Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:10.5496498Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:10.5496772Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:10.5497025Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:10.5497287Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:10.5497551Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:10.5497872Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:10.5498153Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:10.5498445Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:10.5498777Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:10.5499090Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:10.5499389Z #define __CHAR_BIT__ 8 2025-05-07T19:45:10.5499761Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:10.5500102Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:10.5500442Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:10.5500777Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:10.5501371Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:10.5501695Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:10.5502005Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:10.5502337Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:10.5502657Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:10.5502989Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:10.5503317Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:10.5503601Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:10.5503915Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:10.5504234Z #define __DBL_DIG__ 15 2025-05-07T19:45:10.5504505Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:45:10.5504815Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:45:10.5505093Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:10.5505360Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:10.5505757Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:10.5506040Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:10.5506416Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:10.5506694Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:10.5506990Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:10.5507276Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:10.5507545Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:10.5507872Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:10.5508169Z #define __DEPRECATED 1 2025-05-07T19:45:10.5508413Z #define __ELF__ 1 2025-05-07T19:45:10.5508628Z #define __EXCEPTIONS 1 2025-05-07T19:45:10.5508880Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:10.5509148Z #define __FLOAT128__ 1 2025-05-07T19:45:10.5509379Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:10.5509691Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:10.5510011Z #define __FLT16_DIG__ 3 2025-05-07T19:45:10.5510279Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:10.5510572Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:10.5510852Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:45:10.5511121Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:10.5511407Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:10.5511666Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:10.5511941Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:10.5512215Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:10.5512640Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:10.5512930Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:10.5513192Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:10.5513492Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:10.5513760Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:45:10.5514060Z #define __FLT_DIG__ 6 2025-05-07T19:45:10.5514294Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:10.5514590Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:45:10.5514846Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:10.5515124Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:10.5515394Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:10.5515645Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:10.5515914Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:10.5516162Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:10.5516453Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:10.5516718Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:10.5516993Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:10.5517260Z #define __FLT_RADIX__ 2 2025-05-07T19:45:10.5517496Z #define __FXSR__ 1 2025-05-07T19:45:10.5517720Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:10.5518016Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:10.5518326Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:10.5518633Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:10.5518945Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:10.5519230Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:10.5519534Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:10.5519824Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:10.5520219Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:10.5520520Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:10.5520838Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:10.5521163Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:10.5521462Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:10.5521782Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:10.5522446Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:10.5522856Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:10.5523186Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:10.5523519Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:45:10.5523816Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:45:10.5524127Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:45:10.5524388Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:10.5524767Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:10.5525041Z #define __GNUC__ 4 2025-05-07T19:45:10.5525255Z #define __GNUG__ 4 2025-05-07T19:45:10.5525507Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:10.5525788Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:45:10.5526089Z #define __GXX_RTTI 1 2025-05-07T19:45:10.5526320Z #define __GXX_WEAK__ 1 2025-05-07T19:45:10.5526580Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:10.5526833Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:10.5527098Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:10.5527346Z #define __INT16_MAX__ 32767 2025-05-07T19:45:10.5527611Z #define __INT16_TYPE__ short 2025-05-07T19:45:10.5527882Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:10.5528128Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:10.5528387Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:10.5528635Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:10.5528915Z #define __INT32_TYPE__ int 2025-05-07T19:45:10.5529160Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:10.5529430Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:10.5529682Z #define __INT64_FMTi__ "li" 2025-05-07T19:45:10.5529958Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:10.5530256Z #define __INT64_TYPE__ long int 2025-05-07T19:45:10.5530533Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:10.5530780Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:10.5531043Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:10.5531305Z #define __INT8_MAX__ 127 2025-05-07T19:45:10.5531555Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:10.5531848Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:10.5532137Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:10.5532413Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:45:10.5532686Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:10.5533005Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:10.5533278Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:10.5533557Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:10.5533824Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:10.5534117Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:10.5534426Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:10.5534719Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:45:10.5535097Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:10.5535370Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:10.5535644Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:10.5535900Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:10.5536178Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:10.5536431Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:10.5536699Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:10.5536952Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:10.5537238Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:10.5537540Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:10.5537809Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:10.5538067Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:10.5538357Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:10.5538679Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:45:10.5538954Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:10.5539224Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:45:10.5539659Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:10.5539927Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:10.5540365Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:10.5540671Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:10.5541021Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:10.5541316Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:10.5541587Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:10.5541882Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:10.5542177Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:10.5542446Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:10.5542726Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:10.5543001Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:10.5543309Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:10.5543579Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:10.5543940Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:10.5544220Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:10.5544542Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:10.5544865Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:10.5545168Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:10.5545457Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:10.5545735Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:10.5546026Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:10.5546303Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:10.5546612Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:10.5546874Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:10.5547145Z #define __INT_WIDTH__ 32 2025-05-07T19:45:10.5547394Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:10.5547724Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:10.5548066Z #define __LDBL_DIG__ 18 2025-05-07T19:45:10.5548356Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:10.5548700Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:45:10.5548965Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:10.5549253Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:10.5549522Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:10.5549796Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:10.5550067Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:10.5550368Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:10.5550691Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:10.5550988Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:10.5551299Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:10.5551617Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:10.5551886Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:10.5552163Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:10.5552598Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:10.5552869Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:10.5553107Z #define __LP64__ 1 2025-05-07T19:45:10.5553310Z #define __MMX__ 1 2025-05-07T19:45:10.5553526Z #define __NO_INLINE__ 1 2025-05-07T19:45:10.5553751Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:10.5554010Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:10.5554298Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:10.5554609Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:10.5554913Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:10.5555211Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:10.5555519Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:10.5555806Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:10.5556083Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:10.5556353Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:10.5556612Z #define __PIC__ 2 2025-05-07T19:45:10.5556812Z #define __PIE__ 2 2025-05-07T19:45:10.5557036Z #define __POINTER_WIDTH__ 64 2025-05-07T19:45:10.5557303Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:10.5557574Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:45:10.5557849Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:10.5558111Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:10.5558493Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:10.5558759Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:10.5559021Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:10.5559259Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:10.5559497Z #define __SEG_FS 1 2025-05-07T19:45:10.5559697Z #define __SEG_GS 1 2025-05-07T19:45:10.5559916Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:10.5560162Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:10.5560396Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:10.5560679Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:10.5560928Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:10.5561184Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:10.5561432Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:10.5561681Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:10.5561922Z #define __SIZEOF_INT__ 4 2025-05-07T19:45:10.5562259Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:10.5562521Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:10.5562794Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:10.5563047Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:10.5563298Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:10.5563563Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:10.5563799Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:10.5564059Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:10.5564305Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:10.5564560Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:10.5564794Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:10.5565048Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:10.5565283Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:10.5565553Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.5565847Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:45:10.5566141Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:10.5566387Z #define __SSE2_MATH__ 1 2025-05-07T19:45:10.5566606Z #define __SSE2__ 1 2025-05-07T19:45:10.5566837Z #define __SSE_MATH__ 1 2025-05-07T19:45:10.5567047Z #define __SSE__ 1 2025-05-07T19:45:10.5567303Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:45:10.5567602Z #define __STDCPP_THREADS__ 1 2025-05-07T19:45:10.5567856Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:10.5568080Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:10.5568316Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:10.5568533Z #define __STDC__ 1 2025-05-07T19:45:10.5568751Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:10.5569001Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:10.5569236Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:10.5569484Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:10.5569720Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:10.5569970Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:10.5570217Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:10.5570505Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:10.5570750Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:10.5571005Z #define __UINT32_FMTo__ "o" 2025-05-07T19:45:10.5571240Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:10.5571487Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:10.5571741Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:10.5572001Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:10.5572280Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:10.5572522Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:10.5572777Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:10.5573013Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:10.5573260Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:10.5573514Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.5573828Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:10.5574107Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:10.5574358Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:45:10.5574611Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:10.5574847Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:10.5575098Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:45:10.5575332Z #define __UINT8_MAX__ 255 2025-05-07T19:45:10.5575587Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:10.5575855Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:10.5576237Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:10.5576487Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:10.5576752Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:10.5577058Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:45:10.5577335Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.5577640Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:10.5577941Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:10.5578187Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:10.5578452Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:10.5578700Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:10.5578956Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:10.5579218Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.5579535Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:45:10.5579901Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:10.5580422Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:10.5580721Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:10.5581064Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:10.5581354Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:10.5581625Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:10.5581931Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:10.5582238Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:10.5582526Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:10.5582794Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:10.5583075Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:10.5583351Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:10.5583672Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:10.5583993Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:10.5584271Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:10.5584561Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:10.5584838Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:10.5585160Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.5585511Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:10.5585847Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:10.5586122Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:10.5586408Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:10.5586697Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:10.5586972Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:10.5587263Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:10.5587572Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:10.5587892Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:10.5588196Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:10.5588531Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:10.5588831Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:10.5589177Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:10.5589515Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:10.5589840Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:10.5590163Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:10.5590463Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:10.5590793Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:10.5591129Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:10.5591487Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:10.5591793Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:10.5592124Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:10.5592528Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:10.5592868Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.5593249Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:10.5593571Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:10.5593894Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:10.5594184Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:10.5594492Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:10.5594775Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:10.5595112Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:10.5595430Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:10.5596135Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:10.5596780Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:10.5597062Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:10.5597348Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:10.5597600Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:10.5597898Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:10.5598160Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:10.5598410Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:10.5598629Z #define __amd64 1 2025-05-07T19:45:10.5598840Z #define __amd64__ 1 2025-05-07T19:45:10.5599038Z #define __clang__ 1 2025-05-07T19:45:10.5599279Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:10.5599562Z #define __clang_major__ 16 2025-05-07T19:45:10.5599806Z #define __clang_minor__ 0 2025-05-07T19:45:10.5600122Z #define __clang_patchlevel__ 6 2025-05-07T19:45:10.5600698Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:10.5601373Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:10.5601700Z #define __code_model_small__ 1 2025-05-07T19:45:10.5601994Z #define __cplusplus 201703L 2025-05-07T19:45:10.5602272Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:45:10.5602601Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:45:10.5602894Z #define __cpp_alias_templates 200704L 2025-05-07T19:45:10.5603184Z #define __cpp_aligned_new 201606L 2025-05-07T19:45:10.5603466Z #define __cpp_attributes 200809L 2025-05-07T19:45:10.5603731Z #define __cpp_binary_literals 201304L 2025-05-07T19:45:10.5604024Z #define __cpp_capture_star_this 201603L 2025-05-07T19:45:10.5604308Z #define __cpp_constexpr 201603L 2025-05-07T19:45:10.5604597Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:45:10.5604892Z #define __cpp_decltype 200707L 2025-05-07T19:45:10.5605166Z #define __cpp_decltype_auto 201304L 2025-05-07T19:45:10.5605463Z #define __cpp_deduction_guides 201703L 2025-05-07T19:45:10.5605780Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:45:10.5606261Z #define __cpp_digit_separators 201309L 2025-05-07T19:45:10.5606565Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:45:10.5606893Z #define __cpp_exceptions 199711L 2025-05-07T19:45:10.5607158Z #define __cpp_fold_expressions 201603L 2025-05-07T19:45:10.5607453Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:45:10.5607747Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:45:10.5608057Z #define __cpp_hex_float 201603L 2025-05-07T19:45:10.5608314Z #define __cpp_if_constexpr 201606L 2025-05-07T19:45:10.5608610Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:45:10.5608924Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:45:10.5609238Z #define __cpp_init_captures 201304L 2025-05-07T19:45:10.5609531Z #define __cpp_initializer_lists 200806L 2025-05-07T19:45:10.5609819Z #define __cpp_inline_variables 201606L 2025-05-07T19:45:10.5610113Z #define __cpp_lambdas 200907L 2025-05-07T19:45:10.5610386Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:45:10.5610711Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:45:10.5611033Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:45:10.5611384Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:45:10.5611692Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:45:10.5612039Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:45:10.5612374Z #define __cpp_nsdmi 200809L 2025-05-07T19:45:10.5612619Z #define __cpp_range_based_for 201603L 2025-05-07T19:45:10.5612905Z #define __cpp_raw_strings 200710L 2025-05-07T19:45:10.5613170Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:45:10.5613468Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:45:10.5613753Z #define __cpp_rtti 199711L 2025-05-07T19:45:10.5614014Z #define __cpp_rvalue_references 200610L 2025-05-07T19:45:10.5614294Z #define __cpp_static_assert 201411L 2025-05-07T19:45:10.5614667Z #define __cpp_static_call_operator 202207L 2025-05-07T19:45:10.5614982Z #define __cpp_structured_bindings 201606L 2025-05-07T19:45:10.5615270Z #define __cpp_template_auto 201606L 2025-05-07T19:45:10.5615570Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:45:10.5615871Z #define __cpp_unicode_characters 200704L 2025-05-07T19:45:10.5616184Z #define __cpp_unicode_literals 200710L 2025-05-07T19:45:10.5616474Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:45:10.5616789Z #define __cpp_variable_templates 201304L 2025-05-07T19:45:10.5617085Z #define __cpp_variadic_templates 200704L 2025-05-07T19:45:10.5617386Z #define __cpp_variadic_using 201611L 2025-05-07T19:45:10.5617649Z #define __gnu_linux__ 1 2025-05-07T19:45:10.5617878Z #define __k8 1 2025-05-07T19:45:10.5618081Z #define __k8__ 1 2025-05-07T19:45:10.5618277Z #define __linux 1 2025-05-07T19:45:10.5618551Z #define __linux__ 1 2025-05-07T19:45:10.5618751Z #define __llvm__ 1 2025-05-07T19:45:10.5618963Z #define __pic__ 2 2025-05-07T19:45:10.5619160Z #define __pie__ 2 2025-05-07T19:45:10.5619386Z #define __private_extern__ extern 2025-05-07T19:45:10.5619764Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:45:10.5620315Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:10.5620649Z #define __tune_k8__ 1 2025-05-07T19:45:10.5620984Z #define __unix 1 2025-05-07T19:45:10.5621198Z #define __unix__ 1 2025-05-07T19:45:10.5621430Z #define __x86_64 1 2025-05-07T19:45:10.5621669Z #define __x86_64__ 1 2025-05-07T19:45:10.5621890Z #define linux 1 2025-05-07T19:45:10.5622300Z #define unix 1 2025-05-07T19:45:10.5622428Z 2025-05-07T19:45:10.6273148Z 2025-05-07T19:45:10.6273582Z + conda run -n build_binary c++ --version 2025-05-07T19:45:10.6273812Z 2025-05-07T19:45:12.2384894Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:45:12.2386730Z Target: x86_64-conda-linux-gnu 2025-05-07T19:45:12.2387555Z Thread model: posix 2025-05-07T19:45:12.2388468Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:45:12.2390322Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:45:12.2391686Z 2025-05-07T19:45:12.2955337Z 2025-05-07T19:45:12.2955985Z [INFO] Printing the default version of the C standard used by the compiler ... 2025-05-07T19:45:12.2956560Z + conda run -n build_binary cc -dM -E - < /dev/null | grep __STDC_VERSION__ 2025-05-07T19:45:12.2956908Z 2025-05-07T19:45:13.9870124Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:13.9875534Z 2025-05-07T19:45:13.9876335Z [INFO] Printing the default version of the C++ standard used by the compiler ... 2025-05-07T19:45:13.9878192Z + conda run -n build_binary c++ -dM -E -x c++ - < /dev/null | grep __cplusplus 2025-05-07T19:45:13.9894611Z 2025-05-07T19:45:15.6685324Z #define __cplusplus 201703L 2025-05-07T19:45:15.6689083Z 2025-05-07T19:45:15.6690071Z [INSTALL] Successfully installed C/C++ compilers 2025-05-07T19:45:15.6767699Z ##[group]Run . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:15.6768235Z . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:15.6769149Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:15.6769525Z env: 2025-05-07T19:45:15.6769803Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:15.6770130Z BUILD_ENV: build_binary 2025-05-07T19:45:15.6770428Z BUILD_TARGET: default 2025-05-07T19:45:15.6770679Z BUILD_VARIANT: cuda 2025-05-07T19:45:15.6770960Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:45:15.6771225Z ##[endgroup] 2025-05-07T19:45:16.0811688Z ################################################################################ 2025-05-07T19:45:16.0812869Z # Install Build Tools 2025-05-07T19:45:16.0813569Z # 2025-05-07T19:45:16.0823860Z # [2025-05-07T19:45:16.081Z] + install_build_tools build_binary 2025-05-07T19:45:16.0824318Z ################################################################################ 2025-05-07T19:45:16.0824981Z 2025-05-07T19:45:16.0843204Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:16.1665912Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:16.1669644Z [INSTALL] Installing build tools ... 2025-05-07T19:45:16.1695860Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y auditwheel bazel cmake>=3.30 hypothesis jinja2 make ncurses ninja openblas patchelf rhash scikit-build wheel pyyaml 2025-05-07T19:45:16.8863521Z Channels: 2025-05-07T19:45:16.8863809Z - conda-forge 2025-05-07T19:45:16.8864069Z Platform: linux-64 2025-05-07T19:45:19.9037057Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:45:23.5903647Z Solving environment: \ | / - done 2025-05-07T19:45:23.6517214Z 2025-05-07T19:45:23.6517448Z ## Package Plan ## 2025-05-07T19:45:23.6517632Z 2025-05-07T19:45:23.6517914Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:23.6518279Z 2025-05-07T19:45:23.6518411Z added / updated specs: 2025-05-07T19:45:23.6518686Z - auditwheel 2025-05-07T19:45:23.6518902Z - bazel 2025-05-07T19:45:23.6519130Z - cmake[version='>=3.30'] 2025-05-07T19:45:23.6519384Z - hypothesis 2025-05-07T19:45:23.6519606Z - jinja2 2025-05-07T19:45:23.6519818Z - make 2025-05-07T19:45:23.6520008Z - ncurses 2025-05-07T19:45:23.6520221Z - ninja 2025-05-07T19:45:23.6520412Z - openblas 2025-05-07T19:45:23.6520633Z - patchelf 2025-05-07T19:45:23.6520834Z - pyyaml 2025-05-07T19:45:23.6521041Z - rhash 2025-05-07T19:45:23.6521234Z - scikit-build 2025-05-07T19:45:23.6521455Z - wheel 2025-05-07T19:45:23.6521565Z 2025-05-07T19:45:23.6521569Z 2025-05-07T19:45:23.6521688Z The following packages will be downloaded: 2025-05-07T19:45:23.6521923Z 2025-05-07T19:45:23.6522247Z package | build 2025-05-07T19:45:23.6522592Z ---------------------------|----------------- 2025-05-07T19:45:23.6522990Z alsa-lib-1.2.14 | hb9d3cd8_0 553 KB conda-forge 2025-05-07T19:45:23.6523439Z attrs-25.3.0 | pyh71513ae_0 56 KB conda-forge 2025-05-07T19:45:23.6523879Z auditwheel-6.2.0 | pyha804496_1 40 KB conda-forge 2025-05-07T19:45:23.6524321Z bazel-7.5.0 | h96810dc_2 47.4 MB conda-forge 2025-05-07T19:45:23.6524730Z c-ares-1.34.5 | hb9d3cd8_0 202 KB conda-forge 2025-05-07T19:45:23.6525151Z cairo-1.18.0 | hbb29018_2 961 KB conda-forge 2025-05-07T19:45:23.6525567Z click-8.1.8 | pyh707e725_0 83 KB conda-forge 2025-05-07T19:45:23.6525971Z cmake-4.0.2 | h74e3db0_0 19.4 MB conda-forge 2025-05-07T19:45:23.6526394Z distro-1.9.0 | pyhd8ed1ab_1 41 KB conda-forge 2025-05-07T19:45:23.6527215Z exceptiongroup-1.2.2 | pyhd8ed1ab_1 20 KB conda-forge 2025-05-07T19:45:23.6527842Z font-ttf-dejavu-sans-mono-2.37| hab24e00_0 388 KB conda-forge 2025-05-07T19:45:23.6528406Z font-ttf-inconsolata-3.000 | h77eed37_0 94 KB conda-forge 2025-05-07T19:45:23.6528966Z font-ttf-source-code-pro-2.038| h77eed37_0 684 KB conda-forge 2025-05-07T19:45:23.6529483Z font-ttf-ubuntu-0.83 | h77eed37_3 1.5 MB conda-forge 2025-05-07T19:45:23.6529966Z fontconfig-2.15.0 | h7e30c49_1 259 KB conda-forge 2025-05-07T19:45:23.6530449Z fonts-conda-ecosystem-1 | 0 4 KB conda-forge 2025-05-07T19:45:23.6530956Z fonts-conda-forge-1 | 0 4 KB conda-forge 2025-05-07T19:45:23.6531423Z freetype-2.13.3 | ha770c72_1 168 KB conda-forge 2025-05-07T19:45:23.6531845Z giflib-5.2.2 | hd590300_0 75 KB conda-forge 2025-05-07T19:45:23.6532422Z graphite2-1.3.13 | h59595ed_1003 95 KB conda-forge 2025-05-07T19:45:23.6532862Z harfbuzz-9.0.0 | hfac3d4d_0 1.5 MB conda-forge 2025-05-07T19:45:23.6533333Z hypothesis-6.131.14 | pyha770c72_0 348 KB conda-forge 2025-05-07T19:45:23.6533872Z ijar-7.5.0 | h5888daf_0 114 KB conda-forge 2025-05-07T19:45:23.6534287Z jinja2-3.1.6 | pyhd8ed1ab_0 110 KB conda-forge 2025-05-07T19:45:23.6534716Z keyutils-1.6.1 | h166bdaf_0 115 KB conda-forge 2025-05-07T19:45:23.6535114Z krb5-1.21.3 | h659f571_0 1.3 MB conda-forge 2025-05-07T19:45:23.6535603Z lcms2-2.17 | h717163a_0 242 KB conda-forge 2025-05-07T19:45:23.6535966Z lerc-4.0.0 | h0aef613_1 258 KB conda-forge 2025-05-07T19:45:23.6536393Z libabseil-20250127.1 | cxx17_hbbce691_0 1.3 MB conda-forge 2025-05-07T19:45:23.6536826Z libcups-2.3.3 | h4637d8d_4 4.3 MB conda-forge 2025-05-07T19:45:23.6537228Z libcurl-8.13.0 | h332b0f4_0 428 KB conda-forge 2025-05-07T19:45:23.6537638Z libdeflate-1.23 | h86f0d12_0 71 KB conda-forge 2025-05-07T19:45:23.6538073Z libedit-3.1.20250104 | pl5321h7949ede_0 132 KB conda-forge 2025-05-07T19:45:23.6538497Z libev-4.33 | hd590300_2 110 KB conda-forge 2025-05-07T19:45:23.6538881Z libexpat-2.7.0 | h5888daf_0 73 KB conda-forge 2025-05-07T19:45:23.6539305Z libfreetype-2.13.3 | ha770c72_1 8 KB conda-forge 2025-05-07T19:45:23.6539989Z libfreetype6-2.13.3 | h48d6fc4_1 371 KB conda-forge 2025-05-07T19:45:23.6540450Z libgfortran-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:45:23.6540940Z libgfortran5-15.1.0 | hcea5267_2 1.5 MB conda-forge 2025-05-07T19:45:23.6541388Z libglib-2.84.0 | h2ff4ddf_0 3.8 MB conda-forge 2025-05-07T19:45:23.6541827Z libgrpc-1.71.0 | h8e591d7_1 7.6 MB conda-forge 2025-05-07T19:45:23.6542271Z libjpeg-turbo-3.1.0 | hb9d3cd8_0 614 KB conda-forge 2025-05-07T19:45:23.6542728Z liblzma-5.8.1 | hb9d3cd8_1 110 KB conda-forge 2025-05-07T19:45:23.6543182Z liblzma-devel-5.8.1 | hb9d3cd8_1 431 KB conda-forge 2025-05-07T19:45:23.6543640Z libnghttp2-1.64.0 | h161d5f1_0 632 KB conda-forge 2025-05-07T19:45:23.6544080Z libnsl-2.0.1 | hd590300_0 33 KB conda-forge 2025-05-07T19:45:23.6544534Z libopenblas-0.3.29 |pthreads_h94d23a6_0 5.6 MB conda-forge 2025-05-07T19:45:23.6545002Z libpng-1.6.47 | h943b412_0 282 KB conda-forge 2025-05-07T19:45:23.6545560Z libprotobuf-5.29.3 | h501fc15_1 3.2 MB conda-forge 2025-05-07T19:45:23.6546029Z libre2-11-2024.07.02 | hba17884_3 205 KB conda-forge 2025-05-07T19:45:23.6546488Z libsqlite-3.49.2 | hee588c1_0 895 KB conda-forge 2025-05-07T19:45:23.6546918Z libssh2-1.11.1 | hcf80075_0 298 KB conda-forge 2025-05-07T19:45:23.6547349Z libtiff-4.7.0 | hd9ff511_4 419 KB conda-forge 2025-05-07T19:45:23.6547768Z libuuid-2.38.1 | h0b41bf4_0 33 KB conda-forge 2025-05-07T19:45:23.6548195Z libuv-1.50.0 | hb9d3cd8_0 870 KB conda-forge 2025-05-07T19:45:23.6548640Z libwebp-base-1.5.0 | h851e524_0 420 KB conda-forge 2025-05-07T19:45:23.6549077Z libxcb-1.17.0 | h8a09558_0 387 KB conda-forge 2025-05-07T19:45:23.6549512Z libzlib-1.3.1 | hb9d3cd8_2 60 KB conda-forge 2025-05-07T19:45:23.6549992Z make-4.4.1 | hb9d3cd8_2 501 KB conda-forge 2025-05-07T19:45:23.6550434Z markupsafe-3.0.2 | py310h89163eb_1 23 KB conda-forge 2025-05-07T19:45:23.6550867Z ncurses-6.5 | h2d0b736_3 871 KB conda-forge 2025-05-07T19:45:23.6551290Z ninja-1.12.1 | hff21bea_1 158 KB conda-forge 2025-05-07T19:45:23.6551748Z openblas-0.3.29 |pthreads_h6ec200e_0 5.8 MB conda-forge 2025-05-07T19:45:23.6552303Z openjdk-23.0.1 | h4c11d01_0 181.3 MB conda-forge 2025-05-07T19:45:23.6552831Z packaging-25.0 | pyh29332c3_1 61 KB conda-forge 2025-05-07T19:45:23.6553236Z patchelf-0.18.0 | h3f2d84a_2 133 KB conda-forge 2025-05-07T19:45:23.6553636Z pcre2-10.44 | hc749103_2 934 KB conda-forge 2025-05-07T19:45:23.6554037Z pixman-0.46.0 | h29eaf8c_0 389 KB conda-forge 2025-05-07T19:45:23.6554449Z pthread-stubs-0.4 | hb9d3cd8_1002 8 KB conda-forge 2025-05-07T19:45:23.6554888Z pyelftools-0.32 | pyh707e725_1 146 KB conda-forge 2025-05-07T19:45:23.6555308Z python-3.10.17 |hd6af730_0_cpython 23.9 MB conda-forge 2025-05-07T19:45:23.6555726Z pyyaml-6.0.2 | py310h89163eb_2 178 KB conda-forge 2025-05-07T19:45:23.6556109Z re2-2024.07.02 | h9925aae_3 26 KB conda-forge 2025-05-07T19:45:23.6556496Z rhash-1.4.5 | hb9d3cd8_0 183 KB conda-forge 2025-05-07T19:45:23.6556918Z scikit-build-0.18.1 | pyhae55e72_2 114 KB conda-forge 2025-05-07T19:45:23.6557339Z singlejar-7.5.0 | h0e684df_1 122 KB conda-forge 2025-05-07T19:45:23.6557796Z sortedcontainers-2.4.0 | pyhd8ed1ab_1 28 KB conda-forge 2025-05-07T19:45:23.6558224Z sqlite-3.49.2 | h9eae976_0 840 KB conda-forge 2025-05-07T19:45:23.6558613Z tk-8.6.13 |noxft_h4845f30_101 3.2 MB conda-forge 2025-05-07T19:45:23.6558987Z tomli-2.2.1 | pyhd8ed1ab_1 19 KB conda-forge 2025-05-07T19:45:23.6559382Z wheel-0.45.1 | pyhd8ed1ab_1 61 KB conda-forge 2025-05-07T19:45:23.6559797Z xorg-libice-1.1.2 | hb9d3cd8_0 57 KB conda-forge 2025-05-07T19:45:23.6560206Z xorg-libsm-1.2.6 | he73a12e_0 27 KB conda-forge 2025-05-07T19:45:23.6560629Z xorg-libx11-1.8.12 | h4f16b4b_0 816 KB conda-forge 2025-05-07T19:45:23.6561045Z xorg-libxau-1.0.12 | hb9d3cd8_0 14 KB conda-forge 2025-05-07T19:45:23.6561486Z xorg-libxdmcp-1.1.5 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:23.6561996Z xorg-libxext-1.3.6 | hb9d3cd8_0 49 KB conda-forge 2025-05-07T19:45:23.6562439Z xorg-libxfixes-6.0.1 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:23.6562881Z xorg-libxi-1.8.2 | hb9d3cd8_0 46 KB conda-forge 2025-05-07T19:45:23.6563305Z xorg-libxrandr-1.5.4 | hb9d3cd8_0 29 KB conda-forge 2025-05-07T19:45:23.6563769Z xorg-libxrender-0.9.12 | hb9d3cd8_0 32 KB conda-forge 2025-05-07T19:45:23.6564218Z xorg-libxt-1.3.1 | hb9d3cd8_0 371 KB conda-forge 2025-05-07T19:45:23.6564641Z xorg-libxtst-1.2.5 | hb9d3cd8_3 32 KB conda-forge 2025-05-07T19:45:23.6565048Z xz-5.8.1 | hbcc6ac9_1 23 KB conda-forge 2025-05-07T19:45:23.6565432Z xz-gpl-tools-5.8.1 | hbcc6ac9_1 33 KB conda-forge 2025-05-07T19:45:23.6565853Z xz-tools-5.8.1 | hb9d3cd8_1 94 KB conda-forge 2025-05-07T19:45:23.6566303Z yaml-0.2.5 | h7f98852_2 87 KB conda-forge 2025-05-07T19:45:23.6566662Z zlib-1.3.1 | hb9d3cd8_2 90 KB conda-forge 2025-05-07T19:45:23.6567034Z zstd-1.5.7 | hb8e6e7a_2 554 KB conda-forge 2025-05-07T19:45:23.6567392Z ------------------------------------------------------------ 2025-05-07T19:45:23.6567731Z Total: 331.2 MB 2025-05-07T19:45:23.6567934Z 2025-05-07T19:45:23.6568057Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:23.6568286Z 2025-05-07T19:45:23.6568492Z alsa-lib conda-forge/linux-64::alsa-lib-1.2.14-hb9d3cd8_0 2025-05-07T19:45:23.6568933Z attrs conda-forge/noarch::attrs-25.3.0-pyh71513ae_0 2025-05-07T19:45:23.6569367Z auditwheel conda-forge/noarch::auditwheel-6.2.0-pyha804496_1 2025-05-07T19:45:23.6569814Z bazel conda-forge/linux-64::bazel-7.5.0-h96810dc_2 2025-05-07T19:45:23.6570214Z c-ares conda-forge/linux-64::c-ares-1.34.5-hb9d3cd8_0 2025-05-07T19:45:23.6570813Z cairo conda-forge/linux-64::cairo-1.18.0-hbb29018_2 2025-05-07T19:45:23.6571246Z click conda-forge/noarch::click-8.1.8-pyh707e725_0 2025-05-07T19:45:23.6571848Z cmake conda-forge/linux-64::cmake-4.0.2-h74e3db0_0 2025-05-07T19:45:23.6572331Z distro conda-forge/noarch::distro-1.9.0-pyhd8ed1ab_1 2025-05-07T19:45:23.6572869Z exceptiongroup conda-forge/noarch::exceptiongroup-1.2.2-pyhd8ed1ab_1 2025-05-07T19:45:23.6573545Z font-ttf-dejavu-s~ conda-forge/noarch::font-ttf-dejavu-sans-mono-2.37-hab24e00_0 2025-05-07T19:45:23.6574252Z font-ttf-inconsol~ conda-forge/noarch::font-ttf-inconsolata-3.000-h77eed37_0 2025-05-07T19:45:23.6574912Z font-ttf-source-c~ conda-forge/noarch::font-ttf-source-code-pro-2.038-h77eed37_0 2025-05-07T19:45:23.6575581Z font-ttf-ubuntu conda-forge/noarch::font-ttf-ubuntu-0.83-h77eed37_3 2025-05-07T19:45:23.6576139Z fontconfig conda-forge/linux-64::fontconfig-2.15.0-h7e30c49_1 2025-05-07T19:45:23.6576707Z fonts-conda-ecosy~ conda-forge/noarch::fonts-conda-ecosystem-1-0 2025-05-07T19:45:23.6577272Z fonts-conda-forge conda-forge/noarch::fonts-conda-forge-1-0 2025-05-07T19:45:23.6577775Z freetype conda-forge/linux-64::freetype-2.13.3-ha770c72_1 2025-05-07T19:45:23.6578266Z giflib conda-forge/linux-64::giflib-5.2.2-hd590300_0 2025-05-07T19:45:23.6578757Z graphite2 conda-forge/linux-64::graphite2-1.3.13-h59595ed_1003 2025-05-07T19:45:23.6579287Z harfbuzz conda-forge/linux-64::harfbuzz-9.0.0-hfac3d4d_0 2025-05-07T19:45:23.6579885Z hypothesis conda-forge/noarch::hypothesis-6.131.14-pyha770c72_0 2025-05-07T19:45:23.6580350Z ijar conda-forge/linux-64::ijar-7.5.0-h5888daf_0 2025-05-07T19:45:23.6580781Z jinja2 conda-forge/noarch::jinja2-3.1.6-pyhd8ed1ab_0 2025-05-07T19:45:23.6581320Z keyutils conda-forge/linux-64::keyutils-1.6.1-h166bdaf_0 2025-05-07T19:45:23.6581771Z krb5 conda-forge/linux-64::krb5-1.21.3-h659f571_0 2025-05-07T19:45:23.6582196Z lcms2 conda-forge/linux-64::lcms2-2.17-h717163a_0 2025-05-07T19:45:23.6582601Z lerc conda-forge/linux-64::lerc-4.0.0-h0aef613_1 2025-05-07T19:45:23.6583095Z libabseil conda-forge/linux-64::libabseil-20250127.1-cxx17_hbbce691_0 2025-05-07T19:45:23.6583599Z libcups conda-forge/linux-64::libcups-2.3.3-h4637d8d_4 2025-05-07T19:45:23.6584061Z libcurl conda-forge/linux-64::libcurl-8.13.0-h332b0f4_0 2025-05-07T19:45:23.6584528Z libdeflate conda-forge/linux-64::libdeflate-1.23-h86f0d12_0 2025-05-07T19:45:23.6585052Z libedit conda-forge/linux-64::libedit-3.1.20250104-pl5321h7949ede_0 2025-05-07T19:45:23.6585579Z libev conda-forge/linux-64::libev-4.33-hd590300_2 2025-05-07T19:45:23.6586122Z libexpat conda-forge/linux-64::libexpat-2.7.0-h5888daf_0 2025-05-07T19:45:23.6586675Z libfreetype conda-forge/linux-64::libfreetype-2.13.3-ha770c72_1 2025-05-07T19:45:23.6587234Z libfreetype6 conda-forge/linux-64::libfreetype6-2.13.3-h48d6fc4_1 2025-05-07T19:45:23.6587831Z libgfortran conda-forge/linux-64::libgfortran-15.1.0-h69a702a_2 2025-05-07T19:45:23.6588426Z libgfortran5 conda-forge/linux-64::libgfortran5-15.1.0-hcea5267_2 2025-05-07T19:45:23.6588958Z libglib conda-forge/linux-64::libglib-2.84.0-h2ff4ddf_0 2025-05-07T19:45:23.6589467Z libgrpc conda-forge/linux-64::libgrpc-1.71.0-h8e591d7_1 2025-05-07T19:45:23.6589988Z libjpeg-turbo conda-forge/linux-64::libjpeg-turbo-3.1.0-hb9d3cd8_0 2025-05-07T19:45:23.6590543Z liblzma conda-forge/linux-64::liblzma-5.8.1-hb9d3cd8_1 2025-05-07T19:45:23.6591090Z liblzma-devel conda-forge/linux-64::liblzma-devel-5.8.1-hb9d3cd8_1 2025-05-07T19:45:23.6591646Z libnghttp2 conda-forge/linux-64::libnghttp2-1.64.0-h161d5f1_0 2025-05-07T19:45:23.6592244Z libnsl conda-forge/linux-64::libnsl-2.0.1-hd590300_0 2025-05-07T19:45:23.6592742Z libopenblas conda-forge/linux-64::libopenblas-0.3.29-pthreads_h94d23a6_0 2025-05-07T19:45:23.6593273Z libpng conda-forge/linux-64::libpng-1.6.47-h943b412_0 2025-05-07T19:45:23.6593763Z libprotobuf conda-forge/linux-64::libprotobuf-5.29.3-h501fc15_1 2025-05-07T19:45:23.6594254Z libre2-11 conda-forge/linux-64::libre2-11-2024.07.02-hba17884_3 2025-05-07T19:45:23.6594746Z libsqlite conda-forge/linux-64::libsqlite-3.49.2-hee588c1_0 2025-05-07T19:45:23.6595206Z libssh2 conda-forge/linux-64::libssh2-1.11.1-hcf80075_0 2025-05-07T19:45:23.6595668Z libtiff conda-forge/linux-64::libtiff-4.7.0-hd9ff511_4 2025-05-07T19:45:23.6596128Z libuv conda-forge/linux-64::libuv-1.50.0-hb9d3cd8_0 2025-05-07T19:45:23.6596606Z libwebp-base conda-forge/linux-64::libwebp-base-1.5.0-h851e524_0 2025-05-07T19:45:23.6597118Z libxcb conda-forge/linux-64::libxcb-1.17.0-h8a09558_0 2025-05-07T19:45:23.6597535Z make conda-forge/linux-64::make-4.4.1-hb9d3cd8_2 2025-05-07T19:45:23.6598030Z markupsafe conda-forge/linux-64::markupsafe-3.0.2-py310h89163eb_1 2025-05-07T19:45:23.6598527Z ninja conda-forge/linux-64::ninja-1.12.1-hff21bea_1 2025-05-07T19:45:23.6598997Z openblas conda-forge/linux-64::openblas-0.3.29-pthreads_h6ec200e_0 2025-05-07T19:45:23.6599514Z openjdk conda-forge/linux-64::openjdk-23.0.1-h4c11d01_0 2025-05-07T19:45:23.6599970Z packaging conda-forge/noarch::packaging-25.0-pyh29332c3_1 2025-05-07T19:45:23.6600463Z patchelf conda-forge/linux-64::patchelf-0.18.0-h3f2d84a_2 2025-05-07T19:45:23.6600928Z pcre2 conda-forge/linux-64::pcre2-10.44-hc749103_2 2025-05-07T19:45:23.6601408Z pixman conda-forge/linux-64::pixman-0.46.0-h29eaf8c_0 2025-05-07T19:45:23.6601925Z pthread-stubs conda-forge/linux-64::pthread-stubs-0.4-hb9d3cd8_1002 2025-05-07T19:45:23.6602440Z pyelftools conda-forge/noarch::pyelftools-0.32-pyh707e725_1 2025-05-07T19:45:23.6602935Z pyyaml conda-forge/linux-64::pyyaml-6.0.2-py310h89163eb_2 2025-05-07T19:45:23.6603529Z re2 conda-forge/linux-64::re2-2024.07.02-h9925aae_3 2025-05-07T19:45:23.6603989Z rhash conda-forge/linux-64::rhash-1.4.5-hb9d3cd8_0 2025-05-07T19:45:23.6604508Z scikit-build conda-forge/noarch::scikit-build-0.18.1-pyhae55e72_2 2025-05-07T19:45:23.6605199Z singlejar conda-forge/linux-64::singlejar-7.5.0-h0e684df_1 2025-05-07T19:45:23.6605797Z sortedcontainers conda-forge/noarch::sortedcontainers-2.4.0-pyhd8ed1ab_1 2025-05-07T19:45:23.6606348Z tomli conda-forge/noarch::tomli-2.2.1-pyhd8ed1ab_1 2025-05-07T19:45:23.6606867Z xorg-libice conda-forge/linux-64::xorg-libice-1.1.2-hb9d3cd8_0 2025-05-07T19:45:23.6607467Z xorg-libsm conda-forge/linux-64::xorg-libsm-1.2.6-he73a12e_0 2025-05-07T19:45:23.6607994Z xorg-libx11 conda-forge/linux-64::xorg-libx11-1.8.12-h4f16b4b_0 2025-05-07T19:45:23.6608555Z xorg-libxau conda-forge/linux-64::xorg-libxau-1.0.12-hb9d3cd8_0 2025-05-07T19:45:23.6609107Z xorg-libxdmcp conda-forge/linux-64::xorg-libxdmcp-1.1.5-hb9d3cd8_0 2025-05-07T19:45:23.6609695Z xorg-libxext conda-forge/linux-64::xorg-libxext-1.3.6-hb9d3cd8_0 2025-05-07T19:45:23.6610289Z xorg-libxfixes conda-forge/linux-64::xorg-libxfixes-6.0.1-hb9d3cd8_0 2025-05-07T19:45:23.6610833Z xorg-libxi conda-forge/linux-64::xorg-libxi-1.8.2-hb9d3cd8_0 2025-05-07T19:45:23.6611401Z xorg-libxrandr conda-forge/linux-64::xorg-libxrandr-1.5.4-hb9d3cd8_0 2025-05-07T19:45:23.6611997Z xorg-libxrender conda-forge/linux-64::xorg-libxrender-0.9.12-hb9d3cd8_0 2025-05-07T19:45:23.6612588Z xorg-libxt conda-forge/linux-64::xorg-libxt-1.3.1-hb9d3cd8_0 2025-05-07T19:45:23.6613151Z xorg-libxtst conda-forge/linux-64::xorg-libxtst-1.2.5-hb9d3cd8_3 2025-05-07T19:45:23.6613693Z xz-gpl-tools conda-forge/linux-64::xz-gpl-tools-5.8.1-hbcc6ac9_1 2025-05-07T19:45:23.6614226Z xz-tools conda-forge/linux-64::xz-tools-5.8.1-hb9d3cd8_1 2025-05-07T19:45:23.6614693Z yaml conda-forge/linux-64::yaml-0.2.5-h7f98852_2 2025-05-07T19:45:23.6614991Z 2025-05-07T19:45:23.6615125Z The following packages will be UPDATED: 2025-05-07T19:45:23.6615348Z 2025-05-07T19:45:23.6615675Z libuuid pkgs/main::libuuid-1.41.5-h5eee18b_0 --> conda-forge::libuuid-2.38.1-h0b41bf4_0 2025-05-07T19:45:23.6616246Z libzlib 1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:23.6616822Z ncurses pkgs/main::ncurses-6.4-h6a678d5_0 --> conda-forge::ncurses-6.5-h2d0b736_3 2025-05-07T19:45:23.6617526Z python pkgs/main::python-3.10.16-he870216_1 --> conda-forge::python-3.10.17-hd6af730_0_cpython 2025-05-07T19:45:23.6618253Z sqlite pkgs/main::sqlite-3.45.3-h5eee18b_0 --> conda-forge::sqlite-3.49.2-h9eae976_0 2025-05-07T19:45:23.6618979Z wheel pkgs/main/linux-64::wheel-0.45.1-py31~ --> conda-forge/noarch::wheel-0.45.1-pyhd8ed1ab_1 2025-05-07T19:45:23.6619695Z xz pkgs/main::xz-5.6.4-h5eee18b_1 --> conda-forge::xz-5.8.1-hbcc6ac9_1 2025-05-07T19:45:23.6620212Z zlib 1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:23.6620655Z zstd 1.5.6-ha6fb4c9_0 --> 1.5.7-hb8e6e7a_2 2025-05-07T19:45:23.6620919Z 2025-05-07T19:45:23.6621162Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:45:23.6621507Z 2025-05-07T19:45:23.6621788Z tk pkgs/main::tk-8.6.14-h39e8969_0 --> conda-forge::tk-8.6.13-noxft_h4845f30_101 2025-05-07T19:45:23.6622273Z 2025-05-07T19:45:23.6622300Z 2025-05-07T19:45:23.6622432Z 2025-05-07T19:45:23.6622623Z Downloading and Extracting Packages: ...working... 2025-05-07T19:45:23.6623063Z openjdk-23.0.1 | 181.3 MB | | 0% 2025-05-07T19:45:23.6623352Z 2025-05-07T19:45:23.6623744Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:23.6623996Z 2025-05-07T19:45:23.6624000Z 2025-05-07T19:45:23.6631708Z python-3.10.17 | 23.9 MB | | 0%  2025-05-07T19:45:23.6632010Z 2025-05-07T19:45:23.6632014Z 2025-05-07T19:45:23.6632076Z 2025-05-07T19:45:23.6641172Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:23.6641514Z 2025-05-07T19:45:23.6641612Z 2025-05-07T19:45:23.6641615Z 2025-05-07T19:45:23.6641660Z 2025-05-07T19:45:23.6661941Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:23.6662293Z 2025-05-07T19:45:23.6662298Z 2025-05-07T19:45:23.6662302Z 2025-05-07T19:45:23.6662305Z 2025-05-07T19:45:23.6662309Z 2025-05-07T19:45:23.6663308Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:23.6663601Z 2025-05-07T19:45:23.6663605Z 2025-05-07T19:45:23.6663639Z 2025-05-07T19:45:23.6663642Z 2025-05-07T19:45:23.6663646Z 2025-05-07T19:45:23.6663649Z 2025-05-07T19:45:23.6664485Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:23.6664793Z 2025-05-07T19:45:23.6664805Z 2025-05-07T19:45:23.6664808Z 2025-05-07T19:45:23.6664842Z 2025-05-07T19:45:23.6664845Z 2025-05-07T19:45:23.6664848Z 2025-05-07T19:45:23.6664852Z 2025-05-07T19:45:23.6665567Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:23.6665869Z 2025-05-07T19:45:23.6665873Z 2025-05-07T19:45:23.6665876Z 2025-05-07T19:45:23.6665880Z 2025-05-07T19:45:23.6665915Z 2025-05-07T19:45:23.6665919Z 2025-05-07T19:45:23.6665922Z 2025-05-07T19:45:23.6665926Z 2025-05-07T19:45:23.6666686Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:23.6666993Z 2025-05-07T19:45:23.6666996Z 2025-05-07T19:45:23.6667000Z 2025-05-07T19:45:23.6667003Z 2025-05-07T19:45:23.6667037Z 2025-05-07T19:45:23.6667040Z 2025-05-07T19:45:23.6667044Z 2025-05-07T19:45:23.6667047Z 2025-05-07T19:45:23.6667050Z 2025-05-07T19:45:23.6668119Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:23.6668433Z 2025-05-07T19:45:23.6668437Z 2025-05-07T19:45:23.6668441Z 2025-05-07T19:45:23.6668444Z 2025-05-07T19:45:23.6668480Z 2025-05-07T19:45:23.6668483Z 2025-05-07T19:45:23.6668487Z 2025-05-07T19:45:23.6668495Z 2025-05-07T19:45:23.6668498Z 2025-05-07T19:45:23.6668501Z 2025-05-07T19:45:23.6670205Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:23.6670895Z 2025-05-07T19:45:23.6670908Z 2025-05-07T19:45:23.6670920Z 2025-05-07T19:45:23.6670927Z 2025-05-07T19:45:23.6670937Z 2025-05-07T19:45:23.6670943Z 2025-05-07T19:45:23.6670953Z 2025-05-07T19:45:23.6671009Z 2025-05-07T19:45:23.6671037Z 2025-05-07T19:45:23.6671042Z 2025-05-07T19:45:23.6671084Z 2025-05-07T19:45:23.6671618Z font-ttf-ubuntu-0.83 | 1.5 MB | | 0%  2025-05-07T19:45:23.6672245Z 2025-05-07T19:45:23.6672250Z 2025-05-07T19:45:23.6672256Z 2025-05-07T19:45:23.6672261Z 2025-05-07T19:45:23.6672266Z 2025-05-07T19:45:23.6672271Z 2025-05-07T19:45:23.6672278Z 2025-05-07T19:45:23.6672285Z 2025-05-07T19:45:23.6672290Z 2025-05-07T19:45:23.6672296Z 2025-05-07T19:45:23.6672303Z 2025-05-07T19:45:23.6672308Z 2025-05-07T19:45:23.6672847Z harfbuzz-9.0.0 | 1.5 MB | | 0%  2025-05-07T19:45:23.6673391Z 2025-05-07T19:45:23.6673396Z 2025-05-07T19:45:23.6673401Z 2025-05-07T19:45:23.6673435Z 2025-05-07T19:45:23.6673440Z 2025-05-07T19:45:23.6673445Z 2025-05-07T19:45:23.6673454Z 2025-05-07T19:45:23.6673478Z 2025-05-07T19:45:23.6673483Z 2025-05-07T19:45:23.6673488Z 2025-05-07T19:45:23.6673503Z 2025-05-07T19:45:23.6673861Z 2025-05-07T19:45:23.6673870Z 2025-05-07T19:45:23.6674405Z libgfortran5-15.1.0 | 1.5 MB | | 0%  2025-05-07T19:45:23.6675018Z 2025-05-07T19:45:23.6675026Z 2025-05-07T19:45:23.6675035Z 2025-05-07T19:45:23.6675041Z 2025-05-07T19:45:23.6675068Z 2025-05-07T19:45:23.6675076Z 2025-05-07T19:45:23.6675084Z 2025-05-07T19:45:23.6675089Z 2025-05-07T19:45:23.6675097Z 2025-05-07T19:45:23.6675105Z 2025-05-07T19:45:23.6675114Z 2025-05-07T19:45:23.6675119Z 2025-05-07T19:45:23.6675127Z 2025-05-07T19:45:23.6675135Z 2025-05-07T19:45:23.6675517Z krb5-1.21.3 | 1.3 MB | | 0%  2025-05-07T19:45:23.6676081Z 2025-05-07T19:45:23.6676089Z 2025-05-07T19:45:23.6676097Z 2025-05-07T19:45:23.6676102Z 2025-05-07T19:45:23.6676110Z 2025-05-07T19:45:23.6676118Z 2025-05-07T19:45:23.6676124Z 2025-05-07T19:45:23.6676131Z 2025-05-07T19:45:23.6676140Z 2025-05-07T19:45:23.6676321Z 2025-05-07T19:45:23.6676338Z 2025-05-07T19:45:23.6676342Z 2025-05-07T19:45:23.6676346Z 2025-05-07T19:45:23.6676349Z 2025-05-07T19:45:23.6676371Z 2025-05-07T19:45:23.6676832Z libabseil-20250127.1 | 1.3 MB | | 0%  2025-05-07T19:45:23.6677491Z 2025-05-07T19:45:23.6677498Z 2025-05-07T19:45:23.6677506Z 2025-05-07T19:45:23.6677515Z 2025-05-07T19:45:23.6677521Z 2025-05-07T19:45:23.6677529Z 2025-05-07T19:45:23.6677537Z 2025-05-07T19:45:23.6677543Z 2025-05-07T19:45:23.6677549Z 2025-05-07T19:45:23.6677557Z 2025-05-07T19:45:23.6677562Z 2025-05-07T19:45:23.6677566Z 2025-05-07T19:45:23.6677571Z 2025-05-07T19:45:23.6677576Z 2025-05-07T19:45:23.6677581Z 2025-05-07T19:45:23.6677622Z 2025-05-07T19:45:23.6678141Z cairo-1.18.0 | 961 KB | | 0%  2025-05-07T19:45:23.6678684Z 2025-05-07T19:45:23.6678687Z 2025-05-07T19:45:23.6678691Z 2025-05-07T19:45:23.6678700Z 2025-05-07T19:45:23.6678709Z 2025-05-07T19:45:23.6678713Z 2025-05-07T19:45:23.6678716Z 2025-05-07T19:45:23.6678720Z 2025-05-07T19:45:23.6678742Z 2025-05-07T19:45:23.6678745Z 2025-05-07T19:45:23.6678749Z 2025-05-07T19:45:23.6678752Z 2025-05-07T19:45:23.6678755Z 2025-05-07T19:45:23.6678759Z 2025-05-07T19:45:23.6678762Z 2025-05-07T19:45:23.6678765Z 2025-05-07T19:45:23.6678769Z 2025-05-07T19:45:23.6679078Z pcre2-10.44 | 934 KB | | 0%  2025-05-07T19:45:23.6679409Z 2025-05-07T19:45:23.6679413Z 2025-05-07T19:45:23.6679416Z 2025-05-07T19:45:23.6679419Z 2025-05-07T19:45:23.6679423Z 2025-05-07T19:45:23.6679427Z 2025-05-07T19:45:23.6679430Z 2025-05-07T19:45:23.6679433Z 2025-05-07T19:45:23.6679437Z 2025-05-07T19:45:23.6679440Z 2025-05-07T19:45:23.6679443Z 2025-05-07T19:45:23.6679447Z 2025-05-07T19:45:23.6679450Z 2025-05-07T19:45:23.6679454Z 2025-05-07T19:45:23.6679457Z 2025-05-07T19:45:23.6679461Z 2025-05-07T19:45:23.6679469Z 2025-05-07T19:45:23.6679478Z 2025-05-07T19:45:23.6679809Z libsqlite-3.49.2 | 895 KB | | 0%  2025-05-07T19:45:23.6680132Z 2025-05-07T19:45:23.6680135Z 2025-05-07T19:45:23.6680139Z 2025-05-07T19:45:23.6680142Z 2025-05-07T19:45:23.6680146Z 2025-05-07T19:45:23.6680150Z 2025-05-07T19:45:23.6680153Z 2025-05-07T19:45:23.6680158Z 2025-05-07T19:45:23.6680162Z 2025-05-07T19:45:23.6680165Z 2025-05-07T19:45:23.6680169Z 2025-05-07T19:45:23.6680172Z 2025-05-07T19:45:23.6680194Z 2025-05-07T19:45:23.6680197Z 2025-05-07T19:45:23.6680201Z 2025-05-07T19:45:23.6680204Z 2025-05-07T19:45:23.6680208Z 2025-05-07T19:45:23.6680211Z 2025-05-07T19:45:23.6680214Z 2025-05-07T19:45:23.7976367Z ... (more hidden) ... 2025-05-07T19:45:23.7977232Z 2025-05-07T19:45:23.7977237Z 2025-05-07T19:45:23.7977241Z 2025-05-07T19:45:23.8023375Z 2025-05-07T19:45:23.8177607Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:23.8177955Z 2025-05-07T19:45:23.8177973Z 2025-05-07T19:45:23.8177977Z 2025-05-07T19:45:23.9024695Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:23.9025876Z 2025-05-07T19:45:23.9186527Z 2025-05-07T19:45:23.9186536Z 2025-05-07T19:45:23.9186542Z 2025-05-07T19:45:23.9187446Z libgrpc-1.71.0 | 7.6 MB | | 1%  2025-05-07T19:45:23.9187846Z 2025-05-07T19:45:23.9187850Z 2025-05-07T19:45:23.9187854Z 2025-05-07T19:45:24.0041228Z cmake-4.0.2 | 19.4 MB | 2 | 2%  2025-05-07T19:45:24.0041829Z 2025-05-07T19:45:24.0041836Z 2025-05-07T19:45:24.0041843Z 2025-05-07T19:45:24.0041848Z 2025-05-07T19:45:24.0042130Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:24.0042414Z 2025-05-07T19:45:24.0042418Z 2025-05-07T19:45:24.0042456Z 2025-05-07T19:45:24.0042459Z 2025-05-07T19:45:24.0080872Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:24.0081457Z 2025-05-07T19:45:24.0131704Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:24.0132300Z 2025-05-07T19:45:24.0132310Z 2025-05-07T19:45:24.0159192Z python-3.10.17 | 23.9 MB | | 0%  2025-05-07T19:45:24.0185102Z openjdk-23.0.1 | 181.3 MB | | 0% 2025-05-07T19:45:24.0185876Z 2025-05-07T19:45:24.0185906Z 2025-05-07T19:45:24.0185918Z 2025-05-07T19:45:24.0545271Z cmake-4.0.2 | 19.4 MB | #####5 | 56%  2025-05-07T19:45:24.0545611Z 2025-05-07T19:45:24.0545616Z 2025-05-07T19:45:24.0545620Z 2025-05-07T19:45:24.0545624Z 2025-05-07T19:45:24.0545627Z 2025-05-07T19:45:24.1081953Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:24.1082326Z 2025-05-07T19:45:24.1134063Z bazel-7.5.0 | 47.4 MB | #4 | 14%  2025-05-07T19:45:24.1134581Z 2025-05-07T19:45:24.1134590Z 2025-05-07T19:45:24.1186170Z python-3.10.17 | 23.9 MB | #4 | 15%  2025-05-07T19:45:24.1186537Z 2025-05-07T19:45:24.1186541Z 2025-05-07T19:45:24.1186545Z 2025-05-07T19:45:24.1266082Z cmake-4.0.2 | 19.4 MB | ########5 | 86%  2025-05-07T19:45:24.1545656Z openjdk-23.0.1 | 181.3 MB | 1 | 1% 2025-05-07T19:45:24.1545945Z 2025-05-07T19:45:24.1545950Z 2025-05-07T19:45:24.1545956Z 2025-05-07T19:45:24.1545960Z 2025-05-07T19:45:24.1545967Z 2025-05-07T19:45:24.2014350Z openblas-0.3.29 | 5.8 MB | #########9 | 99%  2025-05-07T19:45:24.2014786Z 2025-05-07T19:45:24.2014794Z 2025-05-07T19:45:24.2014798Z 2025-05-07T19:45:24.2014803Z 2025-05-07T19:45:24.2014807Z 2025-05-07T19:45:24.2097673Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:24.2098010Z 2025-05-07T19:45:24.2459181Z bazel-7.5.0 | 47.4 MB | ##5 | 26%  2025-05-07T19:45:24.2497030Z openjdk-23.0.1 | 181.3 MB | 3 | 3% 2025-05-07T19:45:24.2497416Z 2025-05-07T19:45:24.2497576Z 2025-05-07T19:45:24.2497593Z 2025-05-07T19:45:24.2497600Z 2025-05-07T19:45:24.2497607Z 2025-05-07T19:45:24.2497615Z 2025-05-07T19:45:24.2881002Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:24.2881569Z 2025-05-07T19:45:24.2881583Z 2025-05-07T19:45:24.2881604Z 2025-05-07T19:45:24.3099318Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:24.3099700Z 2025-05-07T19:45:24.3161314Z bazel-7.5.0 | 47.4 MB | ###7 | 38%  2025-05-07T19:45:24.3161665Z 2025-05-07T19:45:24.3161671Z 2025-05-07T19:45:24.3337329Z python-3.10.17 | 23.9 MB | ###1 | 31%  2025-05-07T19:45:24.3337617Z 2025-05-07T19:45:24.3337721Z 2025-05-07T19:45:24.3337728Z 2025-05-07T19:45:24.3337806Z 2025-05-07T19:45:24.3337817Z 2025-05-07T19:45:24.3337822Z 2025-05-07T19:45:24.3337827Z 2025-05-07T19:45:24.3785499Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:24.3786878Z 2025-05-07T19:45:24.3786898Z 2025-05-07T19:45:24.3786910Z 2025-05-07T19:45:24.3786921Z 2025-05-07T19:45:24.3786931Z 2025-05-07T19:45:24.3786943Z 2025-05-07T19:45:24.3787770Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:24.3788645Z 2025-05-07T19:45:24.3788658Z 2025-05-07T19:45:24.3788668Z 2025-05-07T19:45:24.3788678Z 2025-05-07T19:45:24.3788690Z 2025-05-07T19:45:24.3788717Z 2025-05-07T19:45:24.3892555Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:24.4160204Z openjdk-23.0.1 | 181.3 MB | 4 | 4% 2025-05-07T19:45:24.4160584Z 2025-05-07T19:45:24.4160830Z 2025-05-07T19:45:24.4225672Z python-3.10.17 | 23.9 MB | ##### | 51%  2025-05-07T19:45:24.4225966Z 2025-05-07T19:45:24.4225973Z 2025-05-07T19:45:24.4225979Z 2025-05-07T19:45:24.4225986Z 2025-05-07T19:45:24.4225992Z 2025-05-07T19:45:24.4226013Z 2025-05-07T19:45:24.4226026Z 2025-05-07T19:45:24.4226639Z 2025-05-07T19:45:24.4462866Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:24.4463188Z 2025-05-07T19:45:24.4894406Z bazel-7.5.0 | 47.4 MB | ####8 | 48%  2025-05-07T19:45:24.5158447Z openjdk-23.0.1 | 181.3 MB | 5 | 5% 2025-05-07T19:45:24.5158762Z 2025-05-07T19:45:24.5158767Z 2025-05-07T19:45:24.5158772Z 2025-05-07T19:45:24.5158776Z 2025-05-07T19:45:24.5158780Z 2025-05-07T19:45:24.5158784Z 2025-05-07T19:45:24.5158787Z 2025-05-07T19:45:24.5159106Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:24.5159396Z 2025-05-07T19:45:24.5159401Z 2025-05-07T19:45:24.5159406Z 2025-05-07T19:45:24.5159410Z 2025-05-07T19:45:24.5159415Z 2025-05-07T19:45:24.5159420Z 2025-05-07T19:45:24.5159425Z 2025-05-07T19:45:24.5167180Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:24.5167998Z 2025-05-07T19:45:24.5168010Z 2025-05-07T19:45:24.5462473Z python-3.10.17 | 23.9 MB | ####### | 70%  2025-05-07T19:45:24.5462769Z 2025-05-07T19:45:24.5650717Z bazel-7.5.0 | 47.4 MB | #####8 | 59%  2025-05-07T19:45:24.5651152Z 2025-05-07T19:45:24.5651168Z 2025-05-07T19:45:24.5651176Z 2025-05-07T19:45:24.5651184Z 2025-05-07T19:45:24.5651193Z 2025-05-07T19:45:24.5651210Z 2025-05-07T19:45:24.5651217Z 2025-05-07T19:45:24.5651228Z 2025-05-07T19:45:24.5651235Z 2025-05-07T19:45:24.5692183Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:24.5693916Z 2025-05-07T19:45:24.5693932Z 2025-05-07T19:45:24.5693943Z 2025-05-07T19:45:24.5693955Z 2025-05-07T19:45:24.5693965Z 2025-05-07T19:45:24.5693977Z 2025-05-07T19:45:24.5693988Z 2025-05-07T19:45:24.5694000Z 2025-05-07T19:45:24.5694768Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:24.5695631Z 2025-05-07T19:45:24.5695642Z 2025-05-07T19:45:24.5695652Z 2025-05-07T19:45:24.5695702Z 2025-05-07T19:45:24.5695731Z 2025-05-07T19:45:24.5695742Z 2025-05-07T19:45:24.5695752Z 2025-05-07T19:45:24.5695763Z 2025-05-07T19:45:24.5987899Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:24.6171036Z openjdk-23.0.1 | 181.3 MB | 6 | 7% 2025-05-07T19:45:24.6171581Z 2025-05-07T19:45:24.6171590Z 2025-05-07T19:45:24.6171957Z python-3.10.17 | 23.9 MB | #########1 | 91%  2025-05-07T19:45:24.6172237Z 2025-05-07T19:45:24.6172241Z 2025-05-07T19:45:24.6172244Z 2025-05-07T19:45:24.6172248Z 2025-05-07T19:45:24.6172251Z 2025-05-07T19:45:24.6172254Z 2025-05-07T19:45:24.6172257Z 2025-05-07T19:45:24.6172261Z 2025-05-07T19:45:24.6172264Z 2025-05-07T19:45:24.6172834Z 2025-05-07T19:45:24.6562864Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:24.6563194Z 2025-05-07T19:45:24.7004310Z bazel-7.5.0 | 47.4 MB | ######8 | 68%  2025-05-07T19:45:24.7021551Z openjdk-23.0.1 | 181.3 MB | 8 | 8% 2025-05-07T19:45:24.7021877Z 2025-05-07T19:45:24.7021882Z 2025-05-07T19:45:24.7021885Z 2025-05-07T19:45:24.7021889Z 2025-05-07T19:45:24.7021892Z 2025-05-07T19:45:24.7021896Z 2025-05-07T19:45:24.7021899Z 2025-05-07T19:45:24.7021903Z 2025-05-07T19:45:24.7021906Z 2025-05-07T19:45:24.7022463Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:24.7022775Z 2025-05-07T19:45:24.7022778Z 2025-05-07T19:45:24.7022782Z 2025-05-07T19:45:24.7022785Z 2025-05-07T19:45:24.7022788Z 2025-05-07T19:45:24.7022792Z 2025-05-07T19:45:24.7022795Z 2025-05-07T19:45:24.7022799Z 2025-05-07T19:45:24.7022802Z 2025-05-07T19:45:24.7150549Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:24.7150890Z 2025-05-07T19:45:24.7150894Z 2025-05-07T19:45:24.7150898Z 2025-05-07T19:45:24.7150902Z 2025-05-07T19:45:24.7383517Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:24.7384112Z 2025-05-07T19:45:24.7384117Z 2025-05-07T19:45:24.7384121Z 2025-05-07T19:45:24.7384125Z 2025-05-07T19:45:24.7384129Z 2025-05-07T19:45:24.7384132Z 2025-05-07T19:45:24.7384136Z 2025-05-07T19:45:24.7384139Z 2025-05-07T19:45:24.7384143Z 2025-05-07T19:45:24.7384146Z 2025-05-07T19:45:24.7384401Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:24.7384693Z 2025-05-07T19:45:24.7384697Z 2025-05-07T19:45:24.7384700Z 2025-05-07T19:45:24.7384703Z 2025-05-07T19:45:24.7384707Z 2025-05-07T19:45:24.7384710Z 2025-05-07T19:45:24.7384713Z 2025-05-07T19:45:24.7384717Z 2025-05-07T19:45:24.7384720Z 2025-05-07T19:45:24.7384724Z 2025-05-07T19:45:24.7407508Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:24.7407844Z 2025-05-07T19:45:24.7407848Z 2025-05-07T19:45:24.7407852Z 2025-05-07T19:45:24.7407856Z 2025-05-07T19:45:24.7407859Z 2025-05-07T19:45:24.7407863Z 2025-05-07T19:45:24.7407881Z 2025-05-07T19:45:24.7407890Z 2025-05-07T19:45:24.7407901Z 2025-05-07T19:45:24.7407904Z 2025-05-07T19:45:24.7407908Z 2025-05-07T19:45:24.7565007Z font-ttf-ubuntu-0.83 | 1.5 MB | 1 | 1%  2025-05-07T19:45:24.7565362Z 2025-05-07T19:45:24.7831528Z bazel-7.5.0 | 47.4 MB | ########2 | 82%  2025-05-07T19:45:24.7831875Z 2025-05-07T19:45:24.7831881Z 2025-05-07T19:45:24.7831885Z 2025-05-07T19:45:24.7831890Z 2025-05-07T19:45:24.7831894Z 2025-05-07T19:45:24.7831899Z 2025-05-07T19:45:24.7831904Z 2025-05-07T19:45:24.7831908Z 2025-05-07T19:45:24.7831913Z 2025-05-07T19:45:24.7831917Z 2025-05-07T19:45:24.7831922Z 2025-05-07T19:45:24.8004815Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:24.8193945Z openjdk-23.0.1 | 181.3 MB | #2 | 12% 2025-05-07T19:45:24.8194333Z 2025-05-07T19:45:24.8194506Z 2025-05-07T19:45:24.8194514Z 2025-05-07T19:45:24.8194536Z 2025-05-07T19:45:24.8194551Z 2025-05-07T19:45:24.8194556Z 2025-05-07T19:45:24.8194560Z 2025-05-07T19:45:24.8194605Z 2025-05-07T19:45:24.8194609Z 2025-05-07T19:45:24.8194614Z 2025-05-07T19:45:24.8194618Z 2025-05-07T19:45:24.8194623Z 2025-05-07T19:45:24.8457640Z harfbuzz-9.0.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:24.8457978Z 2025-05-07T19:45:24.8457983Z 2025-05-07T19:45:24.8458014Z 2025-05-07T19:45:24.8458017Z 2025-05-07T19:45:24.8458021Z 2025-05-07T19:45:24.8458024Z 2025-05-07T19:45:24.8458028Z 2025-05-07T19:45:24.8458031Z 2025-05-07T19:45:24.8458035Z 2025-05-07T19:45:24.8458038Z 2025-05-07T19:45:24.8458042Z 2025-05-07T19:45:24.8458045Z 2025-05-07T19:45:24.8458049Z 2025-05-07T19:45:24.8569649Z libgfortran5-15.1.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:24.8570607Z 2025-05-07T19:45:24.8809040Z bazel-7.5.0 | 47.4 MB | #########8 | 98%  2025-05-07T19:45:24.8810452Z 2025-05-07T19:45:24.8810514Z 2025-05-07T19:45:24.8810957Z 2025-05-07T19:45:24.8810971Z 2025-05-07T19:45:24.8810981Z 2025-05-07T19:45:24.8864236Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:24.8865543Z 2025-05-07T19:45:24.8865589Z 2025-05-07T19:45:24.8865600Z 2025-05-07T19:45:24.8865611Z 2025-05-07T19:45:24.8865621Z 2025-05-07T19:45:24.8865632Z 2025-05-07T19:45:24.8865642Z 2025-05-07T19:45:24.8865652Z 2025-05-07T19:45:24.8865663Z 2025-05-07T19:45:24.8865673Z 2025-05-07T19:45:24.8865683Z 2025-05-07T19:45:24.8865694Z 2025-05-07T19:45:24.8968339Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:24.8968916Z 2025-05-07T19:45:24.8968921Z 2025-05-07T19:45:24.8968924Z 2025-05-07T19:45:24.8968928Z 2025-05-07T19:45:24.8968931Z 2025-05-07T19:45:24.8968935Z 2025-05-07T19:45:24.8968938Z 2025-05-07T19:45:24.8968942Z 2025-05-07T19:45:24.8968945Z 2025-05-07T19:45:24.8968949Z 2025-05-07T19:45:24.8968952Z 2025-05-07T19:45:24.8969142Z 2025-05-07T19:45:24.8969146Z 2025-05-07T19:45:24.9005252Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:24.9332153Z openjdk-23.0.1 | 181.3 MB | #6 | 16% 2025-05-07T19:45:24.9333368Z 2025-05-07T19:45:24.9333394Z 2025-05-07T19:45:24.9333417Z 2025-05-07T19:45:24.9333440Z 2025-05-07T19:45:24.9333456Z 2025-05-07T19:45:24.9333477Z 2025-05-07T19:45:24.9333500Z 2025-05-07T19:45:24.9333516Z 2025-05-07T19:45:24.9333537Z 2025-05-07T19:45:24.9333560Z 2025-05-07T19:45:24.9333583Z 2025-05-07T19:45:24.9333599Z 2025-05-07T19:45:24.9333619Z 2025-05-07T19:45:24.9333642Z 2025-05-07T19:45:24.9333658Z 2025-05-07T19:45:24.9461627Z libabseil-20250127.1 | 1.3 MB | 1 | 1%  2025-05-07T19:45:24.9462101Z 2025-05-07T19:45:24.9462106Z 2025-05-07T19:45:24.9462110Z 2025-05-07T19:45:24.9462113Z 2025-05-07T19:45:24.9462116Z 2025-05-07T19:45:24.9462120Z 2025-05-07T19:45:24.9462135Z 2025-05-07T19:45:24.9462144Z 2025-05-07T19:45:24.9462147Z 2025-05-07T19:45:24.9462151Z 2025-05-07T19:45:24.9462154Z 2025-05-07T19:45:24.9462179Z 2025-05-07T19:45:24.9462183Z 2025-05-07T19:45:24.9462186Z 2025-05-07T19:45:24.9702306Z krb5-1.21.3 | 1.3 MB | 1 | 1%  2025-05-07T19:45:24.9703782Z 2025-05-07T19:45:24.9703796Z 2025-05-07T19:45:24.9703808Z 2025-05-07T19:45:24.9703818Z 2025-05-07T19:45:24.9703855Z 2025-05-07T19:45:24.9703866Z 2025-05-07T19:45:24.9703877Z 2025-05-07T19:45:24.9703887Z 2025-05-07T19:45:24.9703897Z 2025-05-07T19:45:24.9703907Z 2025-05-07T19:45:24.9703917Z 2025-05-07T19:45:24.9703928Z 2025-05-07T19:45:24.9703938Z 2025-05-07T19:45:24.9703948Z 2025-05-07T19:45:24.9703958Z 2025-05-07T19:45:24.9821362Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:24.9821866Z 2025-05-07T19:45:24.9821871Z 2025-05-07T19:45:24.9821888Z 2025-05-07T19:45:24.9821898Z 2025-05-07T19:45:24.9821902Z 2025-05-07T19:45:24.9821905Z 2025-05-07T19:45:24.9821909Z 2025-05-07T19:45:24.9821913Z 2025-05-07T19:45:24.9821916Z 2025-05-07T19:45:24.9821920Z 2025-05-07T19:45:24.9821923Z 2025-05-07T19:45:24.9821927Z 2025-05-07T19:45:24.9821930Z 2025-05-07T19:45:24.9821934Z 2025-05-07T19:45:25.0006544Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:25.0180258Z openjdk-23.0.1 | 181.3 MB | ##1 | 21% 2025-05-07T19:45:25.0180839Z 2025-05-07T19:45:25.0180846Z 2025-05-07T19:45:25.0180852Z 2025-05-07T19:45:25.0180857Z 2025-05-07T19:45:25.0180863Z 2025-05-07T19:45:25.0180868Z 2025-05-07T19:45:25.0180874Z 2025-05-07T19:45:25.0180880Z 2025-05-07T19:45:25.0180885Z 2025-05-07T19:45:25.0180891Z 2025-05-07T19:45:25.0180896Z 2025-05-07T19:45:25.0180901Z 2025-05-07T19:45:25.0180907Z 2025-05-07T19:45:25.0180913Z 2025-05-07T19:45:25.0180919Z 2025-05-07T19:45:25.0180923Z 2025-05-07T19:45:25.0407762Z cairo-1.18.0 | 961 KB | 1 | 2%  2025-05-07T19:45:25.0408135Z 2025-05-07T19:45:25.0408140Z 2025-05-07T19:45:25.0408143Z 2025-05-07T19:45:25.0408147Z 2025-05-07T19:45:25.0408151Z 2025-05-07T19:45:25.0408154Z 2025-05-07T19:45:25.0408158Z 2025-05-07T19:45:25.0408161Z 2025-05-07T19:45:25.0408164Z 2025-05-07T19:45:25.0408168Z 2025-05-07T19:45:25.0408171Z 2025-05-07T19:45:25.0408175Z 2025-05-07T19:45:25.0408178Z 2025-05-07T19:45:25.0408182Z 2025-05-07T19:45:25.0408185Z 2025-05-07T19:45:25.0408217Z 2025-05-07T19:45:25.0438361Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:25.0438765Z 2025-05-07T19:45:25.0438772Z 2025-05-07T19:45:25.0438777Z 2025-05-07T19:45:25.0438782Z 2025-05-07T19:45:25.0438787Z 2025-05-07T19:45:25.0438792Z 2025-05-07T19:45:25.0438823Z 2025-05-07T19:45:25.0438842Z 2025-05-07T19:45:25.0438847Z 2025-05-07T19:45:25.0439090Z 2025-05-07T19:45:25.0439104Z 2025-05-07T19:45:25.0439112Z 2025-05-07T19:45:25.0439119Z 2025-05-07T19:45:25.0439124Z 2025-05-07T19:45:25.0439131Z 2025-05-07T19:45:25.0439139Z 2025-05-07T19:45:25.0439144Z 2025-05-07T19:45:25.0555461Z pcre2-10.44 | 934 KB | 1 | 2%  2025-05-07T19:45:25.0556019Z 2025-05-07T19:45:25.0556023Z 2025-05-07T19:45:25.0701636Z python-3.10.17 | 23.9 MB | ########## | 100%  2025-05-07T19:45:25.0702154Z 2025-05-07T19:45:25.0702158Z 2025-05-07T19:45:25.0702162Z 2025-05-07T19:45:25.0702165Z 2025-05-07T19:45:25.0702169Z 2025-05-07T19:45:25.0702172Z 2025-05-07T19:45:25.0702176Z 2025-05-07T19:45:25.0702179Z 2025-05-07T19:45:25.0702183Z 2025-05-07T19:45:25.0702186Z 2025-05-07T19:45:25.0702189Z 2025-05-07T19:45:25.0702193Z 2025-05-07T19:45:25.0702196Z 2025-05-07T19:45:25.0702199Z 2025-05-07T19:45:25.0702203Z 2025-05-07T19:45:25.0702206Z 2025-05-07T19:45:25.0702210Z 2025-05-07T19:45:25.0785025Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:25.0785484Z 2025-05-07T19:45:25.0785489Z 2025-05-07T19:45:25.0785492Z 2025-05-07T19:45:25.0785496Z 2025-05-07T19:45:25.0785499Z 2025-05-07T19:45:25.0785502Z 2025-05-07T19:45:25.0785506Z 2025-05-07T19:45:25.0785509Z 2025-05-07T19:45:25.0785513Z 2025-05-07T19:45:25.0785516Z 2025-05-07T19:45:25.0785519Z 2025-05-07T19:45:25.0785523Z 2025-05-07T19:45:25.0785526Z 2025-05-07T19:45:25.0785549Z 2025-05-07T19:45:25.0785553Z 2025-05-07T19:45:25.0785556Z 2025-05-07T19:45:25.0785559Z 2025-05-07T19:45:25.0785563Z 2025-05-07T19:45:25.0887650Z libsqlite-3.49.2 | 895 KB | 1 | 2%  2025-05-07T19:45:25.0889077Z 2025-05-07T19:45:25.0889091Z 2025-05-07T19:45:25.0889130Z 2025-05-07T19:45:25.0889140Z 2025-05-07T19:45:25.0889151Z 2025-05-07T19:45:25.0889161Z 2025-05-07T19:45:25.0889171Z 2025-05-07T19:45:25.0889211Z 2025-05-07T19:45:25.0889235Z 2025-05-07T19:45:25.0889246Z 2025-05-07T19:45:25.0889256Z 2025-05-07T19:45:25.0889266Z 2025-05-07T19:45:25.0889277Z 2025-05-07T19:45:25.0889287Z 2025-05-07T19:45:25.0889297Z 2025-05-07T19:45:25.0889307Z 2025-05-07T19:45:25.0889317Z 2025-05-07T19:45:25.0889327Z 2025-05-07T19:45:25.0889338Z 2025-05-07T19:45:25.0984961Z ... (more hidden) ... 2025-05-07T19:45:25.0986227Z 2025-05-07T19:45:25.0986242Z 2025-05-07T19:45:25.0986255Z 2025-05-07T19:45:25.0986265Z 2025-05-07T19:45:25.0986276Z 2025-05-07T19:45:25.0986286Z 2025-05-07T19:45:25.0987485Z 2025-05-07T19:45:25.1006905Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:25.1101647Z openjdk-23.0.1 | 181.3 MB | ##5 | 26% 2025-05-07T19:45:25.1102177Z 2025-05-07T19:45:25.1102186Z 2025-05-07T19:45:25.1102194Z 2025-05-07T19:45:25.1102199Z 2025-05-07T19:45:25.1102207Z 2025-05-07T19:45:25.1102233Z 2025-05-07T19:45:25.1102461Z 2025-05-07T19:45:25.1102468Z 2025-05-07T19:45:25.1102471Z 2025-05-07T19:45:25.1102474Z 2025-05-07T19:45:25.1102478Z 2025-05-07T19:45:25.1102481Z 2025-05-07T19:45:25.1102509Z 2025-05-07T19:45:25.1102512Z 2025-05-07T19:45:25.1102516Z 2025-05-07T19:45:25.1102519Z 2025-05-07T19:45:25.1102522Z 2025-05-07T19:45:25.1102526Z 2025-05-07T19:45:25.1172064Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:25.1172700Z 2025-05-07T19:45:25.1172705Z 2025-05-07T19:45:25.1172709Z 2025-05-07T19:45:25.1172712Z 2025-05-07T19:45:25.1172715Z 2025-05-07T19:45:25.1172719Z 2025-05-07T19:45:25.1172722Z 2025-05-07T19:45:25.1172726Z 2025-05-07T19:45:25.1172729Z 2025-05-07T19:45:25.1172732Z 2025-05-07T19:45:25.1172736Z 2025-05-07T19:45:25.1172739Z 2025-05-07T19:45:25.1172743Z 2025-05-07T19:45:25.1172746Z 2025-05-07T19:45:25.1172750Z 2025-05-07T19:45:25.1172753Z 2025-05-07T19:45:25.1172756Z 2025-05-07T19:45:25.1172967Z 2025-05-07T19:45:25.1172970Z 2025-05-07T19:45:25.1478369Z ... (more hidden) ... 2025-05-07T19:45:25.1479883Z 2025-05-07T19:45:25.1479887Z 2025-05-07T19:45:25.1479891Z 2025-05-07T19:45:25.1479894Z 2025-05-07T19:45:25.1479898Z 2025-05-07T19:45:25.1479901Z 2025-05-07T19:45:25.2150324Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:25.3149635Z openjdk-23.0.1 | 181.3 MB | ##9 | 29% 2025-05-07T19:45:25.4157895Z openjdk-23.0.1 | 181.3 MB | ###3 | 34% 2025-05-07T19:45:25.5063272Z openjdk-23.0.1 | 181.3 MB | ###9 | 40% 2025-05-07T19:45:25.5063815Z 2025-05-07T19:45:25.5063824Z 2025-05-07T19:45:25.5063829Z 2025-05-07T19:45:25.5063834Z 2025-05-07T19:45:25.5063838Z 2025-05-07T19:45:25.5063844Z 2025-05-07T19:45:25.5063849Z 2025-05-07T19:45:25.5063857Z 2025-05-07T19:45:25.5582821Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:25.5707230Z openjdk-23.0.1 | 181.3 MB | ####3 | 44% 2025-05-07T19:45:25.5708038Z 2025-05-07T19:45:25.6867782Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:25.6868251Z 2025-05-07T19:45:25.6868278Z 2025-05-07T19:45:25.6868282Z 2025-05-07T19:45:25.6868285Z 2025-05-07T19:45:25.6868289Z 2025-05-07T19:45:25.6868292Z 2025-05-07T19:45:25.6868296Z 2025-05-07T19:45:25.6868299Z 2025-05-07T19:45:25.6868303Z 2025-05-07T19:45:25.6987799Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:25.7645074Z openjdk-23.0.1 | 181.3 MB | ####7 | 48% 2025-05-07T19:45:25.7646566Z 2025-05-07T19:45:25.7646592Z 2025-05-07T19:45:25.7646615Z 2025-05-07T19:45:25.7646631Z 2025-05-07T19:45:25.7646652Z 2025-05-07T19:45:25.7646676Z 2025-05-07T19:45:25.7646698Z 2025-05-07T19:45:25.7646715Z 2025-05-07T19:45:25.7646781Z 2025-05-07T19:45:25.7646801Z 2025-05-07T19:45:25.7646825Z 2025-05-07T19:45:25.7648666Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:25.7649608Z 2025-05-07T19:45:25.7649619Z 2025-05-07T19:45:25.7649630Z 2025-05-07T19:45:25.7649640Z 2025-05-07T19:45:25.7649650Z 2025-05-07T19:45:25.7649661Z 2025-05-07T19:45:25.7649671Z 2025-05-07T19:45:25.7649708Z 2025-05-07T19:45:25.7649718Z 2025-05-07T19:45:25.7649728Z 2025-05-07T19:45:25.7649737Z 2025-05-07T19:45:25.8102064Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:25.8687956Z openjdk-23.0.1 | 181.3 MB | #####1 | 52% 2025-05-07T19:45:25.8689411Z 2025-05-07T19:45:25.8689437Z 2025-05-07T19:45:25.8689460Z 2025-05-07T19:45:25.8689483Z 2025-05-07T19:45:25.8689501Z 2025-05-07T19:45:25.8689521Z 2025-05-07T19:45:25.8689545Z 2025-05-07T19:45:25.8689567Z 2025-05-07T19:45:25.8689583Z 2025-05-07T19:45:25.8689605Z 2025-05-07T19:45:25.8689631Z 2025-05-07T19:45:25.8689683Z 2025-05-07T19:45:25.8691762Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:25.8692679Z 2025-05-07T19:45:25.8692690Z 2025-05-07T19:45:25.8692700Z 2025-05-07T19:45:25.8692711Z 2025-05-07T19:45:25.8692721Z 2025-05-07T19:45:25.8692731Z 2025-05-07T19:45:25.8692742Z 2025-05-07T19:45:25.8692752Z 2025-05-07T19:45:25.8692762Z 2025-05-07T19:45:25.8692801Z 2025-05-07T19:45:25.8692811Z 2025-05-07T19:45:25.8692822Z 2025-05-07T19:45:25.9485123Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:25.9550593Z openjdk-23.0.1 | 181.3 MB | #####5 | 55% 2025-05-07T19:45:25.9551125Z 2025-05-07T19:45:25.9551134Z 2025-05-07T19:45:25.9551143Z 2025-05-07T19:45:25.9551149Z 2025-05-07T19:45:25.9551156Z 2025-05-07T19:45:25.9551163Z 2025-05-07T19:45:25.9551171Z 2025-05-07T19:45:25.9551176Z 2025-05-07T19:45:25.9551184Z 2025-05-07T19:45:25.9551289Z 2025-05-07T19:45:25.9692847Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:25.9693577Z 2025-05-07T19:45:25.9693582Z 2025-05-07T19:45:25.9693585Z 2025-05-07T19:45:25.9693589Z 2025-05-07T19:45:25.9693593Z 2025-05-07T19:45:25.9693596Z 2025-05-07T19:45:25.9693600Z 2025-05-07T19:45:25.9693603Z 2025-05-07T19:45:25.9693607Z 2025-05-07T19:45:25.9693611Z 2025-05-07T19:45:25.9693634Z 2025-05-07T19:45:25.9693638Z 2025-05-07T19:45:25.9693641Z 2025-05-07T19:45:25.9693968Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:25.9694295Z 2025-05-07T19:45:25.9694299Z 2025-05-07T19:45:25.9694302Z 2025-05-07T19:45:25.9694305Z 2025-05-07T19:45:25.9694309Z 2025-05-07T19:45:25.9694313Z 2025-05-07T19:45:25.9694316Z 2025-05-07T19:45:25.9694342Z 2025-05-07T19:45:25.9694346Z 2025-05-07T19:45:25.9694349Z 2025-05-07T19:45:25.9694353Z 2025-05-07T19:45:25.9694356Z 2025-05-07T19:45:25.9694359Z 2025-05-07T19:45:26.0485226Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:26.1573943Z openjdk-23.0.1 | 181.3 MB | #####8 | 59% 2025-05-07T19:45:26.1745974Z openjdk-23.0.1 | 181.3 MB | ######1 | 62% 2025-05-07T19:45:26.1746296Z 2025-05-07T19:45:26.1746300Z 2025-05-07T19:45:26.1746305Z 2025-05-07T19:45:26.1746309Z 2025-05-07T19:45:26.1746313Z 2025-05-07T19:45:26.1746317Z 2025-05-07T19:45:26.1746320Z 2025-05-07T19:45:26.1746325Z 2025-05-07T19:45:26.1746329Z 2025-05-07T19:45:26.1746334Z 2025-05-07T19:45:26.1746337Z 2025-05-07T19:45:26.1746342Z 2025-05-07T19:45:26.1746345Z 2025-05-07T19:45:26.1746348Z 2025-05-07T19:45:26.1748782Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:26.1749078Z 2025-05-07T19:45:26.1749092Z 2025-05-07T19:45:26.1749096Z 2025-05-07T19:45:26.1749100Z 2025-05-07T19:45:26.1749103Z 2025-05-07T19:45:26.1749108Z 2025-05-07T19:45:26.1749112Z 2025-05-07T19:45:26.1749115Z 2025-05-07T19:45:26.1749120Z 2025-05-07T19:45:26.1749143Z 2025-05-07T19:45:26.1749155Z 2025-05-07T19:45:26.1749159Z 2025-05-07T19:45:26.1749163Z 2025-05-07T19:45:26.1749190Z 2025-05-07T19:45:26.2579650Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:26.2870336Z openjdk-23.0.1 | 181.3 MB | ######5 | 65% 2025-05-07T19:45:26.2870636Z 2025-05-07T19:45:26.2870641Z 2025-05-07T19:45:26.2870661Z 2025-05-07T19:45:26.2870665Z 2025-05-07T19:45:26.2870668Z 2025-05-07T19:45:26.2870672Z 2025-05-07T19:45:26.2870675Z 2025-05-07T19:45:26.2870679Z 2025-05-07T19:45:26.2870682Z 2025-05-07T19:45:26.2870686Z 2025-05-07T19:45:26.2870689Z 2025-05-07T19:45:26.2870692Z 2025-05-07T19:45:26.2870696Z 2025-05-07T19:45:26.2870699Z 2025-05-07T19:45:26.2870703Z 2025-05-07T19:45:26.2870707Z 2025-05-07T19:45:26.2872513Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:26.2872852Z 2025-05-07T19:45:26.2872873Z 2025-05-07T19:45:26.2873084Z 2025-05-07T19:45:26.2873089Z 2025-05-07T19:45:26.2873093Z 2025-05-07T19:45:26.2873096Z 2025-05-07T19:45:26.2873099Z 2025-05-07T19:45:26.2873103Z 2025-05-07T19:45:26.2873108Z 2025-05-07T19:45:26.2873111Z 2025-05-07T19:45:26.2873115Z 2025-05-07T19:45:26.2873118Z 2025-05-07T19:45:26.2873121Z 2025-05-07T19:45:26.2873125Z 2025-05-07T19:45:26.2873128Z 2025-05-07T19:45:26.2873163Z 2025-05-07T19:45:26.4288899Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:26.5423228Z openjdk-23.0.1 | 181.3 MB | ######8 | 68% 2025-05-07T19:45:26.5424752Z 2025-05-07T19:45:26.5424779Z 2025-05-07T19:45:26.5424800Z 2025-05-07T19:45:26.5424823Z 2025-05-07T19:45:26.5424841Z 2025-05-07T19:45:26.5424860Z 2025-05-07T19:45:26.5424876Z 2025-05-07T19:45:26.5424892Z 2025-05-07T19:45:26.5424908Z 2025-05-07T19:45:26.5424924Z 2025-05-07T19:45:26.5424940Z 2025-05-07T19:45:26.5424958Z 2025-05-07T19:45:26.5425415Z 2025-05-07T19:45:26.5425483Z 2025-05-07T19:45:26.5425494Z 2025-05-07T19:45:26.5426615Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:26.5427611Z 2025-05-07T19:45:26.5427622Z 2025-05-07T19:45:26.5427632Z 2025-05-07T19:45:26.5427642Z 2025-05-07T19:45:26.5427652Z 2025-05-07T19:45:26.5427661Z 2025-05-07T19:45:26.5427671Z 2025-05-07T19:45:26.5427681Z 2025-05-07T19:45:26.5427719Z 2025-05-07T19:45:26.5427729Z 2025-05-07T19:45:26.5427739Z 2025-05-07T19:45:26.5427749Z 2025-05-07T19:45:26.5427759Z 2025-05-07T19:45:26.5427769Z 2025-05-07T19:45:26.5427779Z 2025-05-07T19:45:26.6264222Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:26.7172406Z openjdk-23.0.1 | 181.3 MB | #######1 | 71% 2025-05-07T19:45:26.7173107Z 2025-05-07T19:45:26.7173117Z 2025-05-07T19:45:26.7173125Z 2025-05-07T19:45:26.7173132Z 2025-05-07T19:45:26.7173140Z 2025-05-07T19:45:26.7173169Z 2025-05-07T19:45:26.7173188Z 2025-05-07T19:45:26.7173193Z 2025-05-07T19:45:26.7173200Z 2025-05-07T19:45:26.7173207Z 2025-05-07T19:45:26.7173212Z 2025-05-07T19:45:26.7173217Z 2025-05-07T19:45:26.7173249Z 2025-05-07T19:45:26.7173254Z 2025-05-07T19:45:26.7173260Z 2025-05-07T19:45:26.7173266Z 2025-05-07T19:45:26.7173271Z 2025-05-07T19:45:26.7173659Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:26.7173979Z 2025-05-07T19:45:26.7173982Z 2025-05-07T19:45:26.7173986Z 2025-05-07T19:45:26.7173989Z 2025-05-07T19:45:26.7174014Z 2025-05-07T19:45:26.7174018Z 2025-05-07T19:45:26.7174021Z 2025-05-07T19:45:26.7174024Z 2025-05-07T19:45:26.7174028Z 2025-05-07T19:45:26.7174032Z 2025-05-07T19:45:26.7174035Z 2025-05-07T19:45:26.7174038Z 2025-05-07T19:45:26.7174042Z 2025-05-07T19:45:26.7174045Z 2025-05-07T19:45:26.7174048Z 2025-05-07T19:45:26.7174051Z 2025-05-07T19:45:26.7174055Z 2025-05-07T19:45:26.7516612Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:26.7517143Z 2025-05-07T19:45:26.7517152Z 2025-05-07T19:45:26.7517160Z 2025-05-07T19:45:26.7517168Z 2025-05-07T19:45:26.7517173Z 2025-05-07T19:45:26.7517180Z 2025-05-07T19:45:26.7517188Z 2025-05-07T19:45:26.7517196Z 2025-05-07T19:45:26.7517201Z 2025-05-07T19:45:26.7517208Z 2025-05-07T19:45:26.7517216Z 2025-05-07T19:45:26.7517221Z 2025-05-07T19:45:26.7517228Z 2025-05-07T19:45:26.7517236Z 2025-05-07T19:45:26.7517241Z 2025-05-07T19:45:26.7517249Z 2025-05-07T19:45:26.7517279Z 2025-05-07T19:45:26.7517287Z 2025-05-07T19:45:26.7517858Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:26.7518194Z 2025-05-07T19:45:26.7518197Z 2025-05-07T19:45:26.7518201Z 2025-05-07T19:45:26.7518204Z 2025-05-07T19:45:26.7518208Z 2025-05-07T19:45:26.7518211Z 2025-05-07T19:45:26.7518215Z 2025-05-07T19:45:26.7518218Z 2025-05-07T19:45:26.7518249Z 2025-05-07T19:45:26.7518459Z 2025-05-07T19:45:26.7518463Z 2025-05-07T19:45:26.7518466Z 2025-05-07T19:45:26.7518470Z 2025-05-07T19:45:26.7518473Z 2025-05-07T19:45:26.7518477Z 2025-05-07T19:45:26.7518480Z 2025-05-07T19:45:26.7518483Z 2025-05-07T19:45:26.7518487Z 2025-05-07T19:45:26.7614751Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:26.8616092Z openjdk-23.0.1 | 181.3 MB | #######3 | 74% 2025-05-07T19:45:26.9983766Z openjdk-23.0.1 | 181.3 MB | #######7 | 78% 2025-05-07T19:45:27.0990137Z openjdk-23.0.1 | 181.3 MB | ######## | 80% 2025-05-07T19:45:27.2161757Z openjdk-23.0.1 | 181.3 MB | ########3 | 83% 2025-05-07T19:45:27.3164192Z openjdk-23.0.1 | 181.3 MB | ########5 | 86% 2025-05-07T19:45:27.4605009Z openjdk-23.0.1 | 181.3 MB | ########8 | 89% 2025-05-07T19:45:27.5606239Z openjdk-23.0.1 | 181.3 MB | #########1 | 91% 2025-05-07T19:45:27.6609233Z openjdk-23.0.1 | 181.3 MB | #########3 | 94% 2025-05-07T19:45:28.0542963Z openjdk-23.0.1 | 181.3 MB | #########7 | 97% 2025-05-07T19:45:28.0543513Z 2025-05-07T19:45:28.0543520Z 2025-05-07T19:45:28.0543526Z 2025-05-07T19:45:28.3777848Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:28.3778292Z 2025-05-07T19:45:28.3778298Z 2025-05-07T19:45:28.3778302Z 2025-05-07T19:45:28.3778306Z 2025-05-07T19:45:28.3778309Z 2025-05-07T19:45:28.3778313Z 2025-05-07T19:45:28.3778316Z 2025-05-07T19:45:28.3778346Z 2025-05-07T19:45:28.3778350Z 2025-05-07T19:45:28.3778354Z 2025-05-07T19:45:28.3778358Z 2025-05-07T19:45:28.3778362Z 2025-05-07T19:45:28.3778365Z 2025-05-07T19:45:28.3778368Z 2025-05-07T19:45:28.3778372Z 2025-05-07T19:45:28.3778375Z 2025-05-07T19:45:28.3778379Z 2025-05-07T19:45:28.3778382Z 2025-05-07T19:45:28.3778386Z 2025-05-07T19:45:28.3778657Z ... (more hidden) ... 2025-05-07T19:45:28.3779001Z 2025-05-07T19:45:28.3779018Z 2025-05-07T19:45:28.3779022Z 2025-05-07T19:45:28.3779025Z 2025-05-07T19:45:28.3779028Z 2025-05-07T19:45:28.3779032Z 2025-05-07T19:45:28.3779035Z 2025-05-07T19:45:28.3779039Z 2025-05-07T19:45:28.3779042Z 2025-05-07T19:45:28.3779045Z 2025-05-07T19:45:28.3779049Z 2025-05-07T19:45:28.3779052Z 2025-05-07T19:45:28.3779056Z 2025-05-07T19:45:28.3779059Z 2025-05-07T19:45:28.3779062Z 2025-05-07T19:45:28.3779066Z 2025-05-07T19:45:28.3779069Z 2025-05-07T19:45:28.3779072Z 2025-05-07T19:45:28.3779076Z 2025-05-07T19:45:28.4006390Z ... (more hidden) ... 2025-05-07T19:45:28.4006902Z 2025-05-07T19:45:28.4006907Z 2025-05-07T19:45:29.7374500Z python-3.10.17 | 23.9 MB | ########## | 100%  2025-05-07T19:45:29.7473658Z openjdk-23.0.1 | 181.3 MB | ########## | 100% 2025-05-07T19:45:29.7475165Z 2025-05-07T19:45:30.4976480Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:30.4981161Z openjdk-23.0.1 | 181.3 MB | ########## | 100% 2025-05-07T19:45:30.4981711Z 2025-05-07T19:45:30.4981720Z 2025-05-07T19:45:30.4981726Z 2025-05-07T19:45:30.4981734Z 2025-05-07T19:45:30.4981741Z 2025-05-07T19:45:30.4981749Z 2025-05-07T19:45:30.4981755Z 2025-05-07T19:45:30.4981762Z 2025-05-07T19:45:30.4981770Z 2025-05-07T19:45:30.4981776Z 2025-05-07T19:45:30.4981783Z 2025-05-07T19:45:30.4981791Z 2025-05-07T19:45:30.4981797Z 2025-05-07T19:45:30.4981804Z 2025-05-07T19:45:30.4981810Z 2025-05-07T19:45:30.4981815Z 2025-05-07T19:45:30.4981820Z 2025-05-07T19:45:30.4981825Z 2025-05-07T19:45:30.4981830Z 2025-05-07T19:45:30.4982028Z 2025-05-07T19:45:30.4982649Z  2025-05-07T19:45:30.4983310Z 2025-05-07T19:45:30.4983714Z 2025-05-07T19:45:30.4984010Z  2025-05-07T19:45:30.4984418Z 2025-05-07T19:45:30.4984433Z 2025-05-07T19:45:30.4985157Z  2025-05-07T19:45:30.4985621Z 2025-05-07T19:45:30.4985629Z 2025-05-07T19:45:30.4985639Z 2025-05-07T19:45:30.4985936Z  2025-05-07T19:45:30.4986317Z 2025-05-07T19:45:30.4986324Z 2025-05-07T19:45:30.4986333Z 2025-05-07T19:45:30.4986339Z 2025-05-07T19:45:30.4986658Z  2025-05-07T19:45:30.4987021Z 2025-05-07T19:45:30.4987030Z 2025-05-07T19:45:30.4987035Z 2025-05-07T19:45:30.4987043Z 2025-05-07T19:45:30.4987048Z 2025-05-07T19:45:30.4987321Z  2025-05-07T19:45:30.4987806Z 2025-05-07T19:45:30.4987814Z 2025-05-07T19:45:30.4987820Z 2025-05-07T19:45:30.4987828Z 2025-05-07T19:45:30.4987835Z 2025-05-07T19:45:30.4987844Z 2025-05-07T19:45:30.4988190Z  2025-05-07T19:45:30.4988864Z 2025-05-07T19:45:30.4988873Z 2025-05-07T19:45:30.4988878Z 2025-05-07T19:45:30.4988886Z 2025-05-07T19:45:30.4988894Z 2025-05-07T19:45:30.4988902Z 2025-05-07T19:45:30.4988907Z 2025-05-07T19:45:30.4989215Z  2025-05-07T19:45:30.4989625Z 2025-05-07T19:45:30.4989661Z 2025-05-07T19:45:30.4989666Z 2025-05-07T19:45:30.4989673Z 2025-05-07T19:45:30.4989680Z 2025-05-07T19:45:30.4989685Z 2025-05-07T19:45:30.4989691Z 2025-05-07T19:45:30.4989696Z 2025-05-07T19:45:30.4990038Z  2025-05-07T19:45:30.4990443Z 2025-05-07T19:45:30.4990448Z 2025-05-07T19:45:30.4990454Z 2025-05-07T19:45:30.4990492Z 2025-05-07T19:45:30.4990497Z 2025-05-07T19:45:30.4990501Z 2025-05-07T19:45:30.4990506Z 2025-05-07T19:45:30.4990511Z 2025-05-07T19:45:30.4990515Z 2025-05-07T19:45:30.4990828Z  2025-05-07T19:45:30.4991184Z 2025-05-07T19:45:30.4991188Z 2025-05-07T19:45:30.4991193Z 2025-05-07T19:45:30.4991200Z 2025-05-07T19:45:30.4991239Z 2025-05-07T19:45:30.4991245Z 2025-05-07T19:45:30.4991250Z 2025-05-07T19:45:30.4991254Z 2025-05-07T19:45:30.4991259Z 2025-05-07T19:45:30.4991265Z 2025-05-07T19:45:30.4991687Z  2025-05-07T19:45:30.4992052Z 2025-05-07T19:45:30.4992057Z 2025-05-07T19:45:30.4992061Z 2025-05-07T19:45:30.4992066Z 2025-05-07T19:45:30.4992070Z 2025-05-07T19:45:30.4992076Z 2025-05-07T19:45:30.4992081Z 2025-05-07T19:45:30.4992085Z 2025-05-07T19:45:30.4992090Z 2025-05-07T19:45:30.4992095Z 2025-05-07T19:45:30.4992099Z 2025-05-07T19:45:30.4992449Z  2025-05-07T19:45:30.4992801Z 2025-05-07T19:45:30.4992807Z 2025-05-07T19:45:30.4992811Z 2025-05-07T19:45:30.4992815Z 2025-05-07T19:45:30.4992825Z 2025-05-07T19:45:30.4992834Z 2025-05-07T19:45:30.4992839Z 2025-05-07T19:45:30.4992844Z 2025-05-07T19:45:30.4992850Z 2025-05-07T19:45:30.4992855Z 2025-05-07T19:45:30.4992859Z 2025-05-07T19:45:30.4992865Z 2025-05-07T19:45:30.4993295Z  2025-05-07T19:45:30.4993756Z 2025-05-07T19:45:30.4993760Z 2025-05-07T19:45:30.4993765Z 2025-05-07T19:45:30.4993772Z 2025-05-07T19:45:30.4993777Z 2025-05-07T19:45:30.4993784Z 2025-05-07T19:45:30.4993789Z 2025-05-07T19:45:30.4993794Z 2025-05-07T19:45:30.4993808Z 2025-05-07T19:45:30.4993813Z 2025-05-07T19:45:30.4993818Z 2025-05-07T19:45:30.4993822Z 2025-05-07T19:45:30.4993862Z 2025-05-07T19:45:30.4994224Z  2025-05-07T19:45:30.4994628Z 2025-05-07T19:45:30.4994633Z 2025-05-07T19:45:30.4994638Z 2025-05-07T19:45:30.4994642Z 2025-05-07T19:45:30.4994647Z 2025-05-07T19:45:30.4994660Z 2025-05-07T19:45:30.4994775Z 2025-05-07T19:45:30.4994784Z 2025-05-07T19:45:30.4994825Z 2025-05-07T19:45:30.4994829Z 2025-05-07T19:45:30.4994834Z 2025-05-07T19:45:30.4994839Z 2025-05-07T19:45:30.4994843Z 2025-05-07T19:45:30.4994848Z 2025-05-07T19:45:30.4995210Z  2025-05-07T19:45:30.4995665Z 2025-05-07T19:45:30.4995669Z 2025-05-07T19:45:30.4995674Z 2025-05-07T19:45:30.4995680Z 2025-05-07T19:45:30.4995724Z 2025-05-07T19:45:30.4995729Z 2025-05-07T19:45:30.4995733Z 2025-05-07T19:45:30.4995738Z 2025-05-07T19:45:30.4995743Z 2025-05-07T19:45:30.4995749Z 2025-05-07T19:45:30.4995753Z 2025-05-07T19:45:30.4995758Z 2025-05-07T19:45:30.4995763Z 2025-05-07T19:45:30.4995767Z 2025-05-07T19:45:30.4995773Z 2025-05-07T19:45:30.4996113Z  2025-05-07T19:45:30.4996545Z 2025-05-07T19:45:30.4996552Z 2025-05-07T19:45:30.4996639Z 2025-05-07T19:45:30.4996648Z 2025-05-07T19:45:30.4996653Z 2025-05-07T19:45:30.4996660Z 2025-05-07T19:45:30.4996665Z 2025-05-07T19:45:30.4996670Z 2025-05-07T19:45:30.4996676Z 2025-05-07T19:45:30.4996680Z 2025-05-07T19:45:30.4996687Z 2025-05-07T19:45:30.4996691Z 2025-05-07T19:45:30.4996696Z 2025-05-07T19:45:30.4996700Z 2025-05-07T19:45:30.4996705Z 2025-05-07T19:45:30.4996711Z 2025-05-07T19:45:30.4997071Z  2025-05-07T19:45:30.4997498Z 2025-05-07T19:45:30.4997503Z 2025-05-07T19:45:30.4997511Z 2025-05-07T19:45:30.4997516Z 2025-05-07T19:45:30.4997521Z 2025-05-07T19:45:30.4997526Z 2025-05-07T19:45:30.4997530Z 2025-05-07T19:45:30.4997534Z 2025-05-07T19:45:30.4997538Z 2025-05-07T19:45:30.4997543Z 2025-05-07T19:45:30.4997547Z 2025-05-07T19:45:30.4997552Z 2025-05-07T19:45:30.4997556Z 2025-05-07T19:45:30.4997561Z 2025-05-07T19:45:30.4997566Z 2025-05-07T19:45:30.4997571Z 2025-05-07T19:45:30.4997619Z 2025-05-07T19:45:30.4997981Z  2025-05-07T19:45:30.4998443Z 2025-05-07T19:45:30.4998447Z 2025-05-07T19:45:30.4998452Z 2025-05-07T19:45:30.4998456Z 2025-05-07T19:45:30.4998465Z 2025-05-07T19:45:30.4998469Z 2025-05-07T19:45:30.4998474Z 2025-05-07T19:45:30.4998479Z 2025-05-07T19:45:30.4998484Z 2025-05-07T19:45:30.4998515Z 2025-05-07T19:45:30.4998523Z 2025-05-07T19:45:30.4998529Z 2025-05-07T19:45:30.4998536Z 2025-05-07T19:45:30.4998544Z 2025-05-07T19:45:30.4998550Z 2025-05-07T19:45:30.4998557Z 2025-05-07T19:45:30.4998565Z 2025-05-07T19:45:30.4998574Z 2025-05-07T19:45:30.4998958Z  2025-05-07T19:45:30.4999427Z 2025-05-07T19:45:30.4999433Z 2025-05-07T19:45:30.4999604Z  2025-05-07T19:45:30.4999796Z 2025-05-07T19:45:30.4999801Z 2025-05-07T19:45:30.4999986Z  2025-05-07T19:45:30.5000190Z 2025-05-07T19:45:30.5000200Z 2025-05-07T19:45:30.5000205Z 2025-05-07T19:45:30.5000372Z  2025-05-07T19:45:30.5000536Z 2025-05-07T19:45:30.5000540Z 2025-05-07T19:45:30.5000545Z 2025-05-07T19:45:30.5000579Z 2025-05-07T19:45:30.5000739Z  2025-05-07T19:45:30.5000953Z 2025-05-07T19:45:30.5000959Z 2025-05-07T19:45:30.5000966Z 2025-05-07T19:45:30.5000974Z 2025-05-07T19:45:30.5000979Z 2025-05-07T19:45:30.5001202Z  2025-05-07T19:45:30.5001398Z 2025-05-07T19:45:30.5001402Z 2025-05-07T19:45:30.5001410Z 2025-05-07T19:45:30.5001414Z 2025-05-07T19:45:30.5001418Z 2025-05-07T19:45:30.5001423Z 2025-05-07T19:45:30.5001609Z  2025-05-07T19:45:30.5001834Z 2025-05-07T19:45:30.5001838Z 2025-05-07T19:45:30.5001843Z 2025-05-07T19:45:30.5001848Z 2025-05-07T19:45:30.5001853Z 2025-05-07T19:45:30.5001859Z 2025-05-07T19:45:30.5001863Z 2025-05-07T19:45:30.5002087Z  2025-05-07T19:45:30.5002347Z 2025-05-07T19:45:30.5002393Z 2025-05-07T19:45:30.5002487Z 2025-05-07T19:45:30.5002492Z 2025-05-07T19:45:30.5002498Z 2025-05-07T19:45:30.5002503Z 2025-05-07T19:45:30.5002507Z 2025-05-07T19:45:30.5002511Z 2025-05-07T19:45:30.5002710Z  2025-05-07T19:45:30.5002968Z 2025-05-07T19:45:30.5002972Z 2025-05-07T19:45:30.5002977Z 2025-05-07T19:45:30.5002982Z 2025-05-07T19:45:30.5003018Z 2025-05-07T19:45:30.5003024Z 2025-05-07T19:45:30.5003031Z 2025-05-07T19:45:30.5003039Z 2025-05-07T19:45:30.5003045Z 2025-05-07T19:45:30.5003247Z  2025-05-07T19:45:30.5003532Z 2025-05-07T19:45:30.5003539Z 2025-05-07T19:45:30.5003547Z 2025-05-07T19:45:30.5003551Z 2025-05-07T19:45:30.5003556Z 2025-05-07T19:45:30.5003564Z 2025-05-07T19:45:30.5003601Z 2025-05-07T19:45:30.5003606Z 2025-05-07T19:45:30.5003613Z 2025-05-07T19:45:30.5003621Z 2025-05-07T19:45:30.5003843Z  2025-05-07T19:45:30.5004107Z 2025-05-07T19:45:30.5004112Z 2025-05-07T19:45:30.5004116Z 2025-05-07T19:45:30.5004237Z 2025-05-07T19:45:30.5004246Z 2025-05-07T19:45:30.5004254Z 2025-05-07T19:45:30.5004258Z 2025-05-07T19:45:30.5004294Z 2025-05-07T19:45:30.5004299Z 2025-05-07T19:45:30.5004303Z 2025-05-07T19:45:30.5004308Z 2025-05-07T19:45:30.5004502Z  2025-05-07T19:45:30.5004754Z 2025-05-07T19:45:30.5004759Z 2025-05-07T19:45:30.5004763Z 2025-05-07T19:45:30.5004769Z 2025-05-07T19:45:30.5004773Z 2025-05-07T19:45:30.5004777Z 2025-05-07T19:45:30.5004781Z 2025-05-07T19:45:30.5004821Z 2025-05-07T19:45:30.5004826Z 2025-05-07T19:45:30.5004831Z 2025-05-07T19:45:30.5004838Z 2025-05-07T19:45:30.5004843Z 2025-05-07T19:45:30.5005045Z  2025-05-07T19:45:30.5005359Z 2025-05-07T19:45:30.5005365Z 2025-05-07T19:45:30.5005370Z 2025-05-07T19:45:30.5005651Z 2025-05-07T19:45:30.5005696Z 2025-05-07T19:45:30.5005701Z 2025-05-07T19:45:30.5005708Z 2025-05-07T19:45:30.5005716Z 2025-05-07T19:45:30.5005722Z 2025-05-07T19:45:30.5005729Z 2025-05-07T19:45:30.5005745Z 2025-05-07T19:45:30.5005757Z 2025-05-07T19:45:30.5005762Z 2025-05-07T19:45:30.5006049Z  2025-05-07T19:45:30.5006361Z 2025-05-07T19:45:30.5006365Z 2025-05-07T19:45:30.5006394Z 2025-05-07T19:45:30.5006397Z 2025-05-07T19:45:30.5006401Z 2025-05-07T19:45:30.5006404Z 2025-05-07T19:45:30.5006407Z 2025-05-07T19:45:30.5006411Z 2025-05-07T19:45:30.5006415Z 2025-05-07T19:45:30.5006418Z 2025-05-07T19:45:30.5006421Z 2025-05-07T19:45:30.5006425Z 2025-05-07T19:45:30.5006428Z 2025-05-07T19:45:30.5006431Z 2025-05-07T19:45:30.5006592Z  2025-05-07T19:45:30.5006832Z 2025-05-07T19:45:30.5006836Z 2025-05-07T19:45:30.5006840Z 2025-05-07T19:45:30.5006843Z 2025-05-07T19:45:30.5006847Z 2025-05-07T19:45:30.5006850Z 2025-05-07T19:45:30.5006853Z 2025-05-07T19:45:30.5006856Z 2025-05-07T19:45:30.5006860Z 2025-05-07T19:45:30.5006863Z 2025-05-07T19:45:30.5006866Z 2025-05-07T19:45:30.5006870Z 2025-05-07T19:45:30.5006877Z 2025-05-07T19:45:30.5006884Z 2025-05-07T19:45:30.5006887Z 2025-05-07T19:45:30.5007048Z  2025-05-07T19:45:30.5007295Z 2025-05-07T19:45:30.5007298Z 2025-05-07T19:45:30.5007302Z 2025-05-07T19:45:30.5007306Z 2025-05-07T19:45:30.5007309Z 2025-05-07T19:45:30.5007313Z 2025-05-07T19:45:30.5007316Z 2025-05-07T19:45:30.5007320Z 2025-05-07T19:45:30.5007323Z 2025-05-07T19:45:30.5007327Z 2025-05-07T19:45:30.5007330Z 2025-05-07T19:45:30.5007333Z 2025-05-07T19:45:30.5007336Z 2025-05-07T19:45:30.5007340Z 2025-05-07T19:45:30.5007344Z 2025-05-07T19:45:30.5007347Z 2025-05-07T19:45:30.5007546Z  2025-05-07T19:45:30.5007766Z 2025-05-07T19:45:30.5007769Z 2025-05-07T19:45:30.5007773Z 2025-05-07T19:45:30.5007776Z 2025-05-07T19:45:30.5007779Z 2025-05-07T19:45:30.5007783Z 2025-05-07T19:45:30.5007786Z 2025-05-07T19:45:30.5007789Z 2025-05-07T19:45:30.5007792Z 2025-05-07T19:45:30.5007796Z 2025-05-07T19:45:30.5007802Z 2025-05-07T19:45:30.5007948Z 2025-05-07T19:45:30.5007953Z 2025-05-07T19:45:30.5007982Z 2025-05-07T19:45:30.5007985Z 2025-05-07T19:45:30.5007989Z 2025-05-07T19:45:30.5007992Z 2025-05-07T19:45:30.5008160Z  2025-05-07T19:45:30.5008387Z 2025-05-07T19:45:30.5008391Z 2025-05-07T19:45:30.5008394Z 2025-05-07T19:45:30.5008398Z 2025-05-07T19:45:30.5008401Z 2025-05-07T19:45:30.5008405Z 2025-05-07T19:45:30.5008433Z 2025-05-07T19:45:30.5008436Z 2025-05-07T19:45:30.5008440Z 2025-05-07T19:45:30.5008443Z 2025-05-07T19:45:30.5008446Z 2025-05-07T19:45:30.5008450Z 2025-05-07T19:45:30.5008453Z 2025-05-07T19:45:30.5008456Z 2025-05-07T19:45:30.5008460Z 2025-05-07T19:45:30.5008463Z 2025-05-07T19:45:30.5008466Z 2025-05-07T19:45:30.5008470Z 2025-05-07T19:45:30.5008645Z  2025-05-07T19:45:30.5008902Z 2025-05-07T19:45:30.5008905Z 2025-05-07T19:45:30.5009015Z  2025-05-07T19:45:30.5009135Z 2025-05-07T19:45:30.5009196Z 2025-05-07T19:45:30.5009341Z  2025-05-07T19:45:30.5009535Z 2025-05-07T19:45:30.5009542Z 2025-05-07T19:45:30.5009546Z 2025-05-07T19:45:30.5009738Z  2025-05-07T19:45:30.5009977Z 2025-05-07T19:45:30.5009983Z 2025-05-07T19:45:30.5010028Z 2025-05-07T19:45:30.5010036Z 2025-05-07T19:45:30.5010260Z  2025-05-07T19:45:30.5010473Z 2025-05-07T19:45:30.5010477Z 2025-05-07T19:45:30.5010481Z 2025-05-07T19:45:30.5010484Z 2025-05-07T19:45:30.5010488Z 2025-05-07T19:45:30.5010636Z  2025-05-07T19:45:30.5010774Z 2025-05-07T19:45:30.5010777Z 2025-05-07T19:45:30.5010781Z 2025-05-07T19:45:30.5010784Z 2025-05-07T19:45:30.5010788Z 2025-05-07T19:45:30.5010791Z 2025-05-07T19:45:30.5010910Z  2025-05-07T19:45:30.5011080Z 2025-05-07T19:45:30.5011084Z 2025-05-07T19:45:30.5011087Z 2025-05-07T19:45:30.5011091Z 2025-05-07T19:45:30.5011095Z 2025-05-07T19:45:30.5011098Z 2025-05-07T19:45:30.5011101Z 2025-05-07T19:45:30.5011225Z  2025-05-07T19:45:30.5011414Z 2025-05-07T19:45:30.5011418Z 2025-05-07T19:45:30.5011421Z 2025-05-07T19:45:30.5011425Z 2025-05-07T19:45:30.5011428Z 2025-05-07T19:45:30.5011431Z 2025-05-07T19:45:30.5011435Z 2025-05-07T19:45:30.5011438Z 2025-05-07T19:45:30.5011565Z  2025-05-07T19:45:30.5011725Z 2025-05-07T19:45:30.5011728Z 2025-05-07T19:45:30.5011732Z 2025-05-07T19:45:30.5011761Z 2025-05-07T19:45:30.5011765Z 2025-05-07T19:45:30.5011768Z 2025-05-07T19:45:30.5011772Z 2025-05-07T19:45:30.5011775Z 2025-05-07T19:45:30.5011778Z 2025-05-07T19:45:30.5011910Z  2025-05-07T19:45:30.5012083Z 2025-05-07T19:45:30.5012086Z 2025-05-07T19:45:30.5012090Z 2025-05-07T19:45:30.5012093Z 2025-05-07T19:45:30.5012097Z 2025-05-07T19:45:30.5012124Z 2025-05-07T19:45:30.5012128Z 2025-05-07T19:45:30.5012131Z 2025-05-07T19:45:30.5012134Z 2025-05-07T19:45:30.5012138Z 2025-05-07T19:45:30.5012272Z  2025-05-07T19:45:30.5012447Z 2025-05-07T19:45:30.5012457Z 2025-05-07T19:45:30.5012461Z 2025-05-07T19:45:30.5012464Z 2025-05-07T19:45:30.5012467Z 2025-05-07T19:45:30.5012471Z 2025-05-07T19:45:30.5012498Z 2025-05-07T19:45:30.5012501Z 2025-05-07T19:45:30.5012504Z 2025-05-07T19:45:30.5012508Z 2025-05-07T19:45:30.5012511Z 2025-05-07T19:45:30.5012646Z  2025-05-07T19:45:30.5012831Z 2025-05-07T19:45:30.5012834Z 2025-05-07T19:45:30.5012838Z 2025-05-07T19:45:30.5012841Z 2025-05-07T19:45:30.5012845Z 2025-05-07T19:45:30.5012849Z 2025-05-07T19:45:30.5012875Z 2025-05-07T19:45:30.5012878Z 2025-05-07T19:45:30.5012881Z 2025-05-07T19:45:30.5012885Z 2025-05-07T19:45:30.5012888Z 2025-05-07T19:45:30.5012891Z 2025-05-07T19:45:30.5013032Z  2025-05-07T19:45:30.5013225Z 2025-05-07T19:45:30.5013229Z 2025-05-07T19:45:30.5013232Z 2025-05-07T19:45:30.5013236Z 2025-05-07T19:45:30.5013263Z 2025-05-07T19:45:30.5013266Z 2025-05-07T19:45:30.5013270Z 2025-05-07T19:45:30.5013277Z 2025-05-07T19:45:30.5013342Z 2025-05-07T19:45:30.5013347Z 2025-05-07T19:45:30.5013350Z 2025-05-07T19:45:30.5013353Z 2025-05-07T19:45:30.5013357Z 2025-05-07T19:45:30.5013502Z  2025-05-07T19:45:30.5013706Z 2025-05-07T19:45:30.5013710Z 2025-05-07T19:45:30.5013736Z 2025-05-07T19:45:30.5013740Z 2025-05-07T19:45:30.5013743Z 2025-05-07T19:45:30.5013746Z 2025-05-07T19:45:30.5013749Z 2025-05-07T19:45:30.5013753Z 2025-05-07T19:45:30.5013756Z 2025-05-07T19:45:30.5013760Z 2025-05-07T19:45:30.5013763Z 2025-05-07T19:45:30.5013766Z 2025-05-07T19:45:30.5013770Z 2025-05-07T19:45:30.5013773Z 2025-05-07T19:45:30.5013923Z  2025-05-07T19:45:30.5014162Z 2025-05-07T19:45:30.5014166Z 2025-05-07T19:45:30.5014169Z 2025-05-07T19:45:30.5014173Z 2025-05-07T19:45:30.5014176Z 2025-05-07T19:45:30.5014179Z 2025-05-07T19:45:30.5014183Z 2025-05-07T19:45:30.5014186Z 2025-05-07T19:45:30.5014189Z 2025-05-07T19:45:30.5014193Z 2025-05-07T19:45:30.5014251Z 2025-05-07T19:45:30.5014257Z 2025-05-07T19:45:30.5014261Z 2025-05-07T19:45:30.5014264Z 2025-05-07T19:45:30.5014267Z 2025-05-07T19:45:30.5014426Z  2025-05-07T19:45:30.5014664Z 2025-05-07T19:45:30.5014667Z 2025-05-07T19:45:30.5014671Z 2025-05-07T19:45:30.5014674Z 2025-05-07T19:45:30.5014678Z 2025-05-07T19:45:30.5014681Z 2025-05-07T19:45:30.5014684Z 2025-05-07T19:45:30.5014688Z 2025-05-07T19:45:30.5014691Z 2025-05-07T19:45:30.5014694Z 2025-05-07T19:45:30.5014698Z 2025-05-07T19:45:30.5014701Z 2025-05-07T19:45:30.5014704Z 2025-05-07T19:45:30.5014708Z 2025-05-07T19:45:30.5014711Z 2025-05-07T19:45:30.5014714Z 2025-05-07T19:45:30.5014902Z  2025-05-07T19:45:30.5015122Z 2025-05-07T19:45:30.5015125Z 2025-05-07T19:45:30.5015129Z 2025-05-07T19:45:30.5015133Z 2025-05-07T19:45:30.5015136Z 2025-05-07T19:45:30.5015139Z 2025-05-07T19:45:30.5015143Z 2025-05-07T19:45:30.5015146Z 2025-05-07T19:45:30.5015153Z 2025-05-07T19:45:30.5015159Z 2025-05-07T19:45:30.5015163Z 2025-05-07T19:45:30.5015167Z 2025-05-07T19:45:30.5015170Z 2025-05-07T19:45:30.5015199Z 2025-05-07T19:45:30.5015202Z 2025-05-07T19:45:30.5015205Z 2025-05-07T19:45:30.5015209Z 2025-05-07T19:45:30.5015377Z  2025-05-07T19:45:30.5015716Z 2025-05-07T19:45:30.5015719Z 2025-05-07T19:45:30.5015722Z 2025-05-07T19:45:30.5015726Z 2025-05-07T19:45:30.5015729Z 2025-05-07T19:45:30.5015732Z 2025-05-07T19:45:30.5015760Z 2025-05-07T19:45:30.5015763Z 2025-05-07T19:45:30.5015766Z 2025-05-07T19:45:30.5015769Z 2025-05-07T19:45:30.5015773Z 2025-05-07T19:45:30.5015776Z 2025-05-07T19:45:30.5015779Z 2025-05-07T19:45:30.5015782Z 2025-05-07T19:45:30.5015785Z 2025-05-07T19:45:30.5015789Z 2025-05-07T19:45:30.5015792Z 2025-05-07T19:45:30.5015795Z 2025-05-07T19:45:30.5015966Z  2025-05-07T19:45:30.5016216Z 2025-05-07T19:45:30.5016223Z 2025-05-07T19:45:30.5016330Z  2025-05-07T19:45:30.5016444Z 2025-05-07T19:45:30.5016447Z 2025-05-07T19:45:30.5016580Z  2025-05-07T19:45:30.5016695Z 2025-05-07T19:45:30.5016699Z 2025-05-07T19:45:30.5016703Z 2025-05-07T19:45:30.5016810Z  2025-05-07T19:45:30.5016930Z 2025-05-07T19:45:30.5016934Z 2025-05-07T19:45:30.5016964Z 2025-05-07T19:45:30.5016967Z 2025-05-07T19:45:30.5017077Z  2025-05-07T19:45:30.5017201Z 2025-05-07T19:45:30.5017204Z 2025-05-07T19:45:30.5017208Z 2025-05-07T19:45:30.5017211Z 2025-05-07T19:45:30.5017215Z 2025-05-07T19:45:30.5017360Z  2025-05-07T19:45:30.5017490Z 2025-05-07T19:45:30.5017494Z 2025-05-07T19:45:30.5017497Z 2025-05-07T19:45:30.5017501Z 2025-05-07T19:45:30.5017504Z 2025-05-07T19:45:30.5017507Z 2025-05-07T19:45:30.5017625Z  2025-05-07T19:45:30.5017785Z 2025-05-07T19:45:30.5017788Z 2025-05-07T19:45:30.5017792Z 2025-05-07T19:45:30.5017795Z 2025-05-07T19:45:30.5017799Z 2025-05-07T19:45:30.5017806Z 2025-05-07T19:45:30.5017864Z 2025-05-07T19:45:30.5017985Z  2025-05-07T19:45:30.5018162Z 2025-05-07T19:45:30.5018165Z 2025-05-07T19:45:30.5018168Z 2025-05-07T19:45:30.5018172Z 2025-05-07T19:45:30.5018175Z 2025-05-07T19:45:30.5018178Z 2025-05-07T19:45:30.5018181Z 2025-05-07T19:45:30.5018185Z 2025-05-07T19:45:30.5018308Z  2025-05-07T19:45:30.5018462Z 2025-05-07T19:45:30.5018466Z 2025-05-07T19:45:30.5018469Z 2025-05-07T19:45:30.5018496Z 2025-05-07T19:45:30.5018499Z 2025-05-07T19:45:30.5018502Z 2025-05-07T19:45:30.5018505Z 2025-05-07T19:45:30.5018509Z 2025-05-07T19:45:30.5018512Z 2025-05-07T19:45:30.5018637Z  2025-05-07T19:45:30.5018801Z 2025-05-07T19:45:30.5018805Z 2025-05-07T19:45:30.5018808Z 2025-05-07T19:45:30.5018812Z 2025-05-07T19:45:30.5018816Z 2025-05-07T19:45:30.5018842Z 2025-05-07T19:45:30.5018846Z 2025-05-07T19:45:30.5018849Z 2025-05-07T19:45:30.5018852Z 2025-05-07T19:45:30.5018856Z 2025-05-07T19:45:30.5019053Z  2025-05-07T19:45:30.5019225Z 2025-05-07T19:45:30.5019228Z 2025-05-07T19:45:30.5019232Z 2025-05-07T19:45:30.5019235Z 2025-05-07T19:45:30.5019238Z 2025-05-07T19:45:30.5019241Z 2025-05-07T19:45:30.5019268Z 2025-05-07T19:45:30.5019271Z 2025-05-07T19:45:30.5019274Z 2025-05-07T19:45:30.5019278Z 2025-05-07T19:45:30.5019281Z 2025-05-07T19:45:30.5019527Z  2025-05-07T19:45:30.5019710Z 2025-05-07T19:45:30.5019714Z 2025-05-07T19:45:30.5019717Z 2025-05-07T19:45:30.5019720Z 2025-05-07T19:45:30.5019723Z 2025-05-07T19:45:30.5019727Z 2025-05-07T19:45:30.5019933Z 2025-05-07T19:45:30.5019936Z 2025-05-07T19:45:30.5019939Z 2025-05-07T19:45:30.5019943Z 2025-05-07T19:45:30.5019946Z 2025-05-07T19:45:30.5019949Z 2025-05-07T19:45:30.5020097Z  2025-05-07T19:45:30.5020293Z 2025-05-07T19:45:30.5020297Z 2025-05-07T19:45:30.5020300Z 2025-05-07T19:45:30.5020304Z 2025-05-07T19:45:30.5020335Z 2025-05-07T19:45:30.5020343Z 2025-05-07T19:45:30.5020350Z 2025-05-07T19:45:30.5020353Z 2025-05-07T19:45:30.5020357Z 2025-05-07T19:45:30.5020360Z 2025-05-07T19:45:30.5020364Z 2025-05-07T19:45:30.5020367Z 2025-05-07T19:45:30.5020370Z 2025-05-07T19:45:30.5020517Z  2025-05-07T19:45:30.5020716Z 2025-05-07T19:45:30.5020720Z 2025-05-07T19:45:30.5020747Z 2025-05-07T19:45:30.5020751Z 2025-05-07T19:45:30.5020754Z 2025-05-07T19:45:30.5020757Z 2025-05-07T19:45:30.5020761Z 2025-05-07T19:45:30.5020764Z 2025-05-07T19:45:30.5020767Z 2025-05-07T19:45:30.5020771Z 2025-05-07T19:45:30.5020775Z 2025-05-07T19:45:30.5020778Z 2025-05-07T19:45:30.5020781Z 2025-05-07T19:45:30.5020785Z 2025-05-07T19:45:30.5020934Z  2025-05-07T19:45:30.5021173Z 2025-05-07T19:45:30.5021176Z 2025-05-07T19:45:30.5021180Z 2025-05-07T19:45:30.5021183Z 2025-05-07T19:45:30.5021187Z 2025-05-07T19:45:30.5021191Z 2025-05-07T19:45:30.5021194Z 2025-05-07T19:45:30.5021201Z 2025-05-07T19:45:30.5021207Z 2025-05-07T19:45:30.5021210Z 2025-05-07T19:45:30.5021214Z 2025-05-07T19:45:30.5021217Z 2025-05-07T19:45:30.5021220Z 2025-05-07T19:45:30.5021224Z 2025-05-07T19:45:30.5021227Z 2025-05-07T19:45:30.5021383Z  2025-05-07T19:45:30.5021623Z 2025-05-07T19:45:30.5021627Z 2025-05-07T19:45:30.5021630Z 2025-05-07T19:45:30.5021633Z 2025-05-07T19:45:30.5021637Z 2025-05-07T19:45:30.5021640Z 2025-05-07T19:45:30.5021643Z 2025-05-07T19:45:30.5021647Z 2025-05-07T19:45:30.5021650Z 2025-05-07T19:45:30.5021653Z 2025-05-07T19:45:30.5021656Z 2025-05-07T19:45:30.5021660Z 2025-05-07T19:45:30.5021663Z 2025-05-07T19:45:30.5021666Z 2025-05-07T19:45:30.5021670Z 2025-05-07T19:45:30.5021673Z 2025-05-07T19:45:30.5021857Z  2025-05-07T19:45:30.5022318Z 2025-05-07T19:45:30.5022323Z 2025-05-07T19:45:30.5022326Z 2025-05-07T19:45:30.5022330Z 2025-05-07T19:45:30.5022333Z 2025-05-07T19:45:30.5022337Z 2025-05-07T19:45:30.5022486Z 2025-05-07T19:45:30.5022491Z 2025-05-07T19:45:30.5022494Z 2025-05-07T19:45:30.5022498Z 2025-05-07T19:45:30.5022501Z 2025-05-07T19:45:30.5022504Z 2025-05-07T19:45:30.5022508Z 2025-05-07T19:45:30.5022536Z 2025-05-07T19:45:30.5022540Z 2025-05-07T19:45:30.5022543Z 2025-05-07T19:45:30.5022546Z 2025-05-07T19:45:30.5022726Z  2025-05-07T19:45:30.5022954Z 2025-05-07T19:45:30.5022958Z 2025-05-07T19:45:30.5022961Z 2025-05-07T19:45:30.5022965Z 2025-05-07T19:45:30.5022968Z 2025-05-07T19:45:30.5022972Z 2025-05-07T19:45:30.5023001Z 2025-05-07T19:45:30.5023004Z 2025-05-07T19:45:30.5023008Z 2025-05-07T19:45:30.5023011Z 2025-05-07T19:45:30.5023015Z 2025-05-07T19:45:30.5023018Z 2025-05-07T19:45:30.5023021Z 2025-05-07T19:45:30.5023025Z 2025-05-07T19:45:30.5023028Z 2025-05-07T19:45:30.5023031Z 2025-05-07T19:45:30.5023035Z 2025-05-07T19:45:30.5023038Z 2025-05-07T19:45:30.5023212Z  2025-05-07T19:45:30.5023553Z 2025-05-07T19:45:30.5023557Z 2025-05-07T19:45:30.5023663Z  2025-05-07T19:45:30.5023783Z 2025-05-07T19:45:30.5023787Z 2025-05-07T19:45:30.5023930Z  2025-05-07T19:45:30.5024056Z 2025-05-07T19:45:30.5024059Z 2025-05-07T19:45:30.5024062Z 2025-05-07T19:45:30.5024175Z  2025-05-07T19:45:30.5024302Z 2025-05-07T19:45:30.5024305Z 2025-05-07T19:45:30.5024337Z 2025-05-07T19:45:30.5024340Z 2025-05-07T19:45:30.5024457Z  2025-05-07T19:45:30.5024588Z 2025-05-07T19:45:30.5024592Z 2025-05-07T19:45:30.5024596Z 2025-05-07T19:45:30.5024599Z 2025-05-07T19:45:30.5024603Z 2025-05-07T19:45:30.5024751Z  2025-05-07T19:45:30.5024888Z 2025-05-07T19:45:30.5024892Z 2025-05-07T19:45:30.5024895Z 2025-05-07T19:45:30.5024899Z 2025-05-07T19:45:30.5024902Z 2025-05-07T19:45:30.5024905Z 2025-05-07T19:45:30.5025024Z  2025-05-07T19:45:30.5025190Z 2025-05-07T19:45:30.5025194Z 2025-05-07T19:45:30.5025197Z 2025-05-07T19:45:30.5025205Z 2025-05-07T19:45:30.5025211Z 2025-05-07T19:45:30.5025214Z 2025-05-07T19:45:30.5025218Z 2025-05-07T19:45:30.5025341Z  2025-05-07T19:45:30.5025516Z 2025-05-07T19:45:30.5025520Z 2025-05-07T19:45:30.5025523Z 2025-05-07T19:45:30.5025526Z 2025-05-07T19:45:30.5025530Z 2025-05-07T19:45:30.5025533Z 2025-05-07T19:45:30.5025536Z 2025-05-07T19:45:30.5025540Z 2025-05-07T19:45:30.5025667Z  2025-05-07T19:45:30.5025831Z 2025-05-07T19:45:30.5025835Z 2025-05-07T19:45:30.5025838Z 2025-05-07T19:45:30.5025865Z 2025-05-07T19:45:30.5025868Z 2025-05-07T19:45:30.5025871Z 2025-05-07T19:45:30.5025875Z 2025-05-07T19:45:30.5025878Z 2025-05-07T19:45:30.5025882Z 2025-05-07T19:45:30.5026011Z  2025-05-07T19:45:30.5026177Z 2025-05-07T19:45:30.5026181Z 2025-05-07T19:45:30.5026184Z 2025-05-07T19:45:30.5026187Z 2025-05-07T19:45:30.5026191Z 2025-05-07T19:45:30.5026219Z 2025-05-07T19:45:30.5026223Z 2025-05-07T19:45:30.5026226Z 2025-05-07T19:45:30.5026233Z 2025-05-07T19:45:30.5026240Z 2025-05-07T19:45:30.5026375Z  2025-05-07T19:45:30.5026550Z 2025-05-07T19:45:30.5026553Z 2025-05-07T19:45:30.5026557Z 2025-05-07T19:45:30.5026560Z 2025-05-07T19:45:30.5026564Z 2025-05-07T19:45:30.5026567Z 2025-05-07T19:45:30.5026594Z 2025-05-07T19:45:30.5026597Z 2025-05-07T19:45:30.5026601Z 2025-05-07T19:45:30.5026604Z 2025-05-07T19:45:30.5026607Z 2025-05-07T19:45:30.5026748Z  2025-05-07T19:45:30.5026936Z 2025-05-07T19:45:30.5026939Z 2025-05-07T19:45:30.5026943Z 2025-05-07T19:45:30.5026946Z 2025-05-07T19:45:30.5026949Z 2025-05-07T19:45:30.5026953Z 2025-05-07T19:45:30.5026981Z 2025-05-07T19:45:30.5026985Z 2025-05-07T19:45:30.5026988Z 2025-05-07T19:45:30.5026991Z 2025-05-07T19:45:30.5026995Z 2025-05-07T19:45:30.5026998Z 2025-05-07T19:45:30.5027141Z  2025-05-07T19:45:30.5027337Z 2025-05-07T19:45:30.5027340Z 2025-05-07T19:45:30.5027344Z 2025-05-07T19:45:30.5027350Z 2025-05-07T19:45:30.5027433Z 2025-05-07T19:45:30.5027437Z 2025-05-07T19:45:30.5027440Z 2025-05-07T19:45:30.5027444Z 2025-05-07T19:45:30.5027447Z 2025-05-07T19:45:30.5027450Z 2025-05-07T19:45:30.5027454Z 2025-05-07T19:45:30.5027457Z 2025-05-07T19:45:30.5027461Z 2025-05-07T19:45:30.5027722Z  2025-05-07T19:45:30.5027911Z 2025-05-07T19:45:30.5027914Z 2025-05-07T19:45:30.5027944Z 2025-05-07T19:45:30.5027947Z 2025-05-07T19:45:30.5027950Z 2025-05-07T19:45:30.5027953Z 2025-05-07T19:45:30.5027956Z 2025-05-07T19:45:30.5027959Z 2025-05-07T19:45:30.5027962Z 2025-05-07T19:45:30.5027965Z 2025-05-07T19:45:30.5027968Z 2025-05-07T19:45:30.5027971Z 2025-05-07T19:45:30.5027974Z 2025-05-07T19:45:30.5027977Z 2025-05-07T19:45:30.5028121Z  2025-05-07T19:45:30.5028355Z 2025-05-07T19:45:30.5028358Z 2025-05-07T19:45:30.5028361Z 2025-05-07T19:45:30.5028364Z 2025-05-07T19:45:30.5028367Z 2025-05-07T19:45:30.5028424Z 2025-05-07T19:45:30.5028430Z 2025-05-07T19:45:30.5028433Z 2025-05-07T19:45:30.5028436Z 2025-05-07T19:45:30.5028439Z 2025-05-07T19:45:30.5028443Z 2025-05-07T19:45:30.5028446Z 2025-05-07T19:45:30.5028449Z 2025-05-07T19:45:30.5028452Z 2025-05-07T19:45:30.5028455Z 2025-05-07T19:45:30.5028606Z  2025-05-07T19:45:30.5028840Z 2025-05-07T19:45:30.5028844Z 2025-05-07T19:45:30.5028847Z 2025-05-07T19:45:30.5028851Z 2025-05-07T19:45:30.5028854Z 2025-05-07T19:45:30.5028857Z 2025-05-07T19:45:30.5028861Z 2025-05-07T19:45:30.5028864Z 2025-05-07T19:45:30.5028867Z 2025-05-07T19:45:30.5028870Z 2025-05-07T19:45:30.5028873Z 2025-05-07T19:45:30.5028876Z 2025-05-07T19:45:30.5028879Z 2025-05-07T19:45:30.5028882Z 2025-05-07T19:45:30.5028885Z 2025-05-07T19:45:30.5028888Z 2025-05-07T19:45:30.5029073Z  2025-05-07T19:45:30.5029277Z 2025-05-07T19:45:30.5029281Z 2025-05-07T19:45:30.5029284Z 2025-05-07T19:45:30.5029291Z 2025-05-07T19:45:30.5029296Z 2025-05-07T19:45:30.5029300Z 2025-05-07T19:45:30.5029303Z 2025-05-07T19:45:30.5029306Z 2025-05-07T19:45:30.5029309Z 2025-05-07T19:45:30.5029312Z 2025-05-07T19:45:30.5029315Z 2025-05-07T19:45:30.5029318Z 2025-05-07T19:45:30.5029322Z 2025-05-07T19:45:30.5029351Z 2025-05-07T19:45:30.5029354Z 2025-05-07T19:45:30.5029357Z 2025-05-07T19:45:30.5029360Z 2025-05-07T19:45:30.5029516Z  2025-05-07T19:45:30.5029733Z 2025-05-07T19:45:30.5029736Z 2025-05-07T19:45:30.5029739Z 2025-05-07T19:45:30.5029743Z 2025-05-07T19:45:30.5029746Z 2025-05-07T19:45:30.5029749Z 2025-05-07T19:45:30.5029781Z 2025-05-07T19:45:30.5029784Z 2025-05-07T19:45:30.5029787Z 2025-05-07T19:45:30.5029790Z 2025-05-07T19:45:30.5029793Z 2025-05-07T19:45:30.5029797Z 2025-05-07T19:45:30.5029800Z 2025-05-07T19:45:30.5029803Z 2025-05-07T19:45:30.5029806Z 2025-05-07T19:45:30.5029809Z 2025-05-07T19:45:30.5029812Z 2025-05-07T19:45:30.5029815Z 2025-05-07T19:45:30.5029983Z  2025-05-07T19:45:30.5030234Z 2025-05-07T19:45:30.5030237Z 2025-05-07T19:45:30.5030334Z  2025-05-07T19:45:30.5030442Z 2025-05-07T19:45:30.5030445Z 2025-05-07T19:45:30.5030580Z  2025-05-07T19:45:30.5030697Z 2025-05-07T19:45:30.5030701Z 2025-05-07T19:45:30.5030704Z 2025-05-07T19:45:30.5030826Z  done 2025-05-07T19:45:30.8177296Z Preparing transaction: | / - done 2025-05-07T19:45:34.3780014Z Verifying transaction: | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | done 2025-05-07T19:45:36.8037676Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:45:37.2184220Z [INSTALL] Adding symlink librhash.so.0, which is needed by CMake ... 2025-05-07T19:45:38.8641670Z + ln -s /github/home/miniconda/envs/build_binary/lib/librhash.so /github/home/miniconda/envs/build_binary/lib/librhash.so.0 2025-05-07T19:45:38.8642747Z 2025-05-07T19:45:38.8667071Z 2025-05-07T19:45:38.8698560Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install build 2025-05-07T19:45:41.0353705Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:45:41.0355351Z 2025-05-07T19:45:41.0355496Z Collecting build 2025-05-07T19:45:41.0355877Z Downloading build-1.2.2.post1-py3-none-any.whl.metadata (6.5 kB) 2025-05-07T19:45:41.0356744Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from build) (25.0) 2025-05-07T19:45:41.0357577Z Collecting pyproject_hooks (from build) 2025-05-07T19:45:41.0358071Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl.metadata (1.3 kB) 2025-05-07T19:45:41.0359312Z Requirement already satisfied: tomli>=1.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from build) (2.2.1) 2025-05-07T19:45:41.0360032Z Downloading build-1.2.2.post1-py3-none-any.whl (22 kB) 2025-05-07T19:45:41.0360495Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl (10 kB) 2025-05-07T19:45:41.0360921Z Installing collected packages: pyproject_hooks, build 2025-05-07T19:45:41.0361204Z 2025-05-07T19:45:41.0361398Z Successfully installed build-1.2.2.post1 pyproject_hooks-1.2.0 2025-05-07T19:45:41.0361687Z 2025-05-07T19:45:42.6940895Z /github/home/miniconda/envs/build_binary/bin/make 2025-05-07T19:45:42.6941243Z 2025-05-07T19:45:42.7500836Z [CHECK] Binary make found in PATH 2025-05-07T19:45:44.3428652Z /github/home/miniconda/envs/build_binary/bin/cmake 2025-05-07T19:45:44.3430137Z 2025-05-07T19:45:44.4016605Z [CHECK] Binary cmake found in PATH 2025-05-07T19:45:45.9864061Z /github/home/miniconda/envs/build_binary/bin/ninja 2025-05-07T19:45:45.9864473Z 2025-05-07T19:45:46.0423570Z [CHECK] Binary ninja found in PATH 2025-05-07T19:45:47.7125341Z [CHECK] Python (sub-)package 'click' found ... 2025-05-07T19:45:49.5120104Z [CHECK] Python (sub-)package 'hypothesis' found ... 2025-05-07T19:45:51.2013602Z [CHECK] Python (sub-)package 'jinja2' found ... 2025-05-07T19:45:52.9711653Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:45:54.6210334Z [CHECK] Python (sub-)package 'wheel' found ... 2025-05-07T19:45:54.6211001Z [INSTALL] Successfully installed all the build tools 2025-05-07T19:45:54.6283399Z ##[group]Run . $PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:54.6283865Z . $PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:54.6284490Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:54.6284811Z env: 2025-05-07T19:45:54.6285054Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:54.6285398Z BUILD_ENV: build_binary 2025-05-07T19:45:54.6285682Z BUILD_TARGET: default 2025-05-07T19:45:54.6285919Z BUILD_VARIANT: cuda 2025-05-07T19:45:54.6286172Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:45:54.6286442Z ##[endgroup] 2025-05-07T19:45:55.0990425Z ################################################################################ 2025-05-07T19:45:55.0990897Z # Install CUDA 2025-05-07T19:45:55.0991129Z # 2025-05-07T19:45:55.1008743Z # [2025-05-07T19:45:55.100Z] + install_cuda build_binary 12.6.3 2025-05-07T19:45:55.1009470Z ################################################################################ 2025-05-07T19:45:55.1009847Z 2025-05-07T19:45:55.1021728Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:55.1892906Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:55.1894709Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:45:55.1901135Z + conda clean --packages --tarball -y 2025-05-07T19:45:55.1901572Z 2025-05-07T19:45:55.7849899Z Will remove 147 (616.0 MB) tarball(s). 2025-05-07T19:45:55.7850444Z Will remove 21 (80.4 MB) package(s). 2025-05-07T19:45:55.8422533Z 2025-05-07T19:45:55.8428152Z + conda clean --all -y 2025-05-07T19:45:55.8428390Z 2025-05-07T19:45:56.4596133Z There are no unused tarball(s) to remove. 2025-05-07T19:45:56.4597137Z Will remove 1 index cache(s). 2025-05-07T19:45:56.4598032Z There are no unused package(s) to remove. 2025-05-07T19:45:56.4598968Z There are no tempfile(s) to remove. 2025-05-07T19:45:56.4599804Z There are no logfile(s) to remove. 2025-05-07T19:45:56.5154535Z 2025-05-07T19:45:56.5168829Z [INSTALL] Installing CUDA 12.6.3 ... 2025-05-07T19:45:56.5203528Z [EXEC] [ATTEMPT 0/3] + conda install --force-reinstall -n build_binary -c conda-forge --override-channels -y cuda=12.6.3 2025-05-07T19:45:57.3840936Z Channels: 2025-05-07T19:45:57.3841705Z - conda-forge 2025-05-07T19:45:57.3842383Z Platform: linux-64 2025-05-07T19:46:07.2116915Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:46:08.7091940Z Solving environment: | / - \ done 2025-05-07T19:46:08.8482733Z 2025-05-07T19:46:08.8483315Z ## Package Plan ## 2025-05-07T19:46:08.8484079Z 2025-05-07T19:46:08.8485184Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:46:08.8486682Z 2025-05-07T19:46:08.8487135Z added / updated specs: 2025-05-07T19:46:08.8487402Z - cuda=12.6.3 2025-05-07T19:46:08.8487573Z 2025-05-07T19:46:08.8487579Z 2025-05-07T19:46:08.8487719Z The following packages will be downloaded: 2025-05-07T19:46:08.8487947Z 2025-05-07T19:46:08.8488167Z package | build 2025-05-07T19:46:08.8488665Z ---------------------------|----------------- 2025-05-07T19:46:08.8489154Z attr-2.5.1 | h166bdaf_1 69 KB conda-forge 2025-05-07T19:46:08.8489576Z binutils-2.40 | h4852527_7 31 KB conda-forge 2025-05-07T19:46:08.8490038Z c-compiler-1.5.2 | h0b41bf4_0 6 KB conda-forge 2025-05-07T19:46:08.8490498Z cuda-12.6.3 | ha804496_0 26 KB conda-forge 2025-05-07T19:46:08.8490939Z cuda-cccl_linux-64-12.6.77 | ha770c72_0 1.0 MB conda-forge 2025-05-07T19:46:08.8491481Z cuda-command-line-tools-12.6.3| ha770c72_0 20 KB conda-forge 2025-05-07T19:46:08.8491992Z cuda-compiler-12.6.3 | hbad6d8a_0 20 KB conda-forge 2025-05-07T19:46:08.8492562Z cuda-crt-dev_linux-64-12.6.85| ha770c72_0 87 KB conda-forge 2025-05-07T19:46:08.8493740Z cuda-crt-tools-12.6.85 | ha770c72_0 26 KB conda-forge 2025-05-07T19:46:08.8494270Z cuda-cudart-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:08.8494838Z cuda-cudart-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:08.8495887Z cuda-cudart-dev_linux-64-12.6.77| h3f2d84a_0 357 KB conda-forge 2025-05-07T19:46:08.8496523Z cuda-cudart-static-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:08.8497080Z cuda-cudart-static_linux-64-12.6.77| h3f2d84a_0 744 KB conda-forge 2025-05-07T19:46:08.8497662Z cuda-cudart_linux-64-12.6.77| h3f2d84a_0 184 KB conda-forge 2025-05-07T19:46:08.8498200Z cuda-cuobjdump-12.6.77 | hbd13f7d_1 241 KB conda-forge 2025-05-07T19:46:08.8498685Z cuda-cupti-12.6.80 | hbd13f7d_0 1.9 MB conda-forge 2025-05-07T19:46:08.8499193Z cuda-cupti-dev-12.6.80 | h5888daf_0 3.4 MB conda-forge 2025-05-07T19:46:08.8499797Z cuda-cuxxfilt-12.6.77 | hbd13f7d_1 211 KB conda-forge 2025-05-07T19:46:08.8500517Z cuda-driver-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:08.8501132Z cuda-driver-dev_linux-64-12.6.77| h3f2d84a_0 35 KB conda-forge 2025-05-07T19:46:08.8501687Z cuda-gdb-12.6.77 | h50b4baa_1 370 KB conda-forge 2025-05-07T19:46:08.8502203Z cuda-libraries-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:46:08.8502722Z cuda-libraries-dev-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:46:08.8503261Z cuda-nsight-12.6.77 | h7938cbb_0 113.2 MB conda-forge 2025-05-07T19:46:08.8503728Z cuda-nvcc-12.6.85 | hcdd1206_0 23 KB conda-forge 2025-05-07T19:46:08.8504246Z cuda-nvcc-dev_linux-64-12.6.85| he91c749_0 10.8 MB conda-forge 2025-05-07T19:46:08.8504787Z cuda-nvcc-impl-12.6.85 | h85509e4_0 25 KB conda-forge 2025-05-07T19:46:08.8505283Z cuda-nvcc-tools-12.6.85 | he02047a_0 23.0 MB conda-forge 2025-05-07T19:46:08.8525907Z cuda-nvcc_linux-64-12.6.85 | h04802cd_0 25 KB conda-forge 2025-05-07T19:46:08.8526487Z cuda-nvdisasm-12.6.77 | hbd13f7d_1 47.6 MB conda-forge 2025-05-07T19:46:08.8527253Z cuda-nvml-dev-12.6.77 | hbd13f7d_1 159 KB conda-forge 2025-05-07T19:46:08.8527783Z cuda-nvprof-12.6.80 | hbd13f7d_0 2.6 MB conda-forge 2025-05-07T19:46:08.8528284Z cuda-nvprune-12.6.77 | hbd13f7d_1 66 KB conda-forge 2025-05-07T19:46:08.8528907Z cuda-nvrtc-12.6.85 | hbd13f7d_0 17.3 MB conda-forge 2025-05-07T19:46:08.8529397Z cuda-nvrtc-dev-12.6.85 | h5888daf_0 31 KB conda-forge 2025-05-07T19:46:08.8529856Z cuda-nvtx-12.6.77 | hbd13f7d_0 31 KB conda-forge 2025-05-07T19:46:08.8530355Z cuda-nvvm-dev_linux-64-12.6.85| ha770c72_0 25 KB conda-forge 2025-05-07T19:46:08.8530845Z cuda-nvvm-impl-12.6.85 | he02047a_0 7.7 MB conda-forge 2025-05-07T19:46:08.8531342Z cuda-nvvm-tools-12.6.85 | he02047a_0 10.4 MB conda-forge 2025-05-07T19:46:08.8531796Z cuda-nvvp-12.6.80 | hbd13f7d_1 109.3 MB conda-forge 2025-05-07T19:46:08.8532266Z cuda-opencl-12.6.77 | hbd13f7d_0 29 KB conda-forge 2025-05-07T19:46:08.8532760Z cuda-opencl-dev-12.6.77 | h5888daf_0 93 KB conda-forge 2025-05-07T19:46:08.8533243Z cuda-profiler-api-12.6.77 | h7938cbb_0 22 KB conda-forge 2025-05-07T19:46:08.8533745Z cuda-runtime-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:46:08.8534225Z cuda-sanitizer-api-12.6.77 | hbd13f7d_1 8.9 MB conda-forge 2025-05-07T19:46:08.8534910Z cuda-toolkit-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:46:08.8535380Z cuda-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:46:08.8535816Z cuda-version-12.6 | h7480c83_3 20 KB conda-forge 2025-05-07T19:46:08.8536294Z cuda-visual-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:46:08.8536767Z cxx-compiler-1.5.2 | hf52228f_0 6 KB conda-forge 2025-05-07T19:46:08.8537214Z dbus-1.13.6 | h5008d03_3 604 KB conda-forge 2025-05-07T19:46:08.8537615Z expat-2.7.0 | h5888daf_0 137 KB conda-forge 2025-05-07T19:46:08.8538015Z gcc-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:46:08.8538451Z gds-tools-1.11.1.6 | h5888daf_4 37.8 MB conda-forge 2025-05-07T19:46:08.8538860Z gmp-6.3.0 | hac33072_2 449 KB conda-forge 2025-05-07T19:46:08.8539264Z gxx-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:46:08.8539778Z libcap-2.75 | h39aace5_0 118 KB conda-forge 2025-05-07T19:46:08.8540427Z libcublas-12.6.4.1 | h5888daf_1 256.2 MB conda-forge 2025-05-07T19:46:08.8540936Z libcublas-dev-12.6.4.1 | h5888daf_1 88 KB conda-forge 2025-05-07T19:46:08.8541421Z libcufft-11.3.0.4 | hbd13f7d_0 156.2 MB conda-forge 2025-05-07T19:46:08.8541913Z libcufft-dev-11.3.0.4 | h5888daf_0 33 KB conda-forge 2025-05-07T19:46:08.8542383Z libcufile-1.11.1.6 | h12f29b5_4 900 KB conda-forge 2025-05-07T19:46:08.8542908Z libcufile-dev-1.11.1.6 | h5888daf_4 35 KB conda-forge 2025-05-07T19:46:08.8543392Z libcurand-10.3.7.77 | hbd13f7d_0 39.9 MB conda-forge 2025-05-07T19:46:08.8543903Z libcurand-dev-10.3.7.77 | h5888daf_0 262 KB conda-forge 2025-05-07T19:46:08.8544391Z libcusolver-11.7.1.2 | h5888daf_1 95.8 MB conda-forge 2025-05-07T19:46:08.8544918Z libcusolver-dev-11.7.1.2 | h5888daf_1 59 KB conda-forge 2025-05-07T19:46:08.8545441Z libcusparse-12.5.4.2 | hbd13f7d_0 118.6 MB conda-forge 2025-05-07T19:46:08.8545945Z libcusparse-dev-12.5.4.2 | h5888daf_0 51 KB conda-forge 2025-05-07T19:46:08.8546551Z libgcrypt-lib-1.11.0 | hb9d3cd8_2 572 KB conda-forge 2025-05-07T19:46:08.8547038Z libgpg-error-1.55 | h3f2d84a_0 305 KB conda-forge 2025-05-07T19:46:08.8547530Z libnl-3.11.0 | hb9d3cd8_0 724 KB conda-forge 2025-05-07T19:46:08.8548079Z libnpp-12.3.1.54 | h5888daf_0 93.4 MB conda-forge 2025-05-07T19:46:08.8548559Z libnpp-dev-12.3.1.54 | h5888daf_0 441 KB conda-forge 2025-05-07T19:46:08.8549013Z libnuma-2.0.18 | h4ab18f5_2 42 KB conda-forge 2025-05-07T19:46:08.8549502Z libnvfatbin-12.6.77 | hbd13f7d_0 783 KB conda-forge 2025-05-07T19:46:08.8550006Z libnvfatbin-dev-12.6.77 | h5888daf_0 26 KB conda-forge 2025-05-07T19:46:08.8550541Z libnvjitlink-12.6.85 | hbd13f7d_0 14.9 MB conda-forge 2025-05-07T19:46:08.8551083Z libnvjitlink-dev-12.6.85 | h5888daf_0 25 KB conda-forge 2025-05-07T19:46:08.8551578Z libnvjpeg-12.3.3.54 | h5888daf_0 2.4 MB conda-forge 2025-05-07T19:46:08.8552084Z libnvjpeg-dev-12.3.3.54 | ha770c72_0 31 KB conda-forge 2025-05-07T19:46:08.8552678Z libsystemd0-257.4 | h4e0b6ca_1 477 KB conda-forge 2025-05-07T19:46:08.8553136Z libudev1-257.4 | hbe16f8c_1 141 KB conda-forge 2025-05-07T19:46:08.8553638Z libxkbcommon-1.7.0 | h2c5496b_1 579 KB conda-forge 2025-05-07T19:46:08.8554083Z libxkbfile-1.1.0 | h166bdaf_1 111 KB conda-forge 2025-05-07T19:46:08.8554504Z lz4-c-1.10.0 | h5888daf_1 163 KB conda-forge 2025-05-07T19:46:08.8554936Z nsight-compute-2024.3.2.3 | hb5ebaad_0 443.1 MB conda-forge 2025-05-07T19:46:08.8555391Z nspr-4.36 | h5888daf_0 225 KB conda-forge 2025-05-07T19:46:08.8555779Z nss-3.111 | h159eef7_0 1.9 MB conda-forge 2025-05-07T19:46:08.8556193Z ocl-icd-2.3.3 | hb9d3cd8_0 104 KB conda-forge 2025-05-07T19:46:08.8556669Z opencl-headers-2024.10.24 | h5888daf_0 53 KB conda-forge 2025-05-07T19:46:08.8557125Z rdma-core-57.0 | h5888daf_0 1.2 MB conda-forge 2025-05-07T19:46:08.8557563Z wayland-1.23.1 | h3e06ad9_0 314 KB conda-forge 2025-05-07T19:46:08.8557983Z xcb-util-0.4.1 | hb711507_2 19 KB conda-forge 2025-05-07T19:46:08.8558449Z xcb-util-cursor-0.1.5 | hb9d3cd8_0 20 KB conda-forge 2025-05-07T19:46:08.8558906Z xcb-util-image-0.4.0 | hb711507_2 24 KB conda-forge 2025-05-07T19:46:08.8559390Z xcb-util-keysyms-0.4.1 | hb711507_0 14 KB conda-forge 2025-05-07T19:46:08.8559899Z xcb-util-renderutil-0.3.10 | hb711507_0 17 KB conda-forge 2025-05-07T19:46:08.8560355Z xcb-util-wm-0.4.2 | hb711507_0 50 KB conda-forge 2025-05-07T19:46:08.8560828Z xkeyboard-config-2.44 | hb9d3cd8_0 384 KB conda-forge 2025-05-07T19:46:08.8561317Z xorg-libxcomposite-0.4.6 | hb9d3cd8_2 13 KB conda-forge 2025-05-07T19:46:08.8561824Z xorg-libxdamage-1.1.6 | hb9d3cd8_0 13 KB conda-forge 2025-05-07T19:46:08.8562250Z ------------------------------------------------------------ 2025-05-07T19:46:08.8562619Z Total: 1.59 GB 2025-05-07T19:46:08.8562830Z 2025-05-07T19:46:08.8562986Z The following NEW packages will be INSTALLED: 2025-05-07T19:46:08.8563213Z 2025-05-07T19:46:08.8563397Z attr conda-forge/linux-64::attr-2.5.1-h166bdaf_1 2025-05-07T19:46:08.8563844Z binutils conda-forge/linux-64::binutils-2.40-h4852527_7 2025-05-07T19:46:08.8564387Z c-compiler conda-forge/linux-64::c-compiler-1.5.2-h0b41bf4_0 2025-05-07T19:46:08.8564857Z cuda conda-forge/noarch::cuda-12.6.3-ha804496_0 2025-05-07T19:46:08.8565370Z cuda-cccl_linux-64 conda-forge/noarch::cuda-cccl_linux-64-12.6.77-ha770c72_0 2025-05-07T19:46:08.8565986Z cuda-command-line~ conda-forge/linux-64::cuda-command-line-tools-12.6.3-ha770c72_0 2025-05-07T19:46:08.8566613Z cuda-compiler conda-forge/noarch::cuda-compiler-12.6.3-hbad6d8a_0 2025-05-07T19:46:08.8567183Z cuda-crt-dev_linu~ conda-forge/noarch::cuda-crt-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:46:08.8567778Z cuda-crt-tools conda-forge/linux-64::cuda-crt-tools-12.6.85-ha770c72_0 2025-05-07T19:46:08.8568329Z cuda-cudart conda-forge/linux-64::cuda-cudart-12.6.77-h5888daf_0 2025-05-07T19:46:08.8568855Z cuda-cudart-dev conda-forge/linux-64::cuda-cudart-dev-12.6.77-h5888daf_0 2025-05-07T19:46:08.8569465Z cuda-cudart-dev_l~ conda-forge/noarch::cuda-cudart-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:08.8570091Z cuda-cudart-static conda-forge/linux-64::cuda-cudart-static-12.6.77-h5888daf_0 2025-05-07T19:46:08.8570743Z cuda-cudart-stati~ conda-forge/noarch::cuda-cudart-static_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:08.8571386Z cuda-cudart_linux~ conda-forge/noarch::cuda-cudart_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:08.8571958Z cuda-cuobjdump conda-forge/linux-64::cuda-cuobjdump-12.6.77-hbd13f7d_1 2025-05-07T19:46:08.8572504Z cuda-cupti conda-forge/linux-64::cuda-cupti-12.6.80-hbd13f7d_0 2025-05-07T19:46:08.8573128Z cuda-cupti-dev conda-forge/linux-64::cuda-cupti-dev-12.6.80-h5888daf_0 2025-05-07T19:46:08.8573712Z cuda-cuxxfilt conda-forge/linux-64::cuda-cuxxfilt-12.6.77-hbd13f7d_1 2025-05-07T19:46:08.8574293Z cuda-driver-dev conda-forge/linux-64::cuda-driver-dev-12.6.77-h5888daf_0 2025-05-07T19:46:08.8574881Z cuda-driver-dev_l~ conda-forge/noarch::cuda-driver-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:08.8575456Z cuda-gdb conda-forge/linux-64::cuda-gdb-12.6.77-h50b4baa_1 2025-05-07T19:46:08.8575953Z cuda-libraries conda-forge/linux-64::cuda-libraries-12.6.3-ha770c72_0 2025-05-07T19:46:08.8576736Z cuda-libraries-dev conda-forge/linux-64::cuda-libraries-dev-12.6.3-ha770c72_0 2025-05-07T19:46:08.8577348Z cuda-nsight conda-forge/linux-64::cuda-nsight-12.6.77-h7938cbb_0 2025-05-07T19:46:08.8577869Z cuda-nvcc conda-forge/linux-64::cuda-nvcc-12.6.85-hcdd1206_0 2025-05-07T19:46:08.8578450Z cuda-nvcc-dev_lin~ conda-forge/noarch::cuda-nvcc-dev_linux-64-12.6.85-he91c749_0 2025-05-07T19:46:08.8579069Z cuda-nvcc-impl conda-forge/linux-64::cuda-nvcc-impl-12.6.85-h85509e4_0 2025-05-07T19:46:08.8579773Z cuda-nvcc-tools conda-forge/linux-64::cuda-nvcc-tools-12.6.85-he02047a_0 2025-05-07T19:46:08.8580624Z cuda-nvcc_linux-64 conda-forge/linux-64::cuda-nvcc_linux-64-12.6.85-h04802cd_0 2025-05-07T19:46:08.8581225Z cuda-nvdisasm conda-forge/linux-64::cuda-nvdisasm-12.6.77-hbd13f7d_1 2025-05-07T19:46:08.8581837Z cuda-nvml-dev conda-forge/linux-64::cuda-nvml-dev-12.6.77-hbd13f7d_1 2025-05-07T19:46:08.8582399Z cuda-nvprof conda-forge/linux-64::cuda-nvprof-12.6.80-hbd13f7d_0 2025-05-07T19:46:08.8582982Z cuda-nvprune conda-forge/linux-64::cuda-nvprune-12.6.77-hbd13f7d_1 2025-05-07T19:46:08.8583547Z cuda-nvrtc conda-forge/linux-64::cuda-nvrtc-12.6.85-hbd13f7d_0 2025-05-07T19:46:08.8584094Z cuda-nvrtc-dev conda-forge/linux-64::cuda-nvrtc-dev-12.6.85-h5888daf_0 2025-05-07T19:46:08.8584674Z cuda-nvtx conda-forge/linux-64::cuda-nvtx-12.6.77-hbd13f7d_0 2025-05-07T19:46:08.8585241Z cuda-nvvm-dev_lin~ conda-forge/noarch::cuda-nvvm-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:46:08.8585879Z cuda-nvvm-impl conda-forge/linux-64::cuda-nvvm-impl-12.6.85-he02047a_0 2025-05-07T19:46:08.8586499Z cuda-nvvm-tools conda-forge/linux-64::cuda-nvvm-tools-12.6.85-he02047a_0 2025-05-07T19:46:08.8587129Z cuda-nvvp conda-forge/linux-64::cuda-nvvp-12.6.80-hbd13f7d_1 2025-05-07T19:46:08.8587678Z cuda-opencl conda-forge/linux-64::cuda-opencl-12.6.77-hbd13f7d_0 2025-05-07T19:46:08.8588243Z cuda-opencl-dev conda-forge/linux-64::cuda-opencl-dev-12.6.77-h5888daf_0 2025-05-07T19:46:08.8588887Z cuda-profiler-api conda-forge/linux-64::cuda-profiler-api-12.6.77-h7938cbb_0 2025-05-07T19:46:08.8589499Z cuda-runtime conda-forge/noarch::cuda-runtime-12.6.3-ha804496_0 2025-05-07T19:46:08.8590096Z cuda-sanitizer-api conda-forge/linux-64::cuda-sanitizer-api-12.6.77-hbd13f7d_1 2025-05-07T19:46:08.8590719Z cuda-toolkit conda-forge/noarch::cuda-toolkit-12.6.3-ha804496_0 2025-05-07T19:46:08.8591240Z cuda-tools conda-forge/linux-64::cuda-tools-12.6.3-ha770c72_0 2025-05-07T19:46:08.8591780Z cuda-version conda-forge/noarch::cuda-version-12.6-h7480c83_3 2025-05-07T19:46:08.8592498Z cuda-visual-tools conda-forge/linux-64::cuda-visual-tools-12.6.3-ha770c72_0 2025-05-07T19:46:08.8593080Z cxx-compiler conda-forge/linux-64::cxx-compiler-1.5.2-hf52228f_0 2025-05-07T19:46:08.8593585Z dbus conda-forge/linux-64::dbus-1.13.6-h5008d03_3 2025-05-07T19:46:08.8594009Z expat conda-forge/linux-64::expat-2.7.0-h5888daf_0 2025-05-07T19:46:08.8594453Z gcc conda-forge/linux-64::gcc-11.4.0-h602e360_13 2025-05-07T19:46:08.8594932Z gds-tools conda-forge/linux-64::gds-tools-1.11.1.6-h5888daf_4 2025-05-07T19:46:08.8595385Z gmp conda-forge/linux-64::gmp-6.3.0-hac33072_2 2025-05-07T19:46:08.8595888Z gxx conda-forge/linux-64::gxx-11.4.0-h602e360_13 2025-05-07T19:46:08.8596315Z libcap conda-forge/linux-64::libcap-2.75-h39aace5_0 2025-05-07T19:46:08.8596804Z libcublas conda-forge/linux-64::libcublas-12.6.4.1-h5888daf_1 2025-05-07T19:46:08.8597359Z libcublas-dev conda-forge/linux-64::libcublas-dev-12.6.4.1-h5888daf_1 2025-05-07T19:46:08.8597885Z libcufft conda-forge/linux-64::libcufft-11.3.0.4-hbd13f7d_0 2025-05-07T19:46:08.8598435Z libcufft-dev conda-forge/linux-64::libcufft-dev-11.3.0.4-h5888daf_0 2025-05-07T19:46:08.8598963Z libcufile conda-forge/linux-64::libcufile-1.11.1.6-h12f29b5_4 2025-05-07T19:46:08.8599529Z libcufile-dev conda-forge/linux-64::libcufile-dev-1.11.1.6-h5888daf_4 2025-05-07T19:46:08.8600094Z libcurand conda-forge/linux-64::libcurand-10.3.7.77-hbd13f7d_0 2025-05-07T19:46:08.8600637Z libcurand-dev conda-forge/linux-64::libcurand-dev-10.3.7.77-h5888daf_0 2025-05-07T19:46:08.8601216Z libcusolver conda-forge/linux-64::libcusolver-11.7.1.2-h5888daf_1 2025-05-07T19:46:08.8601784Z libcusolver-dev conda-forge/linux-64::libcusolver-dev-11.7.1.2-h5888daf_1 2025-05-07T19:46:08.8602377Z libcusparse conda-forge/linux-64::libcusparse-12.5.4.2-hbd13f7d_0 2025-05-07T19:46:08.8602978Z libcusparse-dev conda-forge/linux-64::libcusparse-dev-12.5.4.2-h5888daf_0 2025-05-07T19:46:08.8603558Z libgcrypt-lib conda-forge/linux-64::libgcrypt-lib-1.11.0-hb9d3cd8_2 2025-05-07T19:46:08.8604121Z libgpg-error conda-forge/linux-64::libgpg-error-1.55-h3f2d84a_0 2025-05-07T19:46:08.8604599Z libnl conda-forge/linux-64::libnl-3.11.0-hb9d3cd8_0 2025-05-07T19:46:08.8605070Z libnpp conda-forge/linux-64::libnpp-12.3.1.54-h5888daf_0 2025-05-07T19:46:08.8605704Z libnpp-dev conda-forge/linux-64::libnpp-dev-12.3.1.54-h5888daf_0 2025-05-07T19:46:08.8606179Z libnuma conda-forge/linux-64::libnuma-2.0.18-h4ab18f5_2 2025-05-07T19:46:08.8606678Z libnvfatbin conda-forge/linux-64::libnvfatbin-12.6.77-hbd13f7d_0 2025-05-07T19:46:08.8607217Z libnvfatbin-dev conda-forge/linux-64::libnvfatbin-dev-12.6.77-h5888daf_0 2025-05-07T19:46:08.8607787Z libnvjitlink conda-forge/linux-64::libnvjitlink-12.6.85-hbd13f7d_0 2025-05-07T19:46:08.8608363Z libnvjitlink-dev conda-forge/linux-64::libnvjitlink-dev-12.6.85-h5888daf_0 2025-05-07T19:46:08.8608968Z libnvjpeg conda-forge/linux-64::libnvjpeg-12.3.3.54-h5888daf_0 2025-05-07T19:46:08.8609511Z libnvjpeg-dev conda-forge/linux-64::libnvjpeg-dev-12.3.3.54-ha770c72_0 2025-05-07T19:46:08.8610031Z libsystemd0 conda-forge/linux-64::libsystemd0-257.4-h4e0b6ca_1 2025-05-07T19:46:08.8610726Z libudev1 conda-forge/linux-64::libudev1-257.4-hbe16f8c_1 2025-05-07T19:46:08.8611257Z libxkbcommon conda-forge/linux-64::libxkbcommon-1.7.0-h2c5496b_1 2025-05-07T19:46:08.8611784Z libxkbfile conda-forge/linux-64::libxkbfile-1.1.0-h166bdaf_1 2025-05-07T19:46:08.8612268Z lz4-c conda-forge/linux-64::lz4-c-1.10.0-h5888daf_1 2025-05-07T19:46:08.8612779Z nsight-compute conda-forge/linux-64::nsight-compute-2024.3.2.3-hb5ebaad_0 2025-05-07T19:46:08.8613311Z nspr conda-forge/linux-64::nspr-4.36-h5888daf_0 2025-05-07T19:46:08.8613891Z nss conda-forge/linux-64::nss-3.111-h159eef7_0 2025-05-07T19:46:08.8614309Z ocl-icd conda-forge/linux-64::ocl-icd-2.3.3-hb9d3cd8_0 2025-05-07T19:46:08.8614851Z opencl-headers conda-forge/linux-64::opencl-headers-2024.10.24-h5888daf_0 2025-05-07T19:46:08.8615383Z rdma-core conda-forge/linux-64::rdma-core-57.0-h5888daf_0 2025-05-07T19:46:08.8616054Z wayland conda-forge/linux-64::wayland-1.23.1-h3e06ad9_0 2025-05-07T19:46:08.8616522Z xcb-util conda-forge/linux-64::xcb-util-0.4.1-hb711507_2 2025-05-07T19:46:08.8617150Z xcb-util-cursor conda-forge/linux-64::xcb-util-cursor-0.1.5-hb9d3cd8_0 2025-05-07T19:46:08.8617758Z xcb-util-image conda-forge/linux-64::xcb-util-image-0.4.0-hb711507_2 2025-05-07T19:46:08.8618345Z xcb-util-keysyms conda-forge/linux-64::xcb-util-keysyms-0.4.1-hb711507_0 2025-05-07T19:46:08.8618994Z xcb-util-renderut~ conda-forge/linux-64::xcb-util-renderutil-0.3.10-hb711507_0 2025-05-07T19:46:08.8619661Z xcb-util-wm conda-forge/linux-64::xcb-util-wm-0.4.2-hb711507_0 2025-05-07T19:46:08.8620251Z xkeyboard-config conda-forge/linux-64::xkeyboard-config-2.44-hb9d3cd8_0 2025-05-07T19:46:08.8620922Z xorg-libxcomposite conda-forge/linux-64::xorg-libxcomposite-0.4.6-hb9d3cd8_2 2025-05-07T19:46:08.8621558Z xorg-libxdamage conda-forge/linux-64::xorg-libxdamage-1.1.6-hb9d3cd8_0 2025-05-07T19:46:08.8621943Z 2025-05-07T19:46:08.8622160Z 2025-05-07T19:46:08.8622168Z 2025-05-07T19:46:08.8622334Z Downloading and Extracting Packages: ...working... 2025-05-07T19:46:08.8622779Z nsight-compute-2024. | 443.1 MB | | 0% 2025-05-07T19:46:08.8623040Z 2025-05-07T19:46:08.8623380Z libcublas-12.6.4.1 | 256.2 MB | | 0%  2025-05-07T19:46:08.8623642Z 2025-05-07T19:46:08.8623673Z 2025-05-07T19:46:08.8623901Z libcufft-11.3.0.4 | 156.2 MB | | 0%  2025-05-07T19:46:08.8624166Z 2025-05-07T19:46:08.8624170Z 2025-05-07T19:46:08.8624173Z 2025-05-07T19:46:08.8624442Z libcusparse-12.5.4.2 | 118.6 MB | | 0%  2025-05-07T19:46:08.8624732Z 2025-05-07T19:46:08.8624736Z 2025-05-07T19:46:08.8624740Z 2025-05-07T19:46:08.8624743Z 2025-05-07T19:46:08.8636345Z cuda-nsight-12.6.77 | 113.2 MB | | 0%  2025-05-07T19:46:08.8637224Z 2025-05-07T19:46:08.8637235Z 2025-05-07T19:46:08.8637246Z 2025-05-07T19:46:08.8637256Z 2025-05-07T19:46:08.8637266Z 2025-05-07T19:46:08.8637966Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:46:08.8638765Z 2025-05-07T19:46:08.8638808Z 2025-05-07T19:46:08.8638818Z 2025-05-07T19:46:08.8638828Z 2025-05-07T19:46:08.8638859Z 2025-05-07T19:46:08.8638870Z 2025-05-07T19:46:08.8639629Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:46:08.8640489Z 2025-05-07T19:46:08.8640500Z 2025-05-07T19:46:08.8640510Z 2025-05-07T19:46:08.8640520Z 2025-05-07T19:46:08.8640531Z 2025-05-07T19:46:08.8640578Z 2025-05-07T19:46:08.8640589Z 2025-05-07T19:46:08.8641320Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:46:08.8642438Z 2025-05-07T19:46:08.8642441Z 2025-05-07T19:46:08.8642445Z 2025-05-07T19:46:08.8642449Z 2025-05-07T19:46:08.8642452Z 2025-05-07T19:46:08.8642484Z 2025-05-07T19:46:08.8642487Z 2025-05-07T19:46:08.8642491Z 2025-05-07T19:46:08.8642774Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:46:08.8643089Z 2025-05-07T19:46:08.8643092Z 2025-05-07T19:46:08.8643096Z 2025-05-07T19:46:08.8643100Z 2025-05-07T19:46:08.8643103Z 2025-05-07T19:46:08.8643106Z 2025-05-07T19:46:08.8643117Z 2025-05-07T19:46:08.8643149Z 2025-05-07T19:46:08.8643152Z 2025-05-07T19:46:08.8643427Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:46:08.8643731Z 2025-05-07T19:46:08.8643734Z 2025-05-07T19:46:08.8643738Z 2025-05-07T19:46:08.8643752Z 2025-05-07T19:46:08.8643755Z 2025-05-07T19:46:08.8643759Z 2025-05-07T19:46:08.8643762Z 2025-05-07T19:46:08.8643792Z 2025-05-07T19:46:08.8643800Z 2025-05-07T19:46:08.8643804Z 2025-05-07T19:46:08.8644552Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:46:08.8644854Z 2025-05-07T19:46:08.8644869Z 2025-05-07T19:46:08.8644875Z 2025-05-07T19:46:08.8644878Z 2025-05-07T19:46:08.8644882Z 2025-05-07T19:46:08.8644910Z 2025-05-07T19:46:08.8644913Z 2025-05-07T19:46:08.8644917Z 2025-05-07T19:46:08.8644927Z 2025-05-07T19:46:08.8644930Z 2025-05-07T19:46:08.8644934Z 2025-05-07T19:46:08.8646755Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:46:08.8647190Z 2025-05-07T19:46:08.8647197Z 2025-05-07T19:46:08.8647200Z 2025-05-07T19:46:08.8647204Z 2025-05-07T19:46:08.8647207Z 2025-05-07T19:46:08.8647211Z 2025-05-07T19:46:08.8647214Z 2025-05-07T19:46:08.8647218Z 2025-05-07T19:46:08.8647241Z 2025-05-07T19:46:08.8647246Z 2025-05-07T19:46:08.8647250Z 2025-05-07T19:46:08.8647270Z 2025-05-07T19:46:08.8647569Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%  2025-05-07T19:46:08.8647905Z 2025-05-07T19:46:08.8647909Z 2025-05-07T19:46:08.8647912Z 2025-05-07T19:46:08.8647915Z 2025-05-07T19:46:08.8647919Z 2025-05-07T19:46:08.8647939Z 2025-05-07T19:46:08.8647942Z 2025-05-07T19:46:08.8647945Z 2025-05-07T19:46:08.8647949Z 2025-05-07T19:46:08.8647952Z 2025-05-07T19:46:08.8647956Z 2025-05-07T19:46:08.8647959Z 2025-05-07T19:46:08.8647962Z 2025-05-07T19:46:08.8648846Z libnvjitlink-12.6.85 | 14.9 MB | | 0%  2025-05-07T19:46:08.8649182Z 2025-05-07T19:46:08.8649186Z 2025-05-07T19:46:08.8649189Z 2025-05-07T19:46:08.8649192Z 2025-05-07T19:46:08.8649196Z 2025-05-07T19:46:08.8649199Z 2025-05-07T19:46:08.8649202Z 2025-05-07T19:46:08.8649206Z 2025-05-07T19:46:08.8649209Z 2025-05-07T19:46:08.8649213Z 2025-05-07T19:46:08.8649216Z 2025-05-07T19:46:08.8649223Z 2025-05-07T19:46:08.8649227Z 2025-05-07T19:46:08.8649230Z 2025-05-07T19:46:08.8649976Z cuda-nvcc-dev_linux- | 10.8 MB | | 0%  2025-05-07T19:46:08.8650330Z 2025-05-07T19:46:08.8650333Z 2025-05-07T19:46:08.8650337Z 2025-05-07T19:46:08.8650340Z 2025-05-07T19:46:08.8650343Z 2025-05-07T19:46:08.8650347Z 2025-05-07T19:46:08.8650350Z 2025-05-07T19:46:08.8650353Z 2025-05-07T19:46:08.8650357Z 2025-05-07T19:46:08.8650360Z 2025-05-07T19:46:08.8650363Z 2025-05-07T19:46:08.8650367Z 2025-05-07T19:46:08.8650370Z 2025-05-07T19:46:08.8650373Z 2025-05-07T19:46:08.8650377Z 2025-05-07T19:46:08.8651055Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%  2025-05-07T19:46:08.8651379Z 2025-05-07T19:46:08.8651391Z 2025-05-07T19:46:08.8651395Z 2025-05-07T19:46:08.8651400Z 2025-05-07T19:46:08.8651403Z 2025-05-07T19:46:08.8651407Z 2025-05-07T19:46:08.8651410Z 2025-05-07T19:46:08.8651413Z 2025-05-07T19:46:08.8651431Z 2025-05-07T19:46:08.8651435Z 2025-05-07T19:46:08.8651438Z 2025-05-07T19:46:08.8651572Z 2025-05-07T19:46:08.8651576Z 2025-05-07T19:46:08.8651579Z 2025-05-07T19:46:08.8651583Z 2025-05-07T19:46:08.8651586Z 2025-05-07T19:46:08.8652087Z cuda-sanitizer-api-1 | 8.9 MB | | 0%  2025-05-07T19:46:08.8652425Z 2025-05-07T19:46:08.8652454Z 2025-05-07T19:46:08.8652458Z 2025-05-07T19:46:08.8652461Z 2025-05-07T19:46:08.8652465Z 2025-05-07T19:46:08.8652468Z 2025-05-07T19:46:08.8652471Z 2025-05-07T19:46:08.8652475Z 2025-05-07T19:46:08.8652478Z 2025-05-07T19:46:08.8652482Z 2025-05-07T19:46:08.8652485Z 2025-05-07T19:46:08.8652493Z 2025-05-07T19:46:08.8652496Z 2025-05-07T19:46:08.8652500Z 2025-05-07T19:46:08.8652503Z 2025-05-07T19:46:08.8652507Z 2025-05-07T19:46:08.8652510Z 2025-05-07T19:46:08.8653860Z cuda-nvvm-impl-12.6. | 7.7 MB | | 0%  2025-05-07T19:46:08.8654180Z 2025-05-07T19:46:08.8654184Z 2025-05-07T19:46:08.8654197Z 2025-05-07T19:46:08.8654206Z 2025-05-07T19:46:08.8654209Z 2025-05-07T19:46:08.8654213Z 2025-05-07T19:46:08.8654216Z 2025-05-07T19:46:08.8654220Z 2025-05-07T19:46:08.8654223Z 2025-05-07T19:46:08.8654226Z 2025-05-07T19:46:08.8654230Z 2025-05-07T19:46:08.8654233Z 2025-05-07T19:46:08.8654237Z 2025-05-07T19:46:08.8654240Z 2025-05-07T19:46:08.8654257Z 2025-05-07T19:46:08.8654260Z 2025-05-07T19:46:08.8654263Z 2025-05-07T19:46:08.8654267Z 2025-05-07T19:46:08.8654853Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%  2025-05-07T19:46:08.8655191Z 2025-05-07T19:46:08.8655265Z 2025-05-07T19:46:08.8655287Z 2025-05-07T19:46:08.8655290Z 2025-05-07T19:46:08.8655294Z 2025-05-07T19:46:08.8655297Z 2025-05-07T19:46:08.8655300Z 2025-05-07T19:46:08.8655304Z 2025-05-07T19:46:08.8655307Z 2025-05-07T19:46:08.8655310Z 2025-05-07T19:46:08.8655314Z 2025-05-07T19:46:08.8655318Z 2025-05-07T19:46:08.8655321Z 2025-05-07T19:46:08.8655324Z 2025-05-07T19:46:08.8655328Z 2025-05-07T19:46:08.8655335Z 2025-05-07T19:46:08.8655339Z 2025-05-07T19:46:08.8655342Z 2025-05-07T19:46:08.8655346Z 2025-05-07T19:46:08.9589705Z ... (more hidden) ... 2025-05-07T19:46:08.9590040Z 2025-05-07T19:46:08.9593434Z libcublas-12.6.4.1 | 256.2 MB | | 0%  2025-05-07T19:46:08.9596154Z nsight-compute-2024. | 443.1 MB | | 0% 2025-05-07T19:46:08.9596405Z 2025-05-07T19:46:08.9597186Z 2025-05-07T19:46:08.9653483Z libcufft-11.3.0.4 | 156.2 MB | 2 | 2%  2025-05-07T19:46:08.9653776Z 2025-05-07T19:46:08.9653805Z 2025-05-07T19:46:08.9653811Z 2025-05-07T19:46:08.9653815Z 2025-05-07T19:46:08.9936340Z cuda-nsight-12.6.77 | 113.2 MB | 1 | 1%  2025-05-07T19:46:08.9936679Z 2025-05-07T19:46:08.9936684Z 2025-05-07T19:46:08.9936800Z 2025-05-07T19:46:09.0591732Z libcusparse-12.5.4.2 | 118.6 MB | | 0%  2025-05-07T19:46:09.0592640Z 2025-05-07T19:46:09.0597162Z libcublas-12.6.4.1 | 256.2 MB | 3 | 3%  2025-05-07T19:46:09.0598958Z nsight-compute-2024. | 443.1 MB | 1 | 1% 2025-05-07T19:46:09.0599730Z 2025-05-07T19:46:09.0599753Z 2025-05-07T19:46:09.0653812Z libcufft-11.3.0.4 | 156.2 MB | 5 | 5%  2025-05-07T19:46:09.0654112Z 2025-05-07T19:46:09.0654304Z 2025-05-07T19:46:09.0654313Z 2025-05-07T19:46:09.0654319Z 2025-05-07T19:46:09.0939731Z cuda-nsight-12.6.77 | 113.2 MB | 5 | 5%  2025-05-07T19:46:09.0940088Z 2025-05-07T19:46:09.0940094Z 2025-05-07T19:46:09.0940099Z 2025-05-07T19:46:09.1594967Z libcusparse-12.5.4.2 | 118.6 MB | 4 | 4%  2025-05-07T19:46:09.1600574Z nsight-compute-2024. | 443.1 MB | 2 | 3% 2025-05-07T19:46:09.1600869Z 2025-05-07T19:46:09.1600874Z 2025-05-07T19:46:09.1657479Z libcufft-11.3.0.4 | 156.2 MB | # | 10%  2025-05-07T19:46:09.1657815Z 2025-05-07T19:46:09.1657821Z 2025-05-07T19:46:09.1657826Z 2025-05-07T19:46:09.1658099Z 2025-05-07T19:46:09.1952069Z cuda-nsight-12.6.77 | 113.2 MB | # | 10%  2025-05-07T19:46:09.1952982Z 2025-05-07T19:46:09.1952998Z 2025-05-07T19:46:09.1953040Z 2025-05-07T19:46:09.2301875Z libcusparse-12.5.4.2 | 118.6 MB | #1 | 11%  2025-05-07T19:46:09.2302203Z 2025-05-07T19:46:09.2602111Z libcublas-12.6.4.1 | 256.2 MB | 5 | 5%  2025-05-07T19:46:09.2602772Z 2025-05-07T19:46:09.2602795Z 2025-05-07T19:46:09.2720703Z libcufft-11.3.0.4 | 156.2 MB | #6 | 17%  2025-05-07T19:46:09.2721027Z 2025-05-07T19:46:09.2721053Z 2025-05-07T19:46:09.2721061Z 2025-05-07T19:46:09.2721093Z 2025-05-07T19:46:09.2951676Z cuda-nsight-12.6.77 | 113.2 MB | #4 | 14%  2025-05-07T19:46:09.2952007Z 2025-05-07T19:46:09.2952013Z 2025-05-07T19:46:09.2952039Z 2025-05-07T19:46:09.2991248Z libcusparse-12.5.4.2 | 118.6 MB | #6 | 16%  2025-05-07T19:46:09.3301662Z nsight-compute-2024. | 443.1 MB | 3 | 4% 2025-05-07T19:46:09.3301992Z 2025-05-07T19:46:09.3722641Z libcublas-12.6.4.1 | 256.2 MB | 8 | 8%  2025-05-07T19:46:09.3723483Z 2025-05-07T19:46:09.3723498Z 2025-05-07T19:46:09.3723509Z 2025-05-07T19:46:09.3723519Z 2025-05-07T19:46:09.3782032Z cuda-nsight-12.6.77 | 113.2 MB | #9 | 20%  2025-05-07T19:46:09.3782350Z 2025-05-07T19:46:09.3782356Z 2025-05-07T19:46:09.3956719Z libcufft-11.3.0.4 | 156.2 MB | ##1 | 21%  2025-05-07T19:46:09.3957165Z 2025-05-07T19:46:09.3957234Z 2025-05-07T19:46:09.3957240Z 2025-05-07T19:46:09.3995826Z libcusparse-12.5.4.2 | 118.6 MB | ##1 | 21%  2025-05-07T19:46:09.4301876Z nsight-compute-2024. | 443.1 MB | 4 | 5% 2025-05-07T19:46:09.4302165Z 2025-05-07T19:46:09.4722740Z libcublas-12.6.4.1 | 256.2 MB | # | 10%  2025-05-07T19:46:09.4723088Z 2025-05-07T19:46:09.4723168Z 2025-05-07T19:46:09.4723174Z 2025-05-07T19:46:09.4723216Z 2025-05-07T19:46:09.4901307Z cuda-nsight-12.6.77 | 113.2 MB | ##5 | 25%  2025-05-07T19:46:09.4901653Z 2025-05-07T19:46:09.4901667Z 2025-05-07T19:46:09.4958962Z libcufft-11.3.0.4 | 156.2 MB | ##5 | 26%  2025-05-07T19:46:09.4959254Z 2025-05-07T19:46:09.4959259Z 2025-05-07T19:46:09.4959263Z 2025-05-07T19:46:09.4995716Z libcusparse-12.5.4.2 | 118.6 MB | ##6 | 27%  2025-05-07T19:46:09.5302774Z nsight-compute-2024. | 443.1 MB | 5 | 6% 2025-05-07T19:46:09.5303577Z 2025-05-07T19:46:09.5724515Z libcublas-12.6.4.1 | 256.2 MB | #2 | 13%  2025-05-07T19:46:09.5725380Z 2025-05-07T19:46:09.5725395Z 2025-05-07T19:46:09.5725407Z 2025-05-07T19:46:09.5725417Z 2025-05-07T19:46:09.5958988Z cuda-nsight-12.6.77 | 113.2 MB | ### | 31%  2025-05-07T19:46:09.5959907Z 2025-05-07T19:46:09.5959923Z 2025-05-07T19:46:09.5959934Z 2025-05-07T19:46:09.5965012Z libcusparse-12.5.4.2 | 118.6 MB | ###2 | 32%  2025-05-07T19:46:09.5965329Z 2025-05-07T19:46:09.5965354Z 2025-05-07T19:46:09.6001134Z libcufft-11.3.0.4 | 156.2 MB | ### | 30%  2025-05-07T19:46:09.6304567Z nsight-compute-2024. | 443.1 MB | 6 | 7% 2025-05-07T19:46:09.6305364Z 2025-05-07T19:46:09.6724397Z libcublas-12.6.4.1 | 256.2 MB | #4 | 15%  2025-05-07T19:46:09.6724689Z 2025-05-07T19:46:09.6724844Z 2025-05-07T19:46:09.6724852Z 2025-05-07T19:46:09.6724858Z 2025-05-07T19:46:09.6960001Z cuda-nsight-12.6.77 | 113.2 MB | ###6 | 37%  2025-05-07T19:46:09.6960486Z 2025-05-07T19:46:09.6960491Z 2025-05-07T19:46:09.6960511Z 2025-05-07T19:46:09.6967826Z libcusparse-12.5.4.2 | 118.6 MB | ###8 | 38%  2025-05-07T19:46:09.6968685Z 2025-05-07T19:46:09.6968696Z 2025-05-07T19:46:09.7177513Z libcufft-11.3.0.4 | 156.2 MB | ###4 | 35%  2025-05-07T19:46:09.7305575Z nsight-compute-2024. | 443.1 MB | 7 | 8% 2025-05-07T19:46:09.7306453Z 2025-05-07T19:46:09.7961764Z libcublas-12.6.4.1 | 256.2 MB | #7 | 18%  2025-05-07T19:46:09.7962291Z 2025-05-07T19:46:09.7962295Z 2025-05-07T19:46:09.7962299Z 2025-05-07T19:46:09.7969200Z libcusparse-12.5.4.2 | 118.6 MB | ####4 | 44%  2025-05-07T19:46:09.7969512Z 2025-05-07T19:46:09.7969522Z 2025-05-07T19:46:09.8069184Z libcufft-11.3.0.4 | 156.2 MB | ###9 | 39%  2025-05-07T19:46:09.8069488Z 2025-05-07T19:46:09.8069493Z 2025-05-07T19:46:09.8069518Z 2025-05-07T19:46:09.8069521Z 2025-05-07T19:46:09.8237063Z cuda-nsight-12.6.77 | 113.2 MB | ####2 | 42%  2025-05-07T19:46:09.8306794Z nsight-compute-2024. | 443.1 MB | 9 | 9% 2025-05-07T19:46:09.8307088Z 2025-05-07T19:46:09.8961503Z libcublas-12.6.4.1 | 256.2 MB | ## | 20%  2025-05-07T19:46:09.8961821Z 2025-05-07T19:46:09.8961916Z 2025-05-07T19:46:09.8961920Z 2025-05-07T19:46:09.9108426Z libcusparse-12.5.4.2 | 118.6 MB | #####2 | 52%  2025-05-07T19:46:09.9108772Z 2025-05-07T19:46:09.9108777Z 2025-05-07T19:46:09.9140862Z libcufft-11.3.0.4 | 156.2 MB | ####3 | 44%  2025-05-07T19:46:09.9141699Z 2025-05-07T19:46:09.9141738Z 2025-05-07T19:46:09.9141749Z 2025-05-07T19:46:09.9141760Z 2025-05-07T19:46:09.9317267Z cuda-nsight-12.6.77 | 113.2 MB | ####8 | 49%  2025-05-07T19:46:09.9317600Z 2025-05-07T19:46:10.0046056Z libcublas-12.6.4.1 | 256.2 MB | ##2 | 23%  2025-05-07T19:46:10.0176894Z nsight-compute-2024. | 443.1 MB | # | 10% 2025-05-07T19:46:10.0177505Z 2025-05-07T19:46:10.0177541Z 2025-05-07T19:46:10.0177556Z 2025-05-07T19:46:10.0389227Z libcusparse-12.5.4.2 | 118.6 MB | #####8 | 58%  2025-05-07T19:46:10.0389582Z 2025-05-07T19:46:10.0505522Z libcublas-12.6.4.1 | 256.2 MB | ##5 | 25%  2025-05-07T19:46:10.0506333Z 2025-05-07T19:46:10.0506346Z 2025-05-07T19:46:10.0506357Z 2025-05-07T19:46:10.0506381Z 2025-05-07T19:46:10.0582582Z cuda-nsight-12.6.77 | 113.2 MB | #####3 | 54%  2025-05-07T19:46:10.0582925Z 2025-05-07T19:46:10.0582930Z 2025-05-07T19:46:10.1052168Z libcufft-11.3.0.4 | 156.2 MB | ####7 | 48%  2025-05-07T19:46:10.1518422Z nsight-compute-2024. | 443.1 MB | #1 | 11% 2025-05-07T19:46:10.1519093Z 2025-05-07T19:46:10.1519130Z 2025-05-07T19:46:10.1519143Z 2025-05-07T19:46:10.1519224Z 2025-05-07T19:46:10.1538521Z cuda-nsight-12.6.77 | 113.2 MB | #####8 | 59%  2025-05-07T19:46:10.1538876Z 2025-05-07T19:46:10.1538881Z 2025-05-07T19:46:10.1538884Z 2025-05-07T19:46:10.1619795Z libcusparse-12.5.4.2 | 118.6 MB | ######4 | 64%  2025-05-07T19:46:10.1620153Z 2025-05-07T19:46:10.2054118Z libcublas-12.6.4.1 | 256.2 MB | ##7 | 28%  2025-05-07T19:46:10.2082402Z nsight-compute-2024. | 443.1 MB | #2 | 12% 2025-05-07T19:46:10.2082711Z 2025-05-07T19:46:10.2082805Z 2025-05-07T19:46:10.2588342Z libcufft-11.3.0.4 | 156.2 MB | #####1 | 52%  2025-05-07T19:46:10.2588640Z 2025-05-07T19:46:10.2588683Z 2025-05-07T19:46:10.2588689Z 2025-05-07T19:46:10.2588694Z 2025-05-07T19:46:10.2777825Z cuda-nsight-12.6.77 | 113.2 MB | ######3 | 64%  2025-05-07T19:46:10.2778155Z 2025-05-07T19:46:10.2894917Z libcublas-12.6.4.1 | 256.2 MB | ##9 | 30%  2025-05-07T19:46:10.2895210Z 2025-05-07T19:46:10.2895375Z 2025-05-07T19:46:10.2895394Z 2025-05-07T19:46:10.3054402Z libcusparse-12.5.4.2 | 118.6 MB | ######9 | 70%  2025-05-07T19:46:10.3257486Z nsight-compute-2024. | 443.1 MB | #3 | 13% 2025-05-07T19:46:10.3257817Z 2025-05-07T19:46:10.3257957Z 2025-05-07T19:46:10.3591321Z libcufft-11.3.0.4 | 156.2 MB | #####5 | 55%  2025-05-07T19:46:10.3591621Z 2025-05-07T19:46:10.3591627Z 2025-05-07T19:46:10.3591645Z 2025-05-07T19:46:10.3592547Z 2025-05-07T19:46:10.3844996Z cuda-nsight-12.6.77 | 113.2 MB | ######8 | 68%  2025-05-07T19:46:10.3845310Z 2025-05-07T19:46:10.4055782Z libcublas-12.6.4.1 | 256.2 MB | ###1 | 32%  2025-05-07T19:46:10.4090020Z nsight-compute-2024. | 443.1 MB | #4 | 14% 2025-05-07T19:46:10.4090319Z 2025-05-07T19:46:10.4090475Z 2025-05-07T19:46:10.4090486Z 2025-05-07T19:46:10.4337936Z libcusparse-12.5.4.2 | 118.6 MB | #######4 | 75%  2025-05-07T19:46:10.4338282Z 2025-05-07T19:46:10.4338422Z 2025-05-07T19:46:10.4592570Z libcufft-11.3.0.4 | 156.2 MB | #####8 | 58%  2025-05-07T19:46:10.4592861Z 2025-05-07T19:46:10.4592867Z 2025-05-07T19:46:10.4592872Z 2025-05-07T19:46:10.4592878Z 2025-05-07T19:46:10.4845307Z cuda-nsight-12.6.77 | 113.2 MB | #######3 | 73%  2025-05-07T19:46:10.4845643Z 2025-05-07T19:46:10.5061104Z libcublas-12.6.4.1 | 256.2 MB | ###4 | 34%  2025-05-07T19:46:10.5219665Z nsight-compute-2024. | 443.1 MB | #5 | 15% 2025-05-07T19:46:10.5219997Z 2025-05-07T19:46:10.5220002Z 2025-05-07T19:46:10.5220006Z 2025-05-07T19:46:10.5343983Z libcusparse-12.5.4.2 | 118.6 MB | #######9 | 80%  2025-05-07T19:46:10.5344327Z 2025-05-07T19:46:10.5344332Z 2025-05-07T19:46:10.5663713Z libcufft-11.3.0.4 | 156.2 MB | ######1 | 61%  2025-05-07T19:46:10.5664557Z 2025-05-07T19:46:10.5664571Z 2025-05-07T19:46:10.5664582Z 2025-05-07T19:46:10.5664592Z 2025-05-07T19:46:10.5972987Z cuda-nsight-12.6.77 | 113.2 MB | #######7 | 78%  2025-05-07T19:46:10.5973307Z 2025-05-07T19:46:10.6062669Z libcublas-12.6.4.1 | 256.2 MB | ###6 | 36%  2025-05-07T19:46:10.6358414Z nsight-compute-2024. | 443.1 MB | #6 | 16% 2025-05-07T19:46:10.6358882Z 2025-05-07T19:46:10.6359195Z 2025-05-07T19:46:10.6359212Z 2025-05-07T19:46:10.6465334Z libcusparse-12.5.4.2 | 118.6 MB | ########4 | 84%  2025-05-07T19:46:10.6465655Z 2025-05-07T19:46:10.6466908Z 2025-05-07T19:46:10.6711728Z libcufft-11.3.0.4 | 156.2 MB | ######4 | 65%  2025-05-07T19:46:10.6712037Z 2025-05-07T19:46:10.6712044Z 2025-05-07T19:46:10.6712049Z 2025-05-07T19:46:10.6712055Z 2025-05-07T19:46:10.7002745Z cuda-nsight-12.6.77 | 113.2 MB | ########2 | 82%  2025-05-07T19:46:10.7003061Z 2025-05-07T19:46:10.7104865Z libcublas-12.6.4.1 | 256.2 MB | ###8 | 38%  2025-05-07T19:46:10.7374365Z nsight-compute-2024. | 443.1 MB | #7 | 17% 2025-05-07T19:46:10.7374703Z 2025-05-07T19:46:10.7374906Z 2025-05-07T19:46:10.7374918Z 2025-05-07T19:46:10.7642560Z libcusparse-12.5.4.2 | 118.6 MB | ########8 | 89%  2025-05-07T19:46:10.7642897Z 2025-05-07T19:46:10.7642903Z 2025-05-07T19:46:10.7731828Z libcufft-11.3.0.4 | 156.2 MB | ######7 | 68%  2025-05-07T19:46:10.7732148Z 2025-05-07T19:46:10.7732154Z 2025-05-07T19:46:10.7732157Z 2025-05-07T19:46:10.7732381Z 2025-05-07T19:46:10.8014950Z cuda-nsight-12.6.77 | 113.2 MB | ########6 | 87%  2025-05-07T19:46:10.8015271Z 2025-05-07T19:46:10.8136501Z libcublas-12.6.4.1 | 256.2 MB | #### | 40%  2025-05-07T19:46:10.8375998Z nsight-compute-2024. | 443.1 MB | #8 | 18% 2025-05-07T19:46:10.8376343Z 2025-05-07T19:46:10.8376534Z 2025-05-07T19:46:10.8376550Z 2025-05-07T19:46:10.8646644Z libcusparse-12.5.4.2 | 118.6 MB | #########3 | 93%  2025-05-07T19:46:10.8647558Z 2025-05-07T19:46:10.8647573Z 2025-05-07T19:46:10.8734622Z libcufft-11.3.0.4 | 156.2 MB | ####### | 71%  2025-05-07T19:46:10.8735089Z 2025-05-07T19:46:10.8735137Z 2025-05-07T19:46:10.8735156Z 2025-05-07T19:46:10.8735196Z 2025-05-07T19:46:10.9068927Z cuda-nsight-12.6.77 | 113.2 MB | #########1 | 92%  2025-05-07T19:46:10.9069815Z 2025-05-07T19:46:10.9141571Z libcublas-12.6.4.1 | 256.2 MB | ####2 | 42%  2025-05-07T19:46:10.9453033Z nsight-compute-2024. | 443.1 MB | #9 | 19% 2025-05-07T19:46:10.9453532Z 2025-05-07T19:46:10.9453551Z 2025-05-07T19:46:10.9453556Z 2025-05-07T19:46:10.9709052Z libcusparse-12.5.4.2 | 118.6 MB | #########7 | 98%  2025-05-07T19:46:10.9709402Z 2025-05-07T19:46:10.9709408Z 2025-05-07T19:46:10.9758722Z libcufft-11.3.0.4 | 156.2 MB | #######3 | 74%  2025-05-07T19:46:10.9760040Z 2025-05-07T19:46:10.9760053Z 2025-05-07T19:46:10.9760064Z 2025-05-07T19:46:10.9760074Z 2025-05-07T19:46:11.0067567Z cuda-nsight-12.6.77 | 113.2 MB | #########6 | 96%  2025-05-07T19:46:11.0067889Z 2025-05-07T19:46:11.0142773Z libcublas-12.6.4.1 | 256.2 MB | ####4 | 45%  2025-05-07T19:46:11.0707436Z nsight-compute-2024. | 443.1 MB | ## | 20% 2025-05-07T19:46:11.0707727Z 2025-05-07T19:46:11.0707733Z 2025-05-07T19:46:11.1068809Z libcufft-11.3.0.4 | 156.2 MB | #######7 | 78%  2025-05-07T19:46:11.1069124Z 2025-05-07T19:46:11.1290718Z libcublas-12.6.4.1 | 256.2 MB | ####7 | 48%  2025-05-07T19:46:11.1708565Z nsight-compute-2024. | 443.1 MB | ##1 | 21% 2025-05-07T19:46:11.1708871Z 2025-05-07T19:46:11.1708876Z 2025-05-07T19:46:11.2068937Z libcufft-11.3.0.4 | 156.2 MB | ########3 | 83%  2025-05-07T19:46:11.2069254Z 2025-05-07T19:46:11.2292186Z libcublas-12.6.4.1 | 256.2 MB | #####1 | 52%  2025-05-07T19:46:11.2710109Z nsight-compute-2024. | 443.1 MB | ##3 | 23% 2025-05-07T19:46:11.2710458Z 2025-05-07T19:46:11.2710463Z 2025-05-07T19:46:11.3069335Z libcufft-11.3.0.4 | 156.2 MB | ########8 | 89%  2025-05-07T19:46:11.3069639Z 2025-05-07T19:46:11.3293691Z libcublas-12.6.4.1 | 256.2 MB | #####4 | 55%  2025-05-07T19:46:11.3842818Z nsight-compute-2024. | 443.1 MB | ##4 | 25% 2025-05-07T19:46:11.3843384Z 2025-05-07T19:46:11.3843396Z 2025-05-07T19:46:11.4105449Z libcufft-11.3.0.4 | 156.2 MB | #########2 | 93%  2025-05-07T19:46:11.4105794Z 2025-05-07T19:46:11.4339743Z libcublas-12.6.4.1 | 256.2 MB | #####7 | 58%  2025-05-07T19:46:11.4883914Z nsight-compute-2024. | 443.1 MB | ##6 | 27% 2025-05-07T19:46:11.4884448Z 2025-05-07T19:46:11.4884470Z 2025-05-07T19:46:11.5702598Z libcufft-11.3.0.4 | 156.2 MB | #########7 | 97%  2025-05-07T19:46:11.5735421Z nsight-compute-2024. | 443.1 MB | ##8 | 28% 2025-05-07T19:46:11.5735862Z 2025-05-07T19:46:11.6644852Z libcublas-12.6.4.1 | 256.2 MB | ###### | 61%  2025-05-07T19:46:11.6645288Z 2025-05-07T19:46:11.6645345Z 2025-05-07T19:46:11.6645350Z 2025-05-07T19:46:11.6645355Z 2025-05-07T19:46:11.6790417Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:46:11.6790740Z 2025-05-07T19:46:11.6790747Z 2025-05-07T19:46:11.6791169Z 2025-05-07T19:46:11.7175227Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:46:11.7175573Z 2025-05-07T19:46:11.7175579Z 2025-05-07T19:46:11.7175618Z 2025-05-07T19:46:11.7175622Z 2025-05-07T19:46:11.7175626Z 2025-05-07T19:46:11.7175631Z 2025-05-07T19:46:11.7177662Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:46:11.7802958Z nsight-compute-2024. | 443.1 MB | ##9 | 30% 2025-05-07T19:46:11.7803733Z 2025-05-07T19:46:11.8176161Z libcublas-12.6.4.1 | 256.2 MB | ######3 | 63%  2025-05-07T19:46:11.8176509Z 2025-05-07T19:46:11.8176514Z 2025-05-07T19:46:11.8176518Z 2025-05-07T19:46:11.8176521Z 2025-05-07T19:46:11.8176524Z 2025-05-07T19:46:11.8176528Z 2025-05-07T19:46:11.8179205Z libcusolver-11.7.1.2 | 95.8 MB | 8 | 9%  2025-05-07T19:46:11.8220916Z nsight-compute-2024. | 443.1 MB | ### | 31% 2025-05-07T19:46:11.8221217Z 2025-05-07T19:46:11.8221222Z 2025-05-07T19:46:11.8221228Z 2025-05-07T19:46:11.8221233Z 2025-05-07T19:46:11.8221237Z 2025-05-07T19:46:11.8804060Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:46:11.8804404Z 2025-05-07T19:46:11.9181504Z libcublas-12.6.4.1 | 256.2 MB | ######6 | 66%  2025-05-07T19:46:11.9181817Z 2025-05-07T19:46:11.9181822Z 2025-05-07T19:46:11.9181828Z 2025-05-07T19:46:11.9181833Z 2025-05-07T19:46:11.9181838Z 2025-05-07T19:46:11.9181856Z 2025-05-07T19:46:11.9221190Z libcusolver-11.7.1.2 | 95.8 MB | #7 | 17%  2025-05-07T19:46:11.9221796Z 2025-05-07T19:46:11.9221801Z 2025-05-07T19:46:11.9221807Z 2025-05-07T19:46:11.9221811Z 2025-05-07T19:46:11.9221815Z 2025-05-07T19:46:11.9706689Z cuda-nvvp-12.6.80 | 109.3 MB | 3 | 4%  2025-05-07T19:46:11.9804401Z nsight-compute-2024. | 443.1 MB | ###2 | 32% 2025-05-07T19:46:11.9805228Z 2025-05-07T19:46:12.0181790Z libcublas-12.6.4.1 | 256.2 MB | ######8 | 69%  2025-05-07T19:46:12.0182094Z 2025-05-07T19:46:12.0182100Z 2025-05-07T19:46:12.0182104Z 2025-05-07T19:46:12.0182108Z 2025-05-07T19:46:12.0182125Z 2025-05-07T19:46:12.0182168Z 2025-05-07T19:46:12.0221473Z libcusolver-11.7.1.2 | 95.8 MB | ##4 | 24%  2025-05-07T19:46:12.0221789Z 2025-05-07T19:46:12.0221794Z 2025-05-07T19:46:12.0221799Z 2025-05-07T19:46:12.0221802Z 2025-05-07T19:46:12.0221807Z 2025-05-07T19:46:12.0769208Z cuda-nvvp-12.6.80 | 109.3 MB | 9 | 9%  2025-05-07T19:46:12.0901778Z nsight-compute-2024. | 443.1 MB | ###3 | 33% 2025-05-07T19:46:12.0902114Z 2025-05-07T19:46:12.1222107Z libcublas-12.6.4.1 | 256.2 MB | #######1 | 71%  2025-05-07T19:46:12.1222416Z 2025-05-07T19:46:12.1222422Z 2025-05-07T19:46:12.1222426Z 2025-05-07T19:46:12.1222443Z 2025-05-07T19:46:12.1222447Z 2025-05-07T19:46:12.1320453Z cuda-nvvp-12.6.80 | 109.3 MB | #3 | 14%  2025-05-07T19:46:12.1320775Z 2025-05-07T19:46:12.1320780Z 2025-05-07T19:46:12.1320784Z 2025-05-07T19:46:12.1320788Z 2025-05-07T19:46:12.1320792Z 2025-05-07T19:46:12.1320808Z 2025-05-07T19:46:12.1789418Z libcusolver-11.7.1.2 | 95.8 MB | ###1 | 31%  2025-05-07T19:46:12.2043189Z nsight-compute-2024. | 443.1 MB | ###4 | 35% 2025-05-07T19:46:12.2043617Z 2025-05-07T19:46:12.2222723Z libcublas-12.6.4.1 | 256.2 MB | #######3 | 74%  2025-05-07T19:46:12.2223021Z 2025-05-07T19:46:12.2223027Z 2025-05-07T19:46:12.2223032Z 2025-05-07T19:46:12.2223036Z 2025-05-07T19:46:12.2224000Z 2025-05-07T19:46:12.2461172Z cuda-nvvp-12.6.80 | 109.3 MB | #9 | 19%  2025-05-07T19:46:12.2461479Z 2025-05-07T19:46:12.2910372Z 2025-05-07T19:46:12.2910393Z 2025-05-07T19:46:12.2910409Z 2025-05-07T19:46:12.2910423Z 2025-05-07T19:46:12.2910438Z 2025-05-07T19:46:12.2911668Z libcusolver-11.7.1.2 | 95.8 MB | ###7 | 38%  2025-05-07T19:46:12.3124849Z nsight-compute-2024. | 443.1 MB | ###5 | 36% 2025-05-07T19:46:12.3125302Z 2025-05-07T19:46:12.3223116Z libcublas-12.6.4.1 | 256.2 MB | #######5 | 76%  2025-05-07T19:46:12.3223439Z 2025-05-07T19:46:12.3223473Z 2025-05-07T19:46:12.3223477Z 2025-05-07T19:46:12.3223481Z 2025-05-07T19:46:12.3223486Z 2025-05-07T19:46:12.3526470Z cuda-nvvp-12.6.80 | 109.3 MB | ##4 | 25%  2025-05-07T19:46:12.3527365Z 2025-05-07T19:46:12.3527379Z 2025-05-07T19:46:12.3527391Z 2025-05-07T19:46:12.3527402Z 2025-05-07T19:46:12.3527412Z 2025-05-07T19:46:12.3527443Z 2025-05-07T19:46:12.3912690Z libcusolver-11.7.1.2 | 95.8 MB | ####4 | 44%  2025-05-07T19:46:12.4189755Z nsight-compute-2024. | 443.1 MB | ###6 | 37% 2025-05-07T19:46:12.4190034Z 2025-05-07T19:46:12.4225751Z libcublas-12.6.4.1 | 256.2 MB | #######8 | 78%  2025-05-07T19:46:12.4226051Z 2025-05-07T19:46:12.4226056Z 2025-05-07T19:46:12.4226061Z 2025-05-07T19:46:12.4226066Z 2025-05-07T19:46:12.4226070Z 2025-05-07T19:46:12.4562028Z cuda-nvvp-12.6.80 | 109.3 MB | ##9 | 30%  2025-05-07T19:46:12.4562370Z 2025-05-07T19:46:12.4562376Z 2025-05-07T19:46:12.4562405Z 2025-05-07T19:46:12.4562410Z 2025-05-07T19:46:12.4562415Z 2025-05-07T19:46:12.4562419Z 2025-05-07T19:46:12.4934054Z libcusolver-11.7.1.2 | 95.8 MB | ##### | 50%  2025-05-07T19:46:12.5227311Z nsight-compute-2024. | 443.1 MB | ###7 | 38% 2025-05-07T19:46:12.5227629Z 2025-05-07T19:46:12.5227635Z 2025-05-07T19:46:12.5227642Z 2025-05-07T19:46:12.5227647Z 2025-05-07T19:46:12.5228158Z 2025-05-07T19:46:12.5263031Z cuda-nvvp-12.6.80 | 109.3 MB | ###4 | 35%  2025-05-07T19:46:12.5263368Z 2025-05-07T19:46:12.5645317Z libcublas-12.6.4.1 | 256.2 MB | ######## | 80%  2025-05-07T19:46:12.5646150Z 2025-05-07T19:46:12.5646167Z 2025-05-07T19:46:12.5646178Z 2025-05-07T19:46:12.5646188Z 2025-05-07T19:46:12.5646225Z 2025-05-07T19:46:12.5646235Z 2025-05-07T19:46:12.6265208Z libcusolver-11.7.1.2 | 95.8 MB | #####6 | 56%  2025-05-07T19:46:12.6265540Z 2025-05-07T19:46:12.6349574Z libcublas-12.6.4.1 | 256.2 MB | ########3 | 83%  2025-05-07T19:46:12.6646922Z nsight-compute-2024. | 443.1 MB | ###9 | 39% 2025-05-07T19:46:12.6647302Z 2025-05-07T19:46:12.6647398Z 2025-05-07T19:46:12.6647402Z 2025-05-07T19:46:12.6647405Z 2025-05-07T19:46:12.6647409Z 2025-05-07T19:46:12.6647412Z 2025-05-07T19:46:12.6883539Z libcusolver-11.7.1.2 | 95.8 MB | ######4 | 65%  2025-05-07T19:46:12.6883957Z 2025-05-07T19:46:12.6884003Z 2025-05-07T19:46:12.6884006Z 2025-05-07T19:46:12.6884010Z 2025-05-07T19:46:12.6884014Z 2025-05-07T19:46:12.7281171Z cuda-nvvp-12.6.80 | 109.3 MB | ###9 | 40%  2025-05-07T19:46:12.7281517Z 2025-05-07T19:46:12.7355744Z libcublas-12.6.4.1 | 256.2 MB | ########5 | 86%  2025-05-07T19:46:12.7683260Z nsight-compute-2024. | 443.1 MB | #### | 40% 2025-05-07T19:46:12.7683546Z 2025-05-07T19:46:12.7683551Z 2025-05-07T19:46:12.7683557Z 2025-05-07T19:46:12.7683646Z 2025-05-07T19:46:12.7683700Z 2025-05-07T19:46:12.7683709Z 2025-05-07T19:46:12.7882808Z libcusolver-11.7.1.2 | 95.8 MB | #######1 | 72%  2025-05-07T19:46:12.7883223Z 2025-05-07T19:46:12.7883229Z 2025-05-07T19:46:12.7883233Z 2025-05-07T19:46:12.7883238Z 2025-05-07T19:46:12.7883243Z 2025-05-07T19:46:12.8357458Z cuda-nvvp-12.6.80 | 109.3 MB | ####4 | 44%  2025-05-07T19:46:12.8431593Z nsight-compute-2024. | 443.1 MB | ####1 | 41% 2025-05-07T19:46:12.8432116Z 2025-05-07T19:46:12.8819536Z libcublas-12.6.4.1 | 256.2 MB | ########7 | 88%  2025-05-07T19:46:12.8819924Z 2025-05-07T19:46:12.8819929Z 2025-05-07T19:46:12.8819936Z 2025-05-07T19:46:12.8819942Z 2025-05-07T19:46:12.8819946Z 2025-05-07T19:46:12.8819954Z 2025-05-07T19:46:12.8885294Z libcusolver-11.7.1.2 | 95.8 MB | #######8 | 78%  2025-05-07T19:46:12.8885673Z 2025-05-07T19:46:12.8885680Z 2025-05-07T19:46:12.8885687Z 2025-05-07T19:46:12.8885693Z 2025-05-07T19:46:12.8885700Z 2025-05-07T19:46:12.9357713Z cuda-nvvp-12.6.80 | 109.3 MB | ####9 | 49%  2025-05-07T19:46:12.9449842Z nsight-compute-2024. | 443.1 MB | ####2 | 43% 2025-05-07T19:46:12.9450167Z 2025-05-07T19:46:12.9892356Z libcublas-12.6.4.1 | 256.2 MB | ######### | 90%  2025-05-07T19:46:12.9892700Z 2025-05-07T19:46:12.9892707Z 2025-05-07T19:46:12.9892733Z 2025-05-07T19:46:12.9892738Z 2025-05-07T19:46:12.9892744Z 2025-05-07T19:46:12.9947872Z cuda-nvvp-12.6.80 | 109.3 MB | #####4 | 55%  2025-05-07T19:46:12.9948216Z 2025-05-07T19:46:12.9948222Z 2025-05-07T19:46:12.9948238Z 2025-05-07T19:46:12.9948242Z 2025-05-07T19:46:12.9948245Z 2025-05-07T19:46:12.9948262Z 2025-05-07T19:46:13.0198981Z libcusolver-11.7.1.2 | 95.8 MB | ########4 | 85%  2025-05-07T19:46:13.0199321Z 2025-05-07T19:46:13.0199327Z 2025-05-07T19:46:13.0199434Z 2025-05-07T19:46:13.0199442Z 2025-05-07T19:46:13.0378444Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:46:13.0477723Z nsight-compute-2024. | 443.1 MB | ####3 | 44% 2025-05-07T19:46:13.0478201Z 2025-05-07T19:46:13.0892400Z libcublas-12.6.4.1 | 256.2 MB | #########2 | 92%  2025-05-07T19:46:13.0892717Z 2025-05-07T19:46:13.0892723Z 2025-05-07T19:46:13.0892728Z 2025-05-07T19:46:13.0892733Z 2025-05-07T19:46:13.0892738Z 2025-05-07T19:46:13.0962214Z cuda-nvvp-12.6.80 | 109.3 MB | ###### | 60%  2025-05-07T19:46:13.0962818Z 2025-05-07T19:46:13.0962824Z 2025-05-07T19:46:13.0962827Z 2025-05-07T19:46:13.0962832Z 2025-05-07T19:46:13.0962836Z 2025-05-07T19:46:13.0962839Z 2025-05-07T19:46:13.1381249Z libcusolver-11.7.1.2 | 95.8 MB | ######### | 91%  2025-05-07T19:46:13.1476889Z nsight-compute-2024. | 443.1 MB | ####4 | 45% 2025-05-07T19:46:13.1479634Z 2025-05-07T19:46:13.1892807Z libcublas-12.6.4.1 | 256.2 MB | #########4 | 95%  2025-05-07T19:46:13.1893411Z 2025-05-07T19:46:13.1893436Z 2025-05-07T19:46:13.1893498Z 2025-05-07T19:46:13.1893504Z 2025-05-07T19:46:13.1893556Z 2025-05-07T19:46:13.1965300Z cuda-nvvp-12.6.80 | 109.3 MB | ######6 | 66%  2025-05-07T19:46:13.1965657Z 2025-05-07T19:46:13.1965662Z 2025-05-07T19:46:13.1965665Z 2025-05-07T19:46:13.1965670Z 2025-05-07T19:46:13.1965673Z 2025-05-07T19:46:13.1965678Z 2025-05-07T19:46:13.2382129Z libcusolver-11.7.1.2 | 95.8 MB | #########7 | 97%  2025-05-07T19:46:13.2478284Z nsight-compute-2024. | 443.1 MB | ####6 | 46% 2025-05-07T19:46:13.2478817Z 2025-05-07T19:46:13.2894073Z libcublas-12.6.4.1 | 256.2 MB | #########6 | 97%  2025-05-07T19:46:13.2894394Z 2025-05-07T19:46:13.2894399Z 2025-05-07T19:46:13.2894403Z 2025-05-07T19:46:13.2894406Z 2025-05-07T19:46:13.2894410Z 2025-05-07T19:46:13.3382710Z cuda-nvvp-12.6.80 | 109.3 MB | #######1 | 72%  2025-05-07T19:46:13.3481087Z nsight-compute-2024. | 443.1 MB | ####7 | 48% 2025-05-07T19:46:13.3481895Z 2025-05-07T19:46:13.3894330Z libcublas-12.6.4.1 | 256.2 MB | #########9 | 100%  2025-05-07T19:46:13.3894642Z 2025-05-07T19:46:13.3894649Z 2025-05-07T19:46:13.3894654Z 2025-05-07T19:46:13.3894658Z 2025-05-07T19:46:13.3894661Z 2025-05-07T19:46:13.4384348Z cuda-nvvp-12.6.80 | 109.3 MB | #######9 | 80%  2025-05-07T19:46:13.4894767Z nsight-compute-2024. | 443.1 MB | ####8 | 49% 2025-05-07T19:46:13.4895094Z 2025-05-07T19:46:13.4895281Z 2025-05-07T19:46:13.4895321Z 2025-05-07T19:46:13.4895327Z 2025-05-07T19:46:13.4895333Z 2025-05-07T19:46:13.5620850Z cuda-nvvp-12.6.80 | 109.3 MB | #########4 | 95%  2025-05-07T19:46:13.5621185Z 2025-05-07T19:46:13.5621191Z 2025-05-07T19:46:13.5709350Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:46:13.6088571Z nsight-compute-2024. | 443.1 MB | ##### | 50% 2025-05-07T19:46:13.6088931Z 2025-05-07T19:46:13.6089056Z 2025-05-07T19:46:13.6089062Z 2025-05-07T19:46:13.6089089Z 2025-05-07T19:46:13.6089095Z 2025-05-07T19:46:13.6089120Z 2025-05-07T19:46:13.6089158Z 2025-05-07T19:46:13.6775123Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:46:13.7089189Z nsight-compute-2024. | 443.1 MB | #####1 | 51% 2025-05-07T19:46:13.7089521Z 2025-05-07T19:46:13.7089681Z 2025-05-07T19:46:13.7089691Z 2025-05-07T19:46:13.7089697Z 2025-05-07T19:46:13.7089702Z 2025-05-07T19:46:13.7089718Z 2025-05-07T19:46:13.7089722Z 2025-05-07T19:46:13.7852148Z libnpp-12.3.1.54 | 93.4 MB | 9 | 10%  2025-05-07T19:46:13.8089935Z nsight-compute-2024. | 443.1 MB | #####2 | 52% 2025-05-07T19:46:13.8090303Z 2025-05-07T19:46:13.8090312Z 2025-05-07T19:46:13.8090318Z 2025-05-07T19:46:13.8090325Z 2025-05-07T19:46:13.8090333Z 2025-05-07T19:46:13.8090340Z 2025-05-07T19:46:13.8090347Z 2025-05-07T19:46:13.8854457Z libnpp-12.3.1.54 | 93.4 MB | #9 | 19%  2025-05-07T19:46:13.9124618Z nsight-compute-2024. | 443.1 MB | #####4 | 55% 2025-05-07T19:46:13.9124932Z 2025-05-07T19:46:13.9125256Z 2025-05-07T19:46:13.9125272Z 2025-05-07T19:46:13.9125279Z 2025-05-07T19:46:13.9125285Z 2025-05-07T19:46:13.9125290Z 2025-05-07T19:46:13.9125296Z 2025-05-07T19:46:13.9892863Z libnpp-12.3.1.54 | 93.4 MB | ##6 | 27%  2025-05-07T19:46:14.0126115Z nsight-compute-2024. | 443.1 MB | #####5 | 56% 2025-05-07T19:46:14.0126540Z 2025-05-07T19:46:14.0126804Z 2025-05-07T19:46:14.0127099Z 2025-05-07T19:46:14.0127107Z 2025-05-07T19:46:14.0127116Z 2025-05-07T19:46:14.0127120Z 2025-05-07T19:46:14.0127125Z 2025-05-07T19:46:14.0798724Z libnpp-12.3.1.54 | 93.4 MB | ###7 | 38%  2025-05-07T19:46:14.0799113Z 2025-05-07T19:46:14.0799120Z 2025-05-07T19:46:14.0799126Z 2025-05-07T19:46:14.0799132Z 2025-05-07T19:46:14.0799136Z 2025-05-07T19:46:14.0799141Z 2025-05-07T19:46:14.0893510Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:46:14.1129512Z nsight-compute-2024. | 443.1 MB | #####7 | 57% 2025-05-07T19:46:14.1130179Z 2025-05-07T19:46:14.1130194Z 2025-05-07T19:46:14.1130201Z 2025-05-07T19:46:14.1130220Z 2025-05-07T19:46:14.1130226Z 2025-05-07T19:46:14.1130234Z 2025-05-07T19:46:14.1130239Z 2025-05-07T19:46:14.1148241Z libnpp-12.3.1.54 | 93.4 MB | ####7 | 47%  2025-05-07T19:46:14.1148579Z 2025-05-07T19:46:14.1148583Z 2025-05-07T19:46:14.1148588Z 2025-05-07T19:46:14.1148625Z 2025-05-07T19:46:14.1148631Z 2025-05-07T19:46:14.1148636Z 2025-05-07T19:46:14.1148641Z 2025-05-07T19:46:14.1148908Z 2025-05-07T19:46:14.2107409Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:46:14.2149238Z nsight-compute-2024. | 443.1 MB | #####8 | 59% 2025-05-07T19:46:14.2149560Z 2025-05-07T19:46:14.2149566Z 2025-05-07T19:46:14.2149572Z 2025-05-07T19:46:14.2149579Z 2025-05-07T19:46:14.2149585Z 2025-05-07T19:46:14.2149602Z 2025-05-07T19:46:14.2149610Z 2025-05-07T19:46:14.2149616Z 2025-05-07T19:46:14.2742787Z cuda-nvdisasm-12.6.7 | 47.6 MB | #3 | 14%  2025-05-07T19:46:14.2743184Z 2025-05-07T19:46:14.2743189Z 2025-05-07T19:46:14.2743193Z 2025-05-07T19:46:14.2743197Z 2025-05-07T19:46:14.2743201Z 2025-05-07T19:46:14.2743204Z 2025-05-07T19:46:14.2743208Z 2025-05-07T19:46:14.3136791Z libnpp-12.3.1.54 | 93.4 MB | #####6 | 56%  2025-05-07T19:46:14.3149509Z nsight-compute-2024. | 443.1 MB | ###### | 60% 2025-05-07T19:46:14.3149791Z 2025-05-07T19:46:14.3149796Z 2025-05-07T19:46:14.3149800Z 2025-05-07T19:46:14.3149803Z 2025-05-07T19:46:14.3149806Z 2025-05-07T19:46:14.3149810Z 2025-05-07T19:46:14.3149813Z 2025-05-07T19:46:14.3150544Z 2025-05-07T19:46:14.4061566Z cuda-nvdisasm-12.6.7 | 47.6 MB | ##7 | 28%  2025-05-07T19:46:14.4061950Z 2025-05-07T19:46:14.4061955Z 2025-05-07T19:46:14.4061960Z 2025-05-07T19:46:14.4061965Z 2025-05-07T19:46:14.4061969Z 2025-05-07T19:46:14.4061974Z 2025-05-07T19:46:14.4062010Z 2025-05-07T19:46:14.4151817Z libnpp-12.3.1.54 | 93.4 MB | ######4 | 64%  2025-05-07T19:46:14.4152128Z 2025-05-07T19:46:14.4152133Z 2025-05-07T19:46:14.4152138Z 2025-05-07T19:46:14.4152142Z 2025-05-07T19:46:14.4152146Z 2025-05-07T19:46:14.4152149Z 2025-05-07T19:46:14.4152154Z 2025-05-07T19:46:14.4169477Z 2025-05-07T19:46:14.4169943Z cuda-nvdisasm-12.6.7 | 47.6 MB | ####2 | 42%  2025-05-07T19:46:14.5156416Z nsight-compute-2024. | 443.1 MB | ######1 | 62% 2025-05-07T19:46:14.5156760Z 2025-05-07T19:46:14.5156765Z 2025-05-07T19:46:14.5156774Z 2025-05-07T19:46:14.5156779Z 2025-05-07T19:46:14.5156783Z 2025-05-07T19:46:14.5156788Z 2025-05-07T19:46:14.5156793Z 2025-05-07T19:46:14.5156798Z 2025-05-07T19:46:14.5181628Z cuda-nvdisasm-12.6.7 | 47.6 MB | #####6 | 57%  2025-05-07T19:46:14.5287536Z nsight-compute-2024. | 443.1 MB | ######3 | 63% 2025-05-07T19:46:14.5287996Z 2025-05-07T19:46:14.5288057Z 2025-05-07T19:46:14.5288063Z 2025-05-07T19:46:14.5288082Z 2025-05-07T19:46:14.5288087Z 2025-05-07T19:46:14.5288092Z 2025-05-07T19:46:14.5288159Z 2025-05-07T19:46:14.6158756Z libnpp-12.3.1.54 | 93.4 MB | #######1 | 71%  2025-05-07T19:46:14.6159127Z 2025-05-07T19:46:14.6159147Z 2025-05-07T19:46:14.6159153Z 2025-05-07T19:46:14.6159157Z 2025-05-07T19:46:14.6159163Z 2025-05-07T19:46:14.6159463Z 2025-05-07T19:46:14.6159467Z 2025-05-07T19:46:14.6159471Z 2025-05-07T19:46:14.6181789Z cuda-nvdisasm-12.6.7 | 47.6 MB | ####### | 70%  2025-05-07T19:46:14.6496463Z nsight-compute-2024. | 443.1 MB | ######4 | 65% 2025-05-07T19:46:14.6496767Z 2025-05-07T19:46:14.6496817Z 2025-05-07T19:46:14.6496822Z 2025-05-07T19:46:14.6496922Z 2025-05-07T19:46:14.6496926Z 2025-05-07T19:46:14.6496931Z 2025-05-07T19:46:14.6496937Z 2025-05-07T19:46:14.7158623Z libnpp-12.3.1.54 | 93.4 MB | #######8 | 78%  2025-05-07T19:46:14.7159021Z 2025-05-07T19:46:14.7159027Z 2025-05-07T19:46:14.7159032Z 2025-05-07T19:46:14.7159036Z 2025-05-07T19:46:14.7159043Z 2025-05-07T19:46:14.7159048Z 2025-05-07T19:46:14.7159053Z 2025-05-07T19:46:14.7159057Z 2025-05-07T19:46:14.7195506Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########4 | 84%  2025-05-07T19:46:14.7534393Z nsight-compute-2024. | 443.1 MB | ######6 | 66% 2025-05-07T19:46:14.7534892Z 2025-05-07T19:46:14.7535087Z 2025-05-07T19:46:14.7535091Z 2025-05-07T19:46:14.7535120Z 2025-05-07T19:46:14.7535151Z 2025-05-07T19:46:14.7654438Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:46:14.7654773Z 2025-05-07T19:46:14.7654778Z 2025-05-07T19:46:14.7654782Z 2025-05-07T19:46:14.7654786Z 2025-05-07T19:46:14.7654790Z 2025-05-07T19:46:14.7654794Z 2025-05-07T19:46:14.7654797Z 2025-05-07T19:46:14.7951468Z libnpp-12.3.1.54 | 93.4 MB | ########4 | 85%  2025-05-07T19:46:14.7951857Z 2025-05-07T19:46:14.7952161Z 2025-05-07T19:46:14.7952166Z 2025-05-07T19:46:14.7952170Z 2025-05-07T19:46:14.7952174Z 2025-05-07T19:46:14.7952179Z 2025-05-07T19:46:14.7952183Z 2025-05-07T19:46:14.7952188Z 2025-05-07T19:46:14.7952193Z 2025-05-07T19:46:14.8260067Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:46:14.8278522Z nsight-compute-2024. | 443.1 MB | ######7 | 68% 2025-05-07T19:46:14.8278865Z 2025-05-07T19:46:14.8278871Z 2025-05-07T19:46:14.8278875Z 2025-05-07T19:46:14.8278878Z 2025-05-07T19:46:14.8278882Z 2025-05-07T19:46:14.8278885Z 2025-05-07T19:46:14.8278904Z 2025-05-07T19:46:14.8278908Z 2025-05-07T19:46:14.8825837Z cuda-nvdisasm-12.6.7 | 47.6 MB | #########7 | 97%  2025-05-07T19:46:14.8826206Z 2025-05-07T19:46:14.8826212Z 2025-05-07T19:46:14.8826217Z 2025-05-07T19:46:14.8826221Z 2025-05-07T19:46:14.8826226Z 2025-05-07T19:46:14.8826232Z 2025-05-07T19:46:14.8826252Z 2025-05-07T19:46:14.8950494Z libnpp-12.3.1.54 | 93.4 MB | #########1 | 91%  2025-05-07T19:46:14.8950817Z 2025-05-07T19:46:14.8950823Z 2025-05-07T19:46:14.8950826Z 2025-05-07T19:46:14.8950830Z 2025-05-07T19:46:14.8950834Z 2025-05-07T19:46:14.8950837Z 2025-05-07T19:46:14.8950855Z 2025-05-07T19:46:14.8950859Z 2025-05-07T19:46:14.8950862Z 2025-05-07T19:46:14.9376174Z libcurand-10.3.7.77 | 39.9 MB | #2 | 13%  2025-05-07T19:46:14.9827532Z nsight-compute-2024. | 443.1 MB | ######8 | 69% 2025-05-07T19:46:14.9827831Z 2025-05-07T19:46:14.9827837Z 2025-05-07T19:46:14.9827842Z 2025-05-07T19:46:14.9827846Z 2025-05-07T19:46:14.9827850Z 2025-05-07T19:46:14.9827853Z 2025-05-07T19:46:14.9827866Z 2025-05-07T19:46:14.9950687Z libnpp-12.3.1.54 | 93.4 MB | #########7 | 98%  2025-05-07T19:46:14.9951022Z 2025-05-07T19:46:14.9951027Z 2025-05-07T19:46:14.9951031Z 2025-05-07T19:46:14.9951035Z 2025-05-07T19:46:14.9951040Z 2025-05-07T19:46:14.9951044Z 2025-05-07T19:46:14.9951081Z 2025-05-07T19:46:14.9951086Z 2025-05-07T19:46:14.9951090Z 2025-05-07T19:46:15.0905562Z libcurand-10.3.7.77 | 39.9 MB | ###1 | 31%  2025-05-07T19:46:15.0905922Z 2025-05-07T19:46:15.0905928Z 2025-05-07T19:46:15.0905947Z 2025-05-07T19:46:15.0953042Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:46:15.0953382Z 2025-05-07T19:46:15.0953647Z 2025-05-07T19:46:15.0953651Z 2025-05-07T19:46:15.0953654Z 2025-05-07T19:46:15.0953658Z 2025-05-07T19:46:15.0953662Z 2025-05-07T19:46:15.0953666Z 2025-05-07T19:46:15.0953669Z 2025-05-07T19:46:15.0953673Z 2025-05-07T19:46:15.1955145Z libcurand-10.3.7.77 | 39.9 MB | ##### | 50%  2025-05-07T19:46:15.1955541Z 2025-05-07T19:46:15.1955546Z 2025-05-07T19:46:15.1955551Z 2025-05-07T19:46:15.1955556Z 2025-05-07T19:46:15.1955561Z 2025-05-07T19:46:15.1955567Z 2025-05-07T19:46:15.1955572Z 2025-05-07T19:46:15.1955578Z 2025-05-07T19:46:15.1955583Z 2025-05-07T19:46:15.2415558Z libcurand-10.3.7.77 | 39.9 MB | #######8 | 78%  2025-05-07T19:46:15.3422415Z nsight-compute-2024. | 443.1 MB | ####### | 70% 2025-05-07T19:46:15.4487466Z nsight-compute-2024. | 443.1 MB | #######1 | 71% 2025-05-07T19:46:15.4627073Z nsight-compute-2024. | 443.1 MB | #######2 | 73% 2025-05-07T19:46:15.4627375Z 2025-05-07T19:46:15.4627381Z 2025-05-07T19:46:15.4627420Z 2025-05-07T19:46:15.4627424Z 2025-05-07T19:46:15.4627427Z 2025-05-07T19:46:15.4627431Z 2025-05-07T19:46:15.4627435Z 2025-05-07T19:46:15.4627439Z 2025-05-07T19:46:15.5130871Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:46:15.5131224Z 2025-05-07T19:46:15.5131253Z 2025-05-07T19:46:15.5131258Z 2025-05-07T19:46:15.5131262Z 2025-05-07T19:46:15.5131267Z 2025-05-07T19:46:15.5131271Z 2025-05-07T19:46:15.5131277Z 2025-05-07T19:46:15.5131283Z 2025-05-07T19:46:15.5131302Z 2025-05-07T19:46:15.5131306Z 2025-05-07T19:46:15.5739419Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:46:15.6131943Z nsight-compute-2024. | 443.1 MB | #######3 | 74% 2025-05-07T19:46:15.6132242Z 2025-05-07T19:46:15.6132247Z 2025-05-07T19:46:15.6132251Z 2025-05-07T19:46:15.6132254Z 2025-05-07T19:46:15.6132259Z 2025-05-07T19:46:15.6132265Z 2025-05-07T19:46:15.6132270Z 2025-05-07T19:46:15.6132278Z 2025-05-07T19:46:15.6132316Z 2025-05-07T19:46:15.6132319Z 2025-05-07T19:46:15.6762783Z gds-tools-1.11.1.6 | 37.8 MB | #4 | 15%  2025-05-07T19:46:15.7131097Z nsight-compute-2024. | 443.1 MB | #######5 | 75% 2025-05-07T19:46:15.7131421Z 2025-05-07T19:46:15.7131566Z 2025-05-07T19:46:15.7131576Z 2025-05-07T19:46:15.7131581Z 2025-05-07T19:46:15.7131588Z 2025-05-07T19:46:15.7131614Z 2025-05-07T19:46:15.7131619Z 2025-05-07T19:46:15.7131624Z 2025-05-07T19:46:15.7131628Z 2025-05-07T19:46:15.7131633Z 2025-05-07T19:46:15.7187957Z gds-tools-1.11.1.6 | 37.8 MB | ####1 | 42%  2025-05-07T19:46:15.7188277Z 2025-05-07T19:46:15.7188283Z 2025-05-07T19:46:15.7188288Z 2025-05-07T19:46:15.7188293Z 2025-05-07T19:46:15.7188296Z 2025-05-07T19:46:15.7188300Z 2025-05-07T19:46:15.7188303Z 2025-05-07T19:46:15.7188307Z 2025-05-07T19:46:15.7188335Z 2025-05-07T19:46:15.7191128Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%  2025-05-07T19:46:15.7191442Z 2025-05-07T19:46:15.7191454Z 2025-05-07T19:46:15.7191458Z 2025-05-07T19:46:15.7191461Z 2025-05-07T19:46:15.7191465Z 2025-05-07T19:46:15.7191469Z 2025-05-07T19:46:15.7191472Z 2025-05-07T19:46:15.7191488Z 2025-05-07T19:46:15.7191731Z 2025-05-07T19:46:15.7520400Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%  2025-05-07T19:46:15.7521275Z 2025-05-07T19:46:15.7521282Z 2025-05-07T19:46:15.7521288Z 2025-05-07T19:46:15.7521293Z 2025-05-07T19:46:15.7521325Z 2025-05-07T19:46:15.7521333Z 2025-05-07T19:46:15.7521383Z 2025-05-07T19:46:15.7521388Z 2025-05-07T19:46:15.7521393Z 2025-05-07T19:46:15.7521398Z 2025-05-07T19:46:15.7521403Z 2025-05-07T19:46:15.8133013Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:46:15.8133649Z 2025-05-07T19:46:15.8133663Z 2025-05-07T19:46:15.8133669Z 2025-05-07T19:46:15.8133675Z 2025-05-07T19:46:15.8133681Z 2025-05-07T19:46:15.8133686Z 2025-05-07T19:46:15.8133964Z 2025-05-07T19:46:15.8133970Z 2025-05-07T19:46:15.8133975Z 2025-05-07T19:46:15.8133982Z 2025-05-07T19:46:15.8482881Z gds-tools-1.11.1.6 | 37.8 MB | ######6 | 66%  2025-05-07T19:46:15.8520044Z nsight-compute-2024. | 443.1 MB | #######6 | 76% 2025-05-07T19:46:15.8520382Z 2025-05-07T19:46:15.8520387Z 2025-05-07T19:46:15.8520393Z 2025-05-07T19:46:15.8520398Z 2025-05-07T19:46:15.8520402Z 2025-05-07T19:46:15.8520407Z 2025-05-07T19:46:15.8520413Z 2025-05-07T19:46:15.8520419Z 2025-05-07T19:46:15.8520423Z 2025-05-07T19:46:15.8520468Z 2025-05-07T19:46:15.8520473Z 2025-05-07T19:46:15.9133635Z cuda-nvcc-tools-12.6 | 23.0 MB | ###5 | 35%  2025-05-07T19:46:15.9133984Z 2025-05-07T19:46:15.9134125Z 2025-05-07T19:46:15.9134130Z 2025-05-07T19:46:15.9134307Z 2025-05-07T19:46:15.9134319Z 2025-05-07T19:46:15.9134325Z 2025-05-07T19:46:15.9134331Z 2025-05-07T19:46:15.9134336Z 2025-05-07T19:46:15.9134379Z 2025-05-07T19:46:15.9134384Z 2025-05-07T19:46:15.9522263Z gds-tools-1.11.1.6 | 37.8 MB | ########7 | 88%  2025-05-07T19:46:15.9522674Z 2025-05-07T19:46:15.9522680Z 2025-05-07T19:46:15.9522685Z 2025-05-07T19:46:15.9522689Z 2025-05-07T19:46:15.9522694Z 2025-05-07T19:46:15.9522699Z 2025-05-07T19:46:15.9522702Z 2025-05-07T19:46:15.9522707Z 2025-05-07T19:46:15.9522712Z 2025-05-07T19:46:15.9522716Z 2025-05-07T19:46:15.9522721Z 2025-05-07T19:46:15.9540334Z cuda-nvcc-tools-12.6 | 23.0 MB | #######1 | 72%  2025-05-07T19:46:16.0896717Z nsight-compute-2024. | 443.1 MB | #######7 | 77% 2025-05-07T19:46:16.0897073Z 2025-05-07T19:46:16.0897078Z 2025-05-07T19:46:16.0897113Z 2025-05-07T19:46:16.0897118Z 2025-05-07T19:46:16.0897138Z 2025-05-07T19:46:16.0897142Z 2025-05-07T19:46:16.0897147Z 2025-05-07T19:46:16.1342458Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%  2025-05-07T19:46:16.1342783Z 2025-05-07T19:46:16.1342825Z 2025-05-07T19:46:16.1342829Z 2025-05-07T19:46:16.1342833Z 2025-05-07T19:46:16.1342836Z 2025-05-07T19:46:16.1342854Z 2025-05-07T19:46:16.1342858Z 2025-05-07T19:46:16.1342861Z 2025-05-07T19:46:16.1342865Z 2025-05-07T19:46:16.1342868Z 2025-05-07T19:46:16.1342872Z 2025-05-07T19:46:16.1342875Z 2025-05-07T19:46:16.1427114Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%  2025-05-07T19:46:16.2264894Z nsight-compute-2024. | 443.1 MB | #######8 | 78% 2025-05-07T19:46:16.2265191Z 2025-05-07T19:46:16.2265197Z 2025-05-07T19:46:16.2345144Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:46:16.2345980Z 2025-05-07T19:46:16.2345994Z 2025-05-07T19:46:16.2346006Z 2025-05-07T19:46:16.2346016Z 2025-05-07T19:46:16.2346028Z 2025-05-07T19:46:16.2346039Z 2025-05-07T19:46:16.2346050Z 2025-05-07T19:46:16.2346061Z 2025-05-07T19:46:16.2346071Z 2025-05-07T19:46:16.2346081Z 2025-05-07T19:46:16.2346092Z 2025-05-07T19:46:16.2346176Z 2025-05-07T19:46:16.2430832Z cuda-nvrtc-12.6.85 | 17.3 MB | ####2 | 42%  2025-05-07T19:46:16.2865871Z nsight-compute-2024. | 443.1 MB | #######9 | 79% 2025-05-07T19:46:16.2866201Z 2025-05-07T19:46:16.2866206Z 2025-05-07T19:46:16.2866210Z 2025-05-07T19:46:16.2866213Z 2025-05-07T19:46:16.2866217Z 2025-05-07T19:46:16.2866220Z 2025-05-07T19:46:16.2866224Z 2025-05-07T19:46:16.2866406Z 2025-05-07T19:46:16.2866420Z 2025-05-07T19:46:16.2866426Z 2025-05-07T19:46:16.2866433Z 2025-05-07T19:46:16.2867280Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%  2025-05-07T19:46:16.2867635Z 2025-05-07T19:46:16.2867638Z 2025-05-07T19:46:16.2867642Z 2025-05-07T19:46:16.2867645Z 2025-05-07T19:46:16.2867649Z 2025-05-07T19:46:16.2867652Z 2025-05-07T19:46:16.2867671Z 2025-05-07T19:46:16.2867675Z 2025-05-07T19:46:16.2867680Z 2025-05-07T19:46:16.2867684Z 2025-05-07T19:46:16.2867688Z 2025-05-07T19:46:16.3186283Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%  2025-05-07T19:46:16.3186936Z 2025-05-07T19:46:16.3186941Z 2025-05-07T19:46:16.3186945Z 2025-05-07T19:46:16.3186948Z 2025-05-07T19:46:16.3186952Z 2025-05-07T19:46:16.3186955Z 2025-05-07T19:46:16.3186959Z 2025-05-07T19:46:16.3186962Z 2025-05-07T19:46:16.3186966Z 2025-05-07T19:46:16.3186983Z 2025-05-07T19:46:16.3186986Z 2025-05-07T19:46:16.3186989Z 2025-05-07T19:46:16.3186993Z 2025-05-07T19:46:16.3345444Z libnvjitlink-12.6.85 | 14.9 MB | | 0%  2025-05-07T19:46:16.3345844Z 2025-05-07T19:46:16.3345850Z 2025-05-07T19:46:16.3345867Z 2025-05-07T19:46:16.3345871Z 2025-05-07T19:46:16.3345876Z 2025-05-07T19:46:16.3345879Z 2025-05-07T19:46:16.3345884Z 2025-05-07T19:46:16.3345887Z 2025-05-07T19:46:16.3345891Z 2025-05-07T19:46:16.3345894Z 2025-05-07T19:46:16.3345898Z 2025-05-07T19:46:16.3345901Z 2025-05-07T19:46:16.3441955Z cuda-nvrtc-12.6.85 | 17.3 MB | #######2 | 73%  2025-05-07T19:46:16.4186311Z nsight-compute-2024. | 443.1 MB | ######## | 81% 2025-05-07T19:46:16.4186627Z 2025-05-07T19:46:16.4186663Z 2025-05-07T19:46:16.4186668Z 2025-05-07T19:46:16.4186673Z 2025-05-07T19:46:16.4186677Z 2025-05-07T19:46:16.4186683Z 2025-05-07T19:46:16.4186687Z 2025-05-07T19:46:16.4186707Z 2025-05-07T19:46:16.4186712Z 2025-05-07T19:46:16.4186717Z 2025-05-07T19:46:16.4186722Z 2025-05-07T19:46:16.4186728Z 2025-05-07T19:46:16.4186733Z 2025-05-07T19:46:16.4332032Z libnvjitlink-12.6.85 | 14.9 MB | ##### | 50%  2025-05-07T19:46:16.4332427Z 2025-05-07T19:46:16.4332432Z 2025-05-07T19:46:16.4332452Z 2025-05-07T19:46:16.4332456Z 2025-05-07T19:46:16.4332460Z 2025-05-07T19:46:16.4332465Z 2025-05-07T19:46:16.4332469Z 2025-05-07T19:46:16.4332474Z 2025-05-07T19:46:16.4332479Z 2025-05-07T19:46:16.4332483Z 2025-05-07T19:46:16.4877830Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%  2025-05-07T19:46:16.4878251Z 2025-05-07T19:46:16.4878298Z 2025-05-07T19:46:16.4878302Z 2025-05-07T19:46:16.4878306Z 2025-05-07T19:46:16.4878310Z 2025-05-07T19:46:16.4878314Z 2025-05-07T19:46:16.4878317Z 2025-05-07T19:46:16.4878321Z 2025-05-07T19:46:16.4878325Z 2025-05-07T19:46:16.4878329Z 2025-05-07T19:46:16.4878332Z 2025-05-07T19:46:16.4878335Z 2025-05-07T19:46:16.4878339Z 2025-05-07T19:46:16.4878343Z 2025-05-07T19:46:16.5129177Z cuda-nvcc-dev_linux- | 10.8 MB | | 0%  2025-05-07T19:46:16.5421221Z nsight-compute-2024. | 443.1 MB | ########1 | 82% 2025-05-07T19:46:16.5421546Z 2025-05-07T19:46:16.5421552Z 2025-05-07T19:46:16.5421555Z 2025-05-07T19:46:16.5421574Z 2025-05-07T19:46:16.5421578Z 2025-05-07T19:46:16.5421581Z 2025-05-07T19:46:16.5879125Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:46:16.5879510Z 2025-05-07T19:46:16.5879516Z 2025-05-07T19:46:16.5879556Z 2025-05-07T19:46:16.5879560Z 2025-05-07T19:46:16.5879580Z 2025-05-07T19:46:16.5879585Z 2025-05-07T19:46:16.5879588Z 2025-05-07T19:46:16.5879591Z 2025-05-07T19:46:16.5879595Z 2025-05-07T19:46:16.5879598Z 2025-05-07T19:46:16.5879602Z 2025-05-07T19:46:16.5879605Z 2025-05-07T19:46:16.5879609Z 2025-05-07T19:46:16.5879612Z 2025-05-07T19:46:16.5928859Z cuda-nvcc-dev_linux- | 10.8 MB | ###7 | 38%  2025-05-07T19:46:16.5929233Z 2025-05-07T19:46:16.6016554Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%  2025-05-07T19:46:16.6016898Z 2025-05-07T19:46:16.6016903Z 2025-05-07T19:46:16.6016907Z 2025-05-07T19:46:16.6016910Z 2025-05-07T19:46:16.6016914Z 2025-05-07T19:46:16.6016933Z 2025-05-07T19:46:16.6016936Z 2025-05-07T19:46:16.6016940Z 2025-05-07T19:46:16.6016943Z 2025-05-07T19:46:16.6016946Z 2025-05-07T19:46:16.6016950Z 2025-05-07T19:46:16.6016953Z 2025-05-07T19:46:16.6130051Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%  2025-05-07T19:46:16.6350386Z nsight-compute-2024. | 443.1 MB | ########3 | 83% 2025-05-07T19:46:16.6350688Z 2025-05-07T19:46:16.6350693Z 2025-05-07T19:46:16.6350698Z 2025-05-07T19:46:16.6350703Z 2025-05-07T19:46:16.6350708Z 2025-05-07T19:46:16.6350713Z 2025-05-07T19:46:16.6350719Z 2025-05-07T19:46:16.6350723Z 2025-05-07T19:46:16.6350729Z 2025-05-07T19:46:16.6350734Z 2025-05-07T19:46:16.6350755Z 2025-05-07T19:46:16.6350760Z 2025-05-07T19:46:16.6350765Z 2025-05-07T19:46:16.6351252Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:46:16.6351580Z 2025-05-07T19:46:16.6351585Z 2025-05-07T19:46:16.6351589Z 2025-05-07T19:46:16.6351592Z 2025-05-07T19:46:16.6351595Z 2025-05-07T19:46:16.6351599Z 2025-05-07T19:46:16.6351602Z 2025-05-07T19:46:16.6351619Z 2025-05-07T19:46:16.6604073Z 2025-05-07T19:46:16.6604087Z 2025-05-07T19:46:16.6604092Z 2025-05-07T19:46:16.6604098Z 2025-05-07T19:46:16.6604103Z 2025-05-07T19:46:16.6604821Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:46:16.6605191Z 2025-05-07T19:46:16.6605196Z 2025-05-07T19:46:16.6605200Z 2025-05-07T19:46:16.6605204Z 2025-05-07T19:46:16.6605207Z 2025-05-07T19:46:16.6605211Z 2025-05-07T19:46:16.6605214Z 2025-05-07T19:46:16.6605218Z 2025-05-07T19:46:16.6661406Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:46:16.6661742Z 2025-05-07T19:46:16.6661747Z 2025-05-07T19:46:16.6661751Z 2025-05-07T19:46:16.6661755Z 2025-05-07T19:46:16.6661991Z 2025-05-07T19:46:16.6661996Z 2025-05-07T19:46:16.6661999Z 2025-05-07T19:46:16.6662003Z 2025-05-07T19:46:16.6662006Z 2025-05-07T19:46:16.6662023Z 2025-05-07T19:46:16.6662027Z 2025-05-07T19:46:16.6662032Z 2025-05-07T19:46:16.6662035Z 2025-05-07T19:46:16.6662051Z 2025-05-07T19:46:16.6662054Z 2025-05-07T19:46:16.6662057Z 2025-05-07T19:46:16.6663286Z cuda-sanitizer-api-1 | 8.9 MB | | 0%  2025-05-07T19:46:16.6663664Z 2025-05-07T19:46:16.6663668Z 2025-05-07T19:46:16.6663671Z 2025-05-07T19:46:16.6663675Z 2025-05-07T19:46:16.6663678Z 2025-05-07T19:46:16.6663681Z 2025-05-07T19:46:16.6663685Z 2025-05-07T19:46:16.6663688Z 2025-05-07T19:46:16.6663691Z 2025-05-07T19:46:16.6663695Z 2025-05-07T19:46:16.6663698Z 2025-05-07T19:46:16.6663701Z 2025-05-07T19:46:16.6663705Z 2025-05-07T19:46:16.6663708Z 2025-05-07T19:46:16.6663711Z 2025-05-07T19:46:16.6663714Z 2025-05-07T19:46:16.6664559Z 2025-05-07T19:46:16.6677139Z cuda-nvvm-impl-12.6. | 7.7 MB | | 0%  2025-05-07T19:46:16.6677680Z 2025-05-07T19:46:16.6678130Z 2025-05-07T19:46:16.6678201Z 2025-05-07T19:46:16.6678223Z 2025-05-07T19:46:16.6678239Z 2025-05-07T19:46:16.6678243Z 2025-05-07T19:46:16.6678247Z 2025-05-07T19:46:16.6678282Z 2025-05-07T19:46:16.6678286Z 2025-05-07T19:46:16.6678319Z 2025-05-07T19:46:16.6678350Z 2025-05-07T19:46:16.6678390Z 2025-05-07T19:46:16.6678395Z 2025-05-07T19:46:16.6678399Z 2025-05-07T19:46:16.6678419Z 2025-05-07T19:46:16.6911523Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%  2025-05-07T19:46:16.6911887Z 2025-05-07T19:46:16.6911891Z 2025-05-07T19:46:16.6911895Z 2025-05-07T19:46:16.6911913Z 2025-05-07T19:46:16.6911916Z 2025-05-07T19:46:16.6911921Z 2025-05-07T19:46:16.6911926Z 2025-05-07T19:46:16.6911929Z 2025-05-07T19:46:16.6911946Z 2025-05-07T19:46:16.6911950Z 2025-05-07T19:46:16.6911955Z 2025-05-07T19:46:16.6911980Z 2025-05-07T19:46:16.6911983Z 2025-05-07T19:46:16.6911988Z 2025-05-07T19:46:16.7430019Z cuda-nvcc-dev_linux- | 10.8 MB | #######9 | 79%  2025-05-07T19:46:16.7661875Z nsight-compute-2024. | 443.1 MB | ########4 | 85% 2025-05-07T19:46:16.7662181Z 2025-05-07T19:46:16.7662187Z 2025-05-07T19:46:16.7662207Z 2025-05-07T19:46:16.7662213Z 2025-05-07T19:46:16.7662488Z 2025-05-07T19:46:16.7662491Z 2025-05-07T19:46:16.7662495Z 2025-05-07T19:46:16.7662498Z 2025-05-07T19:46:16.7662502Z 2025-05-07T19:46:16.7662506Z 2025-05-07T19:46:16.7662509Z 2025-05-07T19:46:16.7662513Z 2025-05-07T19:46:16.7662518Z 2025-05-07T19:46:16.7662521Z 2025-05-07T19:46:16.7662525Z 2025-05-07T19:46:16.7662528Z 2025-05-07T19:46:16.7678889Z cuda-sanitizer-api-1 | 8.9 MB | #6 | 17%  2025-05-07T19:46:16.7679293Z 2025-05-07T19:46:16.7679298Z 2025-05-07T19:46:16.7679302Z 2025-05-07T19:46:16.7679306Z 2025-05-07T19:46:16.7679342Z 2025-05-07T19:46:16.7679346Z 2025-05-07T19:46:16.7679349Z 2025-05-07T19:46:16.7679353Z 2025-05-07T19:46:16.7679356Z 2025-05-07T19:46:16.7679360Z 2025-05-07T19:46:16.7679364Z 2025-05-07T19:46:16.7679367Z 2025-05-07T19:46:16.7679370Z 2025-05-07T19:46:16.7679374Z 2025-05-07T19:46:16.7679377Z 2025-05-07T19:46:16.8252545Z cuda-nvvm-tools-12.6 | 10.4 MB | ######8 | 68%  2025-05-07T19:46:16.8252955Z 2025-05-07T19:46:16.8252960Z 2025-05-07T19:46:16.8252964Z 2025-05-07T19:46:16.8252968Z 2025-05-07T19:46:16.8252971Z 2025-05-07T19:46:16.8252975Z 2025-05-07T19:46:16.8252978Z 2025-05-07T19:46:16.8252982Z 2025-05-07T19:46:16.8252986Z 2025-05-07T19:46:16.8253002Z 2025-05-07T19:46:16.8253005Z 2025-05-07T19:46:16.8253009Z 2025-05-07T19:46:16.8253012Z 2025-05-07T19:46:16.8253016Z 2025-05-07T19:46:16.8253036Z 2025-05-07T19:46:16.8253041Z 2025-05-07T19:46:16.8253044Z 2025-05-07T19:46:16.8253611Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%  2025-05-07T19:46:16.8253990Z 2025-05-07T19:46:16.8253994Z 2025-05-07T19:46:16.8253997Z 2025-05-07T19:46:16.8254001Z 2025-05-07T19:46:16.8254004Z 2025-05-07T19:46:16.8254008Z 2025-05-07T19:46:16.8254012Z 2025-05-07T19:46:16.8254015Z 2025-05-07T19:46:16.8254018Z 2025-05-07T19:46:16.8254022Z 2025-05-07T19:46:16.8254025Z 2025-05-07T19:46:16.8254034Z 2025-05-07T19:46:16.8254038Z 2025-05-07T19:46:16.8254041Z 2025-05-07T19:46:16.8254070Z 2025-05-07T19:46:16.8254073Z 2025-05-07T19:46:16.8254076Z 2025-05-07T19:46:16.8268430Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%  2025-05-07T19:46:16.8268775Z 2025-05-07T19:46:16.8268779Z 2025-05-07T19:46:16.8268782Z 2025-05-07T19:46:16.8268799Z 2025-05-07T19:46:16.8268829Z 2025-05-07T19:46:16.8268833Z 2025-05-07T19:46:16.8268836Z 2025-05-07T19:46:16.8268840Z 2025-05-07T19:46:16.8268844Z 2025-05-07T19:46:16.8268847Z 2025-05-07T19:46:16.8268860Z 2025-05-07T19:46:16.8268864Z 2025-05-07T19:46:16.8268867Z 2025-05-07T19:46:16.8269207Z 2025-05-07T19:46:16.8582428Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%  2025-05-07T19:46:16.8582874Z 2025-05-07T19:46:16.8582881Z 2025-05-07T19:46:16.8582887Z 2025-05-07T19:46:16.8582893Z 2025-05-07T19:46:16.8582896Z 2025-05-07T19:46:16.8582901Z 2025-05-07T19:46:16.8582938Z 2025-05-07T19:46:16.8582943Z 2025-05-07T19:46:16.8582946Z 2025-05-07T19:46:16.8582950Z 2025-05-07T19:46:16.8582954Z 2025-05-07T19:46:16.8582958Z 2025-05-07T19:46:16.8582962Z 2025-05-07T19:46:16.8582966Z 2025-05-07T19:46:16.8582969Z 2025-05-07T19:46:16.8582973Z 2025-05-07T19:46:16.8582977Z 2025-05-07T19:46:16.8582981Z 2025-05-07T19:46:16.8656018Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%  2025-05-07T19:46:16.8656406Z 2025-05-07T19:46:16.8656412Z 2025-05-07T19:46:16.8656415Z 2025-05-07T19:46:16.8656420Z 2025-05-07T19:46:16.8656443Z 2025-05-07T19:46:16.8656446Z 2025-05-07T19:46:16.8656450Z 2025-05-07T19:46:16.8656453Z 2025-05-07T19:46:16.8656457Z 2025-05-07T19:46:16.8656485Z 2025-05-07T19:46:16.8656536Z 2025-05-07T19:46:16.8656540Z 2025-05-07T19:46:16.8656571Z 2025-05-07T19:46:16.8656575Z 2025-05-07T19:46:16.8656607Z 2025-05-07T19:46:16.8656740Z 2025-05-07T19:46:16.8656749Z 2025-05-07T19:46:16.8656754Z 2025-05-07T19:46:16.8657001Z 2025-05-07T19:46:16.8668693Z ... (more hidden) ... 2025-05-07T19:46:16.8669178Z 2025-05-07T19:46:16.8669182Z 2025-05-07T19:46:16.8669186Z 2025-05-07T19:46:16.8669190Z 2025-05-07T19:46:16.8669193Z 2025-05-07T19:46:16.8669197Z 2025-05-07T19:46:16.8669200Z 2025-05-07T19:46:16.8669204Z 2025-05-07T19:46:16.8669208Z 2025-05-07T19:46:16.8669211Z 2025-05-07T19:46:16.8669215Z 2025-05-07T19:46:16.8669219Z 2025-05-07T19:46:16.8669234Z 2025-05-07T19:46:16.8669238Z 2025-05-07T19:46:16.8669241Z 2025-05-07T19:46:16.8669489Z 2025-05-07T19:46:16.8689024Z cuda-sanitizer-api-1 | 8.9 MB | ###2 | 32%  2025-05-07T19:46:16.9529566Z nsight-compute-2024. | 443.1 MB | ########5 | 86% 2025-05-07T19:46:16.9529870Z 2025-05-07T19:46:16.9529943Z 2025-05-07T19:46:16.9529949Z 2025-05-07T19:46:16.9529952Z 2025-05-07T19:46:16.9529956Z 2025-05-07T19:46:16.9529961Z 2025-05-07T19:46:16.9529999Z 2025-05-07T19:46:16.9530003Z 2025-05-07T19:46:16.9530007Z 2025-05-07T19:46:16.9530011Z 2025-05-07T19:46:16.9530016Z 2025-05-07T19:46:16.9530020Z 2025-05-07T19:46:16.9530024Z 2025-05-07T19:46:16.9530028Z 2025-05-07T19:46:16.9530031Z 2025-05-07T19:46:16.9530035Z 2025-05-07T19:46:16.9530039Z 2025-05-07T19:46:16.9530043Z 2025-05-07T19:46:16.9530047Z 2025-05-07T19:46:16.9603583Z ... (more hidden) ... 2025-05-07T19:46:16.9603935Z 2025-05-07T19:46:16.9603940Z 2025-05-07T19:46:16.9603944Z 2025-05-07T19:46:16.9604203Z 2025-05-07T19:46:16.9604208Z 2025-05-07T19:46:16.9604211Z 2025-05-07T19:46:16.9604215Z 2025-05-07T19:46:16.9604218Z 2025-05-07T19:46:16.9604222Z 2025-05-07T19:46:16.9604250Z 2025-05-07T19:46:16.9604254Z 2025-05-07T19:46:16.9604257Z 2025-05-07T19:46:16.9604261Z 2025-05-07T19:46:16.9604264Z 2025-05-07T19:46:16.9604267Z 2025-05-07T19:46:16.9604271Z 2025-05-07T19:46:16.9604274Z 2025-05-07T19:46:16.9604284Z 2025-05-07T19:46:16.9604652Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%  2025-05-07T19:46:16.9605031Z 2025-05-07T19:46:16.9605035Z 2025-05-07T19:46:16.9605038Z 2025-05-07T19:46:16.9605042Z 2025-05-07T19:46:16.9605045Z 2025-05-07T19:46:16.9605048Z 2025-05-07T19:46:16.9605052Z 2025-05-07T19:46:16.9605055Z 2025-05-07T19:46:16.9605058Z 2025-05-07T19:46:16.9605062Z 2025-05-07T19:46:16.9605065Z 2025-05-07T19:46:16.9605069Z 2025-05-07T19:46:16.9605072Z 2025-05-07T19:46:16.9605076Z 2025-05-07T19:46:16.9605079Z 2025-05-07T19:46:16.9605087Z 2025-05-07T19:46:16.9605091Z 2025-05-07T19:46:16.9605094Z 2025-05-07T19:46:16.9690172Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%  2025-05-07T19:46:16.9690568Z 2025-05-07T19:46:16.9690574Z 2025-05-07T19:46:16.9690577Z 2025-05-07T19:46:16.9690581Z 2025-05-07T19:46:16.9690584Z 2025-05-07T19:46:16.9690588Z 2025-05-07T19:46:16.9690609Z 2025-05-07T19:46:16.9690613Z 2025-05-07T19:46:16.9690616Z 2025-05-07T19:46:16.9690619Z 2025-05-07T19:46:16.9690623Z 2025-05-07T19:46:16.9690626Z 2025-05-07T19:46:16.9690630Z 2025-05-07T19:46:16.9690658Z 2025-05-07T19:46:16.9690661Z 2025-05-07T19:46:16.9729242Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%  2025-05-07T19:46:17.0481078Z nsight-compute-2024. | 443.1 MB | ########6 | 87% 2025-05-07T19:46:17.0481378Z 2025-05-07T19:46:17.0481383Z 2025-05-07T19:46:17.0481388Z 2025-05-07T19:46:17.0481392Z 2025-05-07T19:46:17.0481431Z 2025-05-07T19:46:17.0481437Z 2025-05-07T19:46:17.0481441Z 2025-05-07T19:46:17.0481447Z 2025-05-07T19:46:17.0481451Z 2025-05-07T19:46:17.0481456Z 2025-05-07T19:46:17.0481461Z 2025-05-07T19:46:17.0481465Z 2025-05-07T19:46:17.0481469Z 2025-05-07T19:46:17.0481497Z 2025-05-07T19:46:17.0481502Z 2025-05-07T19:46:17.0481505Z 2025-05-07T19:46:17.0482048Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%  2025-05-07T19:46:17.0482656Z 2025-05-07T19:46:17.0482660Z 2025-05-07T19:46:17.0482663Z 2025-05-07T19:46:17.0482667Z 2025-05-07T19:46:17.0482670Z 2025-05-07T19:46:17.0482673Z 2025-05-07T19:46:17.0482677Z 2025-05-07T19:46:17.0482707Z 2025-05-07T19:46:17.0482711Z 2025-05-07T19:46:17.0482714Z 2025-05-07T19:46:17.0482718Z 2025-05-07T19:46:17.0482721Z 2025-05-07T19:46:17.0482724Z 2025-05-07T19:46:17.0482728Z 2025-05-07T19:46:17.0482731Z 2025-05-07T19:46:17.0482734Z 2025-05-07T19:46:17.0733223Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%  2025-05-07T19:46:17.0779136Z nsight-compute-2024. | 443.1 MB | ########8 | 88% 2025-05-07T19:46:17.0779672Z 2025-05-07T19:46:17.0779778Z 2025-05-07T19:46:17.0779784Z 2025-05-07T19:46:17.0779814Z 2025-05-07T19:46:17.0779877Z 2025-05-07T19:46:17.1731827Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:46:17.1901949Z nsight-compute-2024. | 443.1 MB | ########9 | 90% 2025-05-07T19:46:17.3309832Z 2025-05-07T19:46:17.3309848Z 2025-05-07T19:46:17.3309855Z 2025-05-07T19:46:17.3309860Z 2025-05-07T19:46:17.3309866Z 2025-05-07T19:46:17.3309872Z 2025-05-07T19:46:17.3309877Z 2025-05-07T19:46:17.3309881Z 2025-05-07T19:46:17.3309945Z 2025-05-07T19:46:17.3310690Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%  2025-05-07T19:46:17.4076183Z nsight-compute-2024. | 443.1 MB | #########1 | 91% 2025-05-07T19:46:17.4076525Z 2025-05-07T19:46:17.4076532Z 2025-05-07T19:46:17.4076538Z 2025-05-07T19:46:17.4076829Z 2025-05-07T19:46:17.4076835Z 2025-05-07T19:46:17.4076841Z 2025-05-07T19:46:17.4076847Z 2025-05-07T19:46:17.4076855Z 2025-05-07T19:46:17.4076860Z 2025-05-07T19:46:17.4076880Z 2025-05-07T19:46:17.4392012Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%  2025-05-07T19:46:17.5157285Z nsight-compute-2024. | 443.1 MB | #########2 | 93% 2025-05-07T19:46:17.5157647Z 2025-05-07T19:46:17.5157725Z 2025-05-07T19:46:17.5157729Z 2025-05-07T19:46:17.5157733Z 2025-05-07T19:46:17.5157737Z 2025-05-07T19:46:17.5157740Z 2025-05-07T19:46:17.5157744Z 2025-05-07T19:46:17.5157747Z 2025-05-07T19:46:17.5157750Z 2025-05-07T19:46:17.5157754Z 2025-05-07T19:46:17.5157758Z 2025-05-07T19:46:17.5682529Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%  2025-05-07T19:46:17.6849275Z nsight-compute-2024. | 443.1 MB | #########3 | 94% 2025-05-07T19:46:17.7826122Z nsight-compute-2024. | 443.1 MB | #########4 | 95% 2025-05-07T19:46:17.7826467Z 2025-05-07T19:46:17.7826474Z 2025-05-07T19:46:17.7826479Z 2025-05-07T19:46:17.7826482Z 2025-05-07T19:46:17.7826485Z 2025-05-07T19:46:17.7826491Z 2025-05-07T19:46:17.7826508Z 2025-05-07T19:46:17.7826512Z 2025-05-07T19:46:17.7826517Z 2025-05-07T19:46:17.7826522Z 2025-05-07T19:46:17.7826527Z 2025-05-07T19:46:17.7826531Z 2025-05-07T19:46:17.7863695Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%  2025-05-07T19:46:17.9652003Z nsight-compute-2024. | 443.1 MB | #########6 | 97% 2025-05-07T19:46:17.9687138Z nsight-compute-2024. | 443.1 MB | #########7 | 98% 2025-05-07T19:46:17.9687461Z 2025-05-07T19:46:17.9687467Z 2025-05-07T19:46:17.9687471Z 2025-05-07T19:46:17.9687475Z 2025-05-07T19:46:17.9687480Z 2025-05-07T19:46:17.9687483Z 2025-05-07T19:46:17.9687488Z 2025-05-07T19:46:17.9937247Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%  2025-05-07T19:46:17.9937613Z 2025-05-07T19:46:17.9937618Z 2025-05-07T19:46:17.9937658Z 2025-05-07T19:46:17.9937663Z 2025-05-07T19:46:17.9937668Z 2025-05-07T19:46:17.9937673Z 2025-05-07T19:46:17.9937678Z 2025-05-07T19:46:17.9937683Z 2025-05-07T19:46:17.9937686Z 2025-05-07T19:46:17.9937690Z 2025-05-07T19:46:17.9937714Z 2025-05-07T19:46:17.9937719Z 2025-05-07T19:46:17.9937731Z 2025-05-07T19:46:18.0741242Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:46:18.0741995Z 2025-05-07T19:46:18.0742000Z 2025-05-07T19:46:18.0742004Z 2025-05-07T19:46:18.0742008Z 2025-05-07T19:46:18.0742011Z 2025-05-07T19:46:18.0742014Z 2025-05-07T19:46:18.0742018Z 2025-05-07T19:46:18.0742022Z 2025-05-07T19:46:18.0742025Z 2025-05-07T19:46:18.0742050Z 2025-05-07T19:46:18.0742054Z 2025-05-07T19:46:18.0742057Z 2025-05-07T19:46:18.0742061Z 2025-05-07T19:46:18.0742064Z 2025-05-07T19:46:18.0742068Z 2025-05-07T19:46:18.0742071Z 2025-05-07T19:46:18.0742076Z 2025-05-07T19:46:18.1178538Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%  2025-05-07T19:46:18.1179020Z 2025-05-07T19:46:18.1179026Z 2025-05-07T19:46:18.1179034Z 2025-05-07T19:46:18.1179039Z 2025-05-07T19:46:18.1179042Z 2025-05-07T19:46:18.1179047Z 2025-05-07T19:46:18.1179051Z 2025-05-07T19:46:18.1179054Z 2025-05-07T19:46:18.1179057Z 2025-05-07T19:46:18.1179061Z 2025-05-07T19:46:18.1179064Z 2025-05-07T19:46:18.1179069Z 2025-05-07T19:46:18.1179090Z 2025-05-07T19:46:18.1179094Z 2025-05-07T19:46:18.1179097Z 2025-05-07T19:46:18.1179101Z 2025-05-07T19:46:18.1179104Z 2025-05-07T19:46:18.1179107Z 2025-05-07T19:46:18.1179111Z 2025-05-07T19:46:18.1179440Z ... (more hidden) ... 2025-05-07T19:46:18.1179855Z 2025-05-07T19:46:18.1179859Z 2025-05-07T19:46:18.1179862Z 2025-05-07T19:46:18.1179865Z 2025-05-07T19:46:18.1179869Z 2025-05-07T19:46:18.1179872Z 2025-05-07T19:46:18.1179875Z 2025-05-07T19:46:18.1179879Z 2025-05-07T19:46:18.1179882Z 2025-05-07T19:46:18.1180189Z 2025-05-07T19:46:18.1180194Z 2025-05-07T19:46:18.1180235Z 2025-05-07T19:46:18.1180239Z 2025-05-07T19:46:18.1180242Z 2025-05-07T19:46:18.1180245Z 2025-05-07T19:46:18.1180249Z 2025-05-07T19:46:18.1180252Z 2025-05-07T19:46:18.1180256Z 2025-05-07T19:46:18.1180259Z 2025-05-07T19:46:18.1260055Z ... (more hidden) ... 2025-05-07T19:46:18.1624629Z nsight-compute-2024. | 443.1 MB | #########8 | 99% 2025-05-07T19:46:18.1625056Z 2025-05-07T19:46:18.1625093Z 2025-05-07T19:46:18.1625122Z 2025-05-07T19:46:18.1625125Z 2025-05-07T19:46:18.1625129Z 2025-05-07T19:46:18.1625133Z 2025-05-07T19:46:18.1625136Z 2025-05-07T19:46:18.1625140Z 2025-05-07T19:46:18.1625143Z 2025-05-07T19:46:18.1625147Z 2025-05-07T19:46:18.1625150Z 2025-05-07T19:46:18.1625154Z 2025-05-07T19:46:18.1625157Z 2025-05-07T19:46:18.1625161Z 2025-05-07T19:46:18.2074061Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%  2025-05-07T19:46:18.2074546Z 2025-05-07T19:46:18.2074553Z 2025-05-07T19:46:18.2074558Z 2025-05-07T19:46:18.2074563Z 2025-05-07T19:46:18.2074571Z 2025-05-07T19:46:18.2074576Z 2025-05-07T19:46:18.2074583Z 2025-05-07T19:46:18.2074586Z 2025-05-07T19:46:18.2074590Z 2025-05-07T19:46:18.2074595Z 2025-05-07T19:46:18.2074600Z 2025-05-07T19:46:18.2074603Z 2025-05-07T19:46:18.2074608Z 2025-05-07T19:46:18.2074611Z 2025-05-07T19:46:18.2074636Z 2025-05-07T19:46:18.2074640Z 2025-05-07T19:46:18.2074661Z 2025-05-07T19:46:18.2074665Z 2025-05-07T19:46:18.3136617Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%  2025-05-07T19:46:18.3137061Z 2025-05-07T19:46:18.3137068Z 2025-05-07T19:46:18.3137073Z 2025-05-07T19:46:18.3137078Z 2025-05-07T19:46:18.3137084Z 2025-05-07T19:46:18.3137107Z 2025-05-07T19:46:18.3137112Z 2025-05-07T19:46:18.3137119Z 2025-05-07T19:46:18.3137124Z 2025-05-07T19:46:18.3137130Z 2025-05-07T19:46:18.3137136Z 2025-05-07T19:46:18.3137172Z 2025-05-07T19:46:18.3137178Z 2025-05-07T19:46:18.3137181Z 2025-05-07T19:46:18.3137186Z 2025-05-07T19:46:18.3635196Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%  2025-05-07T19:46:18.3635662Z 2025-05-07T19:46:18.3635669Z 2025-05-07T19:46:18.3635675Z 2025-05-07T19:46:18.3635680Z 2025-05-07T19:46:18.3635685Z 2025-05-07T19:46:18.3635690Z 2025-05-07T19:46:18.3636008Z 2025-05-07T19:46:18.3636013Z 2025-05-07T19:46:18.3636017Z 2025-05-07T19:46:18.3636021Z 2025-05-07T19:46:18.3636025Z 2025-05-07T19:46:18.3636028Z 2025-05-07T19:46:18.3636041Z 2025-05-07T19:46:18.3636045Z 2025-05-07T19:46:18.3636048Z 2025-05-07T19:46:18.3636051Z 2025-05-07T19:46:20.1558636Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%  2025-05-07T19:46:20.1559056Z 2025-05-07T19:46:20.4821351Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%  2025-05-07T19:46:20.4821816Z nsight-compute-2024. | 443.1 MB | ########## | 100% 2025-05-07T19:46:23.0623780Z nsight-compute-2024. | 443.1 MB | ########## | 100% 2025-05-07T19:46:23.0629158Z nsight-compute-2024. | 443.1 MB | ########## | 100% 2025-05-07T19:46:23.0629450Z 2025-05-07T19:46:23.0629455Z 2025-05-07T19:46:23.0629459Z 2025-05-07T19:46:23.0629463Z 2025-05-07T19:46:23.0629467Z 2025-05-07T19:46:23.0629706Z 2025-05-07T19:46:23.0629719Z 2025-05-07T19:46:23.0629724Z 2025-05-07T19:46:23.0629761Z 2025-05-07T19:46:23.0629765Z 2025-05-07T19:46:23.0629771Z 2025-05-07T19:46:23.0629778Z 2025-05-07T19:46:23.0629783Z 2025-05-07T19:46:23.0629787Z 2025-05-07T19:46:23.0629791Z 2025-05-07T19:46:23.0629834Z 2025-05-07T19:46:23.0629837Z 2025-05-07T19:46:23.0629842Z 2025-05-07T19:46:23.0629846Z 2025-05-07T19:46:23.0630057Z 2025-05-07T19:46:23.0630632Z  2025-05-07T19:46:23.0630988Z 2025-05-07T19:46:23.0631198Z 2025-05-07T19:46:23.0631651Z  2025-05-07T19:46:23.0631863Z 2025-05-07T19:46:23.0631867Z 2025-05-07T19:46:23.0632033Z  2025-05-07T19:46:23.0632259Z 2025-05-07T19:46:23.0632263Z 2025-05-07T19:46:23.0632267Z 2025-05-07T19:46:23.0632438Z  2025-05-07T19:46:23.0632670Z 2025-05-07T19:46:23.0632681Z 2025-05-07T19:46:23.0632684Z 2025-05-07T19:46:23.0632687Z 2025-05-07T19:46:23.0632858Z  2025-05-07T19:46:23.0633076Z 2025-05-07T19:46:23.0633080Z 2025-05-07T19:46:23.0633083Z 2025-05-07T19:46:23.0633086Z 2025-05-07T19:46:23.0633090Z 2025-05-07T19:46:23.0633278Z  2025-05-07T19:46:23.0633495Z 2025-05-07T19:46:23.0633498Z 2025-05-07T19:46:23.0633502Z 2025-05-07T19:46:23.0633505Z 2025-05-07T19:46:23.0633509Z 2025-05-07T19:46:23.0633516Z 2025-05-07T19:46:23.0633713Z  2025-05-07T19:46:23.0633933Z 2025-05-07T19:46:23.0633936Z 2025-05-07T19:46:23.0633940Z 2025-05-07T19:46:23.0633944Z 2025-05-07T19:46:23.0633947Z 2025-05-07T19:46:23.0633951Z 2025-05-07T19:46:23.0633954Z 2025-05-07T19:46:23.0634132Z  2025-05-07T19:46:23.0634373Z 2025-05-07T19:46:23.0634376Z 2025-05-07T19:46:23.0634380Z 2025-05-07T19:46:23.0634383Z 2025-05-07T19:46:23.0634386Z 2025-05-07T19:46:23.0634390Z 2025-05-07T19:46:23.0634393Z 2025-05-07T19:46:23.0634396Z 2025-05-07T19:46:23.0634578Z  2025-05-07T19:46:23.0634900Z 2025-05-07T19:46:23.0634903Z 2025-05-07T19:46:23.0634907Z 2025-05-07T19:46:23.0634910Z 2025-05-07T19:46:23.0634913Z 2025-05-07T19:46:23.0634916Z 2025-05-07T19:46:23.0634920Z 2025-05-07T19:46:23.0634924Z 2025-05-07T19:46:23.0634931Z 2025-05-07T19:46:23.0635145Z  2025-05-07T19:46:23.0635375Z 2025-05-07T19:46:23.0635379Z 2025-05-07T19:46:23.0635382Z 2025-05-07T19:46:23.0635385Z 2025-05-07T19:46:23.0635389Z 2025-05-07T19:46:23.0635392Z 2025-05-07T19:46:23.0635395Z 2025-05-07T19:46:23.0635399Z 2025-05-07T19:46:23.0635402Z 2025-05-07T19:46:23.0635537Z 2025-05-07T19:46:23.0635747Z  2025-05-07T19:46:23.0635977Z 2025-05-07T19:46:23.0635981Z 2025-05-07T19:46:23.0635984Z 2025-05-07T19:46:23.0635987Z 2025-05-07T19:46:23.0635991Z 2025-05-07T19:46:23.0635994Z 2025-05-07T19:46:23.0635997Z 2025-05-07T19:46:23.0636001Z 2025-05-07T19:46:23.0636004Z 2025-05-07T19:46:23.0636008Z 2025-05-07T19:46:23.0636011Z 2025-05-07T19:46:23.0636221Z  2025-05-07T19:46:23.0636452Z 2025-05-07T19:46:23.0636460Z 2025-05-07T19:46:23.0636464Z 2025-05-07T19:46:23.0636467Z 2025-05-07T19:46:23.0636470Z 2025-05-07T19:46:23.0636474Z 2025-05-07T19:46:23.0636477Z 2025-05-07T19:46:23.0636481Z 2025-05-07T19:46:23.0636484Z 2025-05-07T19:46:23.0636487Z 2025-05-07T19:46:23.0636491Z 2025-05-07T19:46:23.0636494Z 2025-05-07T19:46:23.0636704Z  2025-05-07T19:46:23.0636943Z 2025-05-07T19:46:23.0636946Z 2025-05-07T19:46:23.0636950Z 2025-05-07T19:46:23.0636953Z 2025-05-07T19:46:23.0636957Z 2025-05-07T19:46:23.0636960Z 2025-05-07T19:46:23.0636964Z 2025-05-07T19:46:23.0636967Z 2025-05-07T19:46:23.0636971Z 2025-05-07T19:46:23.0636974Z 2025-05-07T19:46:23.0636978Z 2025-05-07T19:46:23.0636981Z 2025-05-07T19:46:23.0636999Z 2025-05-07T19:46:23.0637200Z  2025-05-07T19:46:23.0637436Z 2025-05-07T19:46:23.0637439Z 2025-05-07T19:46:23.0637497Z 2025-05-07T19:46:23.0637501Z 2025-05-07T19:46:23.0637505Z 2025-05-07T19:46:23.0637508Z 2025-05-07T19:46:23.0637512Z 2025-05-07T19:46:23.0637515Z 2025-05-07T19:46:23.0637535Z 2025-05-07T19:46:23.0637538Z 2025-05-07T19:46:23.0637542Z 2025-05-07T19:46:23.0637545Z 2025-05-07T19:46:23.0637549Z 2025-05-07T19:46:23.0637552Z 2025-05-07T19:46:23.0637765Z  2025-05-07T19:46:23.0638008Z 2025-05-07T19:46:23.0638011Z 2025-05-07T19:46:23.0638015Z 2025-05-07T19:46:23.0638018Z 2025-05-07T19:46:23.0638036Z 2025-05-07T19:46:23.0638039Z 2025-05-07T19:46:23.0638042Z 2025-05-07T19:46:23.0638046Z 2025-05-07T19:46:23.0638049Z 2025-05-07T19:46:23.0638052Z 2025-05-07T19:46:23.0638056Z 2025-05-07T19:46:23.0638059Z 2025-05-07T19:46:23.0638063Z 2025-05-07T19:46:23.0638066Z 2025-05-07T19:46:23.0638069Z 2025-05-07T19:46:23.0638320Z  2025-05-07T19:46:23.0638562Z 2025-05-07T19:46:23.0638566Z 2025-05-07T19:46:23.0638569Z 2025-05-07T19:46:23.0638573Z 2025-05-07T19:46:23.0638576Z 2025-05-07T19:46:23.0638580Z 2025-05-07T19:46:23.0638583Z 2025-05-07T19:46:23.0638587Z 2025-05-07T19:46:23.0638590Z 2025-05-07T19:46:23.0638594Z 2025-05-07T19:46:23.0638597Z 2025-05-07T19:46:23.0638600Z 2025-05-07T19:46:23.0638604Z 2025-05-07T19:46:23.0638610Z 2025-05-07T19:46:23.0638614Z 2025-05-07T19:46:23.0638617Z 2025-05-07T19:46:23.0638846Z  2025-05-07T19:46:23.0639090Z 2025-05-07T19:46:23.0639094Z 2025-05-07T19:46:23.0639097Z 2025-05-07T19:46:23.0639101Z 2025-05-07T19:46:23.0639104Z 2025-05-07T19:46:23.0639108Z 2025-05-07T19:46:23.0639111Z 2025-05-07T19:46:23.0639114Z 2025-05-07T19:46:23.0639118Z 2025-05-07T19:46:23.0639135Z 2025-05-07T19:46:23.0639138Z 2025-05-07T19:46:23.0639142Z 2025-05-07T19:46:23.0639149Z 2025-05-07T19:46:23.0639152Z 2025-05-07T19:46:23.0639156Z 2025-05-07T19:46:23.0639159Z 2025-05-07T19:46:23.0639162Z 2025-05-07T19:46:23.0639380Z  2025-05-07T19:46:23.0639622Z 2025-05-07T19:46:23.0639626Z 2025-05-07T19:46:23.0639646Z 2025-05-07T19:46:23.0639650Z 2025-05-07T19:46:23.0639653Z 2025-05-07T19:46:23.0639717Z 2025-05-07T19:46:23.0639721Z 2025-05-07T19:46:23.0639724Z 2025-05-07T19:46:23.0639727Z 2025-05-07T19:46:23.0639731Z 2025-05-07T19:46:23.0639734Z 2025-05-07T19:46:23.0639738Z 2025-05-07T19:46:23.0639741Z 2025-05-07T19:46:23.0639744Z 2025-05-07T19:46:23.0639748Z 2025-05-07T19:46:23.0639751Z 2025-05-07T19:46:23.0639755Z 2025-05-07T19:46:23.0639758Z 2025-05-07T19:46:23.0639985Z  2025-05-07T19:46:23.0640252Z 2025-05-07T19:46:23.0640255Z 2025-05-07T19:46:23.0640356Z  2025-05-07T19:46:23.0640460Z 2025-05-07T19:46:23.0640463Z 2025-05-07T19:46:23.0640575Z  2025-05-07T19:46:23.0640686Z 2025-05-07T19:46:23.0640689Z 2025-05-07T19:46:23.0640692Z 2025-05-07T19:46:23.0640792Z  2025-05-07T19:46:23.0640918Z 2025-05-07T19:46:23.0640922Z 2025-05-07T19:46:23.0640925Z 2025-05-07T19:46:23.0640928Z 2025-05-07T19:46:23.0641033Z  2025-05-07T19:46:23.0641154Z 2025-05-07T19:46:23.0641162Z 2025-05-07T19:46:23.0641165Z 2025-05-07T19:46:23.0641185Z 2025-05-07T19:46:23.0641189Z 2025-05-07T19:46:23.0641295Z  2025-05-07T19:46:23.0641420Z 2025-05-07T19:46:23.0641424Z 2025-05-07T19:46:23.0641427Z 2025-05-07T19:46:23.0641431Z 2025-05-07T19:46:23.0641434Z 2025-05-07T19:46:23.0641438Z 2025-05-07T19:46:23.0641563Z  2025-05-07T19:46:23.0641692Z 2025-05-07T19:46:23.0641695Z 2025-05-07T19:46:23.0641699Z 2025-05-07T19:46:23.0641702Z 2025-05-07T19:46:23.0641705Z 2025-05-07T19:46:23.0641709Z 2025-05-07T19:46:23.0641712Z 2025-05-07T19:46:23.0641874Z  2025-05-07T19:46:23.0642031Z 2025-05-07T19:46:23.0642034Z 2025-05-07T19:46:23.0642037Z 2025-05-07T19:46:23.0642041Z 2025-05-07T19:46:23.0642044Z 2025-05-07T19:46:23.0642048Z 2025-05-07T19:46:23.0642051Z 2025-05-07T19:46:23.0642055Z 2025-05-07T19:46:23.0642171Z  2025-05-07T19:46:23.0642336Z 2025-05-07T19:46:23.0642339Z 2025-05-07T19:46:23.0642347Z 2025-05-07T19:46:23.0642350Z 2025-05-07T19:46:23.0642353Z 2025-05-07T19:46:23.0642357Z 2025-05-07T19:46:23.0642360Z 2025-05-07T19:46:23.0642363Z 2025-05-07T19:46:23.0642367Z 2025-05-07T19:46:23.0642485Z  2025-05-07T19:46:23.0642657Z 2025-05-07T19:46:23.0642660Z 2025-05-07T19:46:23.0642664Z 2025-05-07T19:46:23.0642667Z 2025-05-07T19:46:23.0642671Z 2025-05-07T19:46:23.0642674Z 2025-05-07T19:46:23.0642678Z 2025-05-07T19:46:23.0642681Z 2025-05-07T19:46:23.0642684Z 2025-05-07T19:46:23.0642688Z 2025-05-07T19:46:23.0642877Z  2025-05-07T19:46:23.0643042Z 2025-05-07T19:46:23.0643046Z 2025-05-07T19:46:23.0643049Z 2025-05-07T19:46:23.0643053Z 2025-05-07T19:46:23.0643056Z 2025-05-07T19:46:23.0643059Z 2025-05-07T19:46:23.0643062Z 2025-05-07T19:46:23.0643066Z 2025-05-07T19:46:23.0643069Z 2025-05-07T19:46:23.0643073Z 2025-05-07T19:46:23.0643076Z 2025-05-07T19:46:23.0643216Z  2025-05-07T19:46:23.0643394Z 2025-05-07T19:46:23.0643398Z 2025-05-07T19:46:23.0643401Z 2025-05-07T19:46:23.0643404Z 2025-05-07T19:46:23.0643408Z 2025-05-07T19:46:23.0643411Z 2025-05-07T19:46:23.0643414Z 2025-05-07T19:46:23.0643418Z 2025-05-07T19:46:23.0643421Z 2025-05-07T19:46:23.0643424Z 2025-05-07T19:46:23.0643428Z 2025-05-07T19:46:23.0643445Z 2025-05-07T19:46:23.0643573Z  2025-05-07T19:46:23.0643760Z 2025-05-07T19:46:23.0643763Z 2025-05-07T19:46:23.0643767Z 2025-05-07T19:46:23.0643770Z 2025-05-07T19:46:23.0643773Z 2025-05-07T19:46:23.0643777Z 2025-05-07T19:46:23.0643783Z 2025-05-07T19:46:23.0643787Z 2025-05-07T19:46:23.0643791Z 2025-05-07T19:46:23.0643794Z 2025-05-07T19:46:23.0643811Z 2025-05-07T19:46:23.0643815Z 2025-05-07T19:46:23.0643818Z 2025-05-07T19:46:23.0643956Z  2025-05-07T19:46:23.0644149Z 2025-05-07T19:46:23.0644152Z 2025-05-07T19:46:23.0644156Z 2025-05-07T19:46:23.0644159Z 2025-05-07T19:46:23.0644162Z 2025-05-07T19:46:23.0644218Z 2025-05-07T19:46:23.0644222Z 2025-05-07T19:46:23.0644243Z 2025-05-07T19:46:23.0644246Z 2025-05-07T19:46:23.0644250Z 2025-05-07T19:46:23.0644253Z 2025-05-07T19:46:23.0644256Z 2025-05-07T19:46:23.0644259Z 2025-05-07T19:46:23.0644263Z 2025-05-07T19:46:23.0644404Z  2025-05-07T19:46:23.0644601Z 2025-05-07T19:46:23.0644605Z 2025-05-07T19:46:23.0644608Z 2025-05-07T19:46:23.0644612Z 2025-05-07T19:46:23.0644631Z 2025-05-07T19:46:23.0644634Z 2025-05-07T19:46:23.0644638Z 2025-05-07T19:46:23.0644641Z 2025-05-07T19:46:23.0644644Z 2025-05-07T19:46:23.0644651Z 2025-05-07T19:46:23.0644655Z 2025-05-07T19:46:23.0644658Z 2025-05-07T19:46:23.0644661Z 2025-05-07T19:46:23.0644665Z 2025-05-07T19:46:23.0644668Z 2025-05-07T19:46:23.0644814Z  2025-05-07T19:46:23.0645036Z 2025-05-07T19:46:23.0645039Z 2025-05-07T19:46:23.0645043Z 2025-05-07T19:46:23.0645046Z 2025-05-07T19:46:23.0645049Z 2025-05-07T19:46:23.0645056Z 2025-05-07T19:46:23.0645059Z 2025-05-07T19:46:23.0645063Z 2025-05-07T19:46:23.0645066Z 2025-05-07T19:46:23.0645070Z 2025-05-07T19:46:23.0645073Z 2025-05-07T19:46:23.0645076Z 2025-05-07T19:46:23.0645079Z 2025-05-07T19:46:23.0645083Z 2025-05-07T19:46:23.0645086Z 2025-05-07T19:46:23.0645089Z 2025-05-07T19:46:23.0645242Z  2025-05-07T19:46:23.0645466Z 2025-05-07T19:46:23.0645470Z 2025-05-07T19:46:23.0645474Z 2025-05-07T19:46:23.0645478Z 2025-05-07T19:46:23.0645481Z 2025-05-07T19:46:23.0645485Z 2025-05-07T19:46:23.0645488Z 2025-05-07T19:46:23.0645543Z 2025-05-07T19:46:23.0645547Z 2025-05-07T19:46:23.0645551Z 2025-05-07T19:46:23.0645554Z 2025-05-07T19:46:23.0645557Z 2025-05-07T19:46:23.0645560Z 2025-05-07T19:46:23.0645564Z 2025-05-07T19:46:23.0645568Z 2025-05-07T19:46:23.0645571Z 2025-05-07T19:46:23.0645575Z 2025-05-07T19:46:23.0645745Z  2025-05-07T19:46:23.0645961Z 2025-05-07T19:46:23.0645968Z 2025-05-07T19:46:23.0645971Z 2025-05-07T19:46:23.0645975Z 2025-05-07T19:46:23.0645978Z 2025-05-07T19:46:23.0645982Z 2025-05-07T19:46:23.0645985Z 2025-05-07T19:46:23.0645988Z 2025-05-07T19:46:23.0645992Z 2025-05-07T19:46:23.0645995Z 2025-05-07T19:46:23.0645999Z 2025-05-07T19:46:23.0646017Z 2025-05-07T19:46:23.0646020Z 2025-05-07T19:46:23.0646024Z 2025-05-07T19:46:23.0646027Z 2025-05-07T19:46:23.0646030Z 2025-05-07T19:46:23.0646033Z 2025-05-07T19:46:23.0646037Z 2025-05-07T19:46:23.0646200Z  2025-05-07T19:46:23.0646424Z 2025-05-07T19:46:23.0646428Z 2025-05-07T19:46:23.0646543Z  2025-05-07T19:46:23.0646648Z 2025-05-07T19:46:23.0646651Z 2025-05-07T19:46:23.0646747Z  2025-05-07T19:46:23.0646871Z 2025-05-07T19:46:23.0646874Z 2025-05-07T19:46:23.0646878Z 2025-05-07T19:46:23.0646978Z  2025-05-07T19:46:23.0647087Z 2025-05-07T19:46:23.0647090Z 2025-05-07T19:46:23.0647094Z 2025-05-07T19:46:23.0647097Z 2025-05-07T19:46:23.0647220Z  2025-05-07T19:46:23.0647338Z 2025-05-07T19:46:23.0647341Z 2025-05-07T19:46:23.0647344Z 2025-05-07T19:46:23.0647348Z 2025-05-07T19:46:23.0647351Z 2025-05-07T19:46:23.0647454Z  2025-05-07T19:46:23.0647594Z 2025-05-07T19:46:23.0647598Z 2025-05-07T19:46:23.0647601Z 2025-05-07T19:46:23.0647605Z 2025-05-07T19:46:23.0647608Z 2025-05-07T19:46:23.0647611Z 2025-05-07T19:46:23.0647719Z  2025-05-07T19:46:23.0647848Z 2025-05-07T19:46:23.0647867Z 2025-05-07T19:46:23.0647871Z 2025-05-07T19:46:23.0647874Z 2025-05-07T19:46:23.0647881Z 2025-05-07T19:46:23.0647885Z 2025-05-07T19:46:23.0647888Z 2025-05-07T19:46:23.0647998Z  2025-05-07T19:46:23.0648136Z 2025-05-07T19:46:23.0648139Z 2025-05-07T19:46:23.0648142Z 2025-05-07T19:46:23.0648146Z 2025-05-07T19:46:23.0648149Z 2025-05-07T19:46:23.0648166Z 2025-05-07T19:46:23.0648169Z 2025-05-07T19:46:23.0648172Z 2025-05-07T19:46:23.0648329Z  2025-05-07T19:46:23.0648566Z 2025-05-07T19:46:23.0648570Z 2025-05-07T19:46:23.0648573Z 2025-05-07T19:46:23.0648576Z 2025-05-07T19:46:23.0648580Z 2025-05-07T19:46:23.0648583Z 2025-05-07T19:46:23.0648586Z 2025-05-07T19:46:23.0648590Z 2025-05-07T19:46:23.0648593Z 2025-05-07T19:46:23.0648716Z  2025-05-07T19:46:23.0648884Z 2025-05-07T19:46:23.0648887Z 2025-05-07T19:46:23.0648891Z 2025-05-07T19:46:23.0648895Z 2025-05-07T19:46:23.0648898Z 2025-05-07T19:46:23.0648901Z 2025-05-07T19:46:23.0648905Z 2025-05-07T19:46:23.0648908Z 2025-05-07T19:46:23.0648912Z 2025-05-07T19:46:23.0648918Z 2025-05-07T19:46:23.0649041Z  2025-05-07T19:46:23.0649219Z 2025-05-07T19:46:23.0649223Z 2025-05-07T19:46:23.0649226Z 2025-05-07T19:46:23.0649230Z 2025-05-07T19:46:23.0649233Z 2025-05-07T19:46:23.0649236Z 2025-05-07T19:46:23.0649240Z 2025-05-07T19:46:23.0649243Z 2025-05-07T19:46:23.0649247Z 2025-05-07T19:46:23.0649250Z 2025-05-07T19:46:23.0649253Z 2025-05-07T19:46:23.0649382Z  2025-05-07T19:46:23.0649572Z 2025-05-07T19:46:23.0649576Z 2025-05-07T19:46:23.0649579Z 2025-05-07T19:46:23.0649583Z 2025-05-07T19:46:23.0649586Z 2025-05-07T19:46:23.0649589Z 2025-05-07T19:46:23.0649593Z 2025-05-07T19:46:23.0649596Z 2025-05-07T19:46:23.0649600Z 2025-05-07T19:46:23.0649603Z 2025-05-07T19:46:23.0649606Z 2025-05-07T19:46:23.0649610Z 2025-05-07T19:46:23.0649738Z  2025-05-07T19:46:23.0649937Z 2025-05-07T19:46:23.0649941Z 2025-05-07T19:46:23.0649944Z 2025-05-07T19:46:23.0649947Z 2025-05-07T19:46:23.0650003Z 2025-05-07T19:46:23.0650007Z 2025-05-07T19:46:23.0650011Z 2025-05-07T19:46:23.0650014Z 2025-05-07T19:46:23.0650017Z 2025-05-07T19:46:23.0650021Z 2025-05-07T19:46:23.0650024Z 2025-05-07T19:46:23.0650028Z 2025-05-07T19:46:23.0650031Z 2025-05-07T19:46:23.0650164Z  2025-05-07T19:46:23.0650367Z 2025-05-07T19:46:23.0650371Z 2025-05-07T19:46:23.0650374Z 2025-05-07T19:46:23.0650381Z 2025-05-07T19:46:23.0650384Z 2025-05-07T19:46:23.0650388Z 2025-05-07T19:46:23.0650391Z 2025-05-07T19:46:23.0650395Z 2025-05-07T19:46:23.0650398Z 2025-05-07T19:46:23.0650401Z 2025-05-07T19:46:23.0650405Z 2025-05-07T19:46:23.0650408Z 2025-05-07T19:46:23.0650412Z 2025-05-07T19:46:23.0650415Z 2025-05-07T19:46:23.0650568Z  2025-05-07T19:46:23.0650767Z 2025-05-07T19:46:23.0650770Z 2025-05-07T19:46:23.0650774Z 2025-05-07T19:46:23.0650777Z 2025-05-07T19:46:23.0650780Z 2025-05-07T19:46:23.0650784Z 2025-05-07T19:46:23.0650790Z 2025-05-07T19:46:23.0650794Z 2025-05-07T19:46:23.0650797Z 2025-05-07T19:46:23.0650801Z 2025-05-07T19:46:23.0650804Z 2025-05-07T19:46:23.0650807Z 2025-05-07T19:46:23.0650811Z 2025-05-07T19:46:23.0650814Z 2025-05-07T19:46:23.0650817Z 2025-05-07T19:46:23.0650977Z  2025-05-07T19:46:23.0651184Z 2025-05-07T19:46:23.0651187Z 2025-05-07T19:46:23.0651191Z 2025-05-07T19:46:23.0651197Z 2025-05-07T19:46:23.0651201Z 2025-05-07T19:46:23.0651204Z 2025-05-07T19:46:23.0651207Z 2025-05-07T19:46:23.0651211Z 2025-05-07T19:46:23.0651214Z 2025-05-07T19:46:23.0651218Z 2025-05-07T19:46:23.0651235Z 2025-05-07T19:46:23.0651238Z 2025-05-07T19:46:23.0651242Z 2025-05-07T19:46:23.0651245Z 2025-05-07T19:46:23.0651249Z 2025-05-07T19:46:23.0651252Z 2025-05-07T19:46:23.0651402Z  2025-05-07T19:46:23.0651612Z 2025-05-07T19:46:23.0651616Z 2025-05-07T19:46:23.0651619Z 2025-05-07T19:46:23.0651623Z 2025-05-07T19:46:23.0651629Z 2025-05-07T19:46:23.0651647Z 2025-05-07T19:46:23.0651650Z 2025-05-07T19:46:23.0651654Z 2025-05-07T19:46:23.0651657Z 2025-05-07T19:46:23.0651661Z 2025-05-07T19:46:23.0651664Z 2025-05-07T19:46:23.0651667Z 2025-05-07T19:46:23.0651671Z 2025-05-07T19:46:23.0651674Z 2025-05-07T19:46:23.0651677Z 2025-05-07T19:46:23.0651681Z 2025-05-07T19:46:23.0651684Z 2025-05-07T19:46:23.0651838Z  2025-05-07T19:46:23.0652121Z 2025-05-07T19:46:23.0652125Z 2025-05-07T19:46:23.0652129Z 2025-05-07T19:46:23.0652132Z 2025-05-07T19:46:23.0652135Z 2025-05-07T19:46:23.0652139Z 2025-05-07T19:46:23.0652142Z 2025-05-07T19:46:23.0652146Z 2025-05-07T19:46:23.0652149Z 2025-05-07T19:46:23.0652152Z 2025-05-07T19:46:23.0652155Z 2025-05-07T19:46:23.0652159Z 2025-05-07T19:46:23.0652162Z 2025-05-07T19:46:23.0652166Z 2025-05-07T19:46:23.0652169Z 2025-05-07T19:46:23.0652172Z 2025-05-07T19:46:23.0652176Z 2025-05-07T19:46:23.0652179Z 2025-05-07T19:46:23.0652361Z  2025-05-07T19:46:23.0652584Z 2025-05-07T19:46:23.0652588Z 2025-05-07T19:46:23.0652682Z  2025-05-07T19:46:23.0652799Z 2025-05-07T19:46:23.0652802Z 2025-05-07T19:46:23.0652900Z  2025-05-07T19:46:23.0653007Z 2025-05-07T19:46:23.0653011Z 2025-05-07T19:46:23.0653014Z 2025-05-07T19:46:23.0653127Z  2025-05-07T19:46:23.0653235Z 2025-05-07T19:46:23.0653243Z 2025-05-07T19:46:23.0653247Z 2025-05-07T19:46:23.0653250Z 2025-05-07T19:46:23.0653351Z  2025-05-07T19:46:23.0653481Z 2025-05-07T19:46:23.0653485Z 2025-05-07T19:46:23.0653488Z 2025-05-07T19:46:23.0653492Z 2025-05-07T19:46:23.0653495Z 2025-05-07T19:46:23.0653600Z  2025-05-07T19:46:23.0653724Z 2025-05-07T19:46:23.0653727Z 2025-05-07T19:46:23.0653731Z 2025-05-07T19:46:23.0653734Z 2025-05-07T19:46:23.0653737Z 2025-05-07T19:46:23.0653741Z 2025-05-07T19:46:23.0653862Z  2025-05-07T19:46:23.0653989Z 2025-05-07T19:46:23.0653993Z 2025-05-07T19:46:23.0654047Z 2025-05-07T19:46:23.0654051Z 2025-05-07T19:46:23.0654054Z 2025-05-07T19:46:23.0654057Z 2025-05-07T19:46:23.0654061Z 2025-05-07T19:46:23.0654186Z  2025-05-07T19:46:23.0654325Z 2025-05-07T19:46:23.0654329Z 2025-05-07T19:46:23.0654332Z 2025-05-07T19:46:23.0654336Z 2025-05-07T19:46:23.0654339Z 2025-05-07T19:46:23.0654343Z 2025-05-07T19:46:23.0654346Z 2025-05-07T19:46:23.0654352Z 2025-05-07T19:46:23.0654467Z  2025-05-07T19:46:23.0654631Z 2025-05-07T19:46:23.0654635Z 2025-05-07T19:46:23.0654638Z 2025-05-07T19:46:23.0654641Z 2025-05-07T19:46:23.0654645Z 2025-05-07T19:46:23.0654648Z 2025-05-07T19:46:23.0654652Z 2025-05-07T19:46:23.0654655Z 2025-05-07T19:46:23.0654659Z 2025-05-07T19:46:23.0654778Z  2025-05-07T19:46:23.0654949Z 2025-05-07T19:46:23.0654953Z 2025-05-07T19:46:23.0654956Z 2025-05-07T19:46:23.0654959Z 2025-05-07T19:46:23.0654963Z 2025-05-07T19:46:23.0654966Z 2025-05-07T19:46:23.0654969Z 2025-05-07T19:46:23.0654976Z 2025-05-07T19:46:23.0654980Z 2025-05-07T19:46:23.0654983Z 2025-05-07T19:46:23.0655106Z  2025-05-07T19:46:23.0655288Z 2025-05-07T19:46:23.0655291Z 2025-05-07T19:46:23.0655295Z 2025-05-07T19:46:23.0655298Z 2025-05-07T19:46:23.0655301Z 2025-05-07T19:46:23.0655305Z 2025-05-07T19:46:23.0655308Z 2025-05-07T19:46:23.0655311Z 2025-05-07T19:46:23.0655315Z 2025-05-07T19:46:23.0655321Z 2025-05-07T19:46:23.0655325Z 2025-05-07T19:46:23.0655460Z  2025-05-07T19:46:23.0655650Z 2025-05-07T19:46:23.0655653Z 2025-05-07T19:46:23.0655657Z 2025-05-07T19:46:23.0655660Z 2025-05-07T19:46:23.0655663Z 2025-05-07T19:46:23.0655667Z 2025-05-07T19:46:23.0655670Z 2025-05-07T19:46:23.0655673Z 2025-05-07T19:46:23.0655677Z 2025-05-07T19:46:23.0655680Z 2025-05-07T19:46:23.0655683Z 2025-05-07T19:46:23.0655687Z 2025-05-07T19:46:23.0658669Z  2025-05-07T19:46:23.0658879Z 2025-05-07T19:46:23.0658898Z 2025-05-07T19:46:23.0658902Z 2025-05-07T19:46:23.0658906Z 2025-05-07T19:46:23.0658909Z 2025-05-07T19:46:23.0658912Z 2025-05-07T19:46:23.0658916Z 2025-05-07T19:46:23.0658919Z 2025-05-07T19:46:23.0658923Z 2025-05-07T19:46:23.0658927Z 2025-05-07T19:46:23.0658930Z 2025-05-07T19:46:23.0658933Z 2025-05-07T19:46:23.0658937Z 2025-05-07T19:46:23.0659089Z  2025-05-07T19:46:23.0660501Z 2025-05-07T19:46:23.0660504Z 2025-05-07T19:46:23.0660508Z 2025-05-07T19:46:23.0660511Z 2025-05-07T19:46:23.0660515Z 2025-05-07T19:46:23.0660518Z 2025-05-07T19:46:23.0660522Z 2025-05-07T19:46:23.0660525Z 2025-05-07T19:46:23.0660528Z 2025-05-07T19:46:23.0660532Z 2025-05-07T19:46:23.0660535Z 2025-05-07T19:46:23.0660538Z 2025-05-07T19:46:23.0660542Z 2025-05-07T19:46:23.0660545Z 2025-05-07T19:46:23.0660714Z  2025-05-07T19:46:23.0660913Z 2025-05-07T19:46:23.0660916Z 2025-05-07T19:46:23.0660920Z 2025-05-07T19:46:23.0660924Z 2025-05-07T19:46:23.0660934Z 2025-05-07T19:46:23.0660938Z 2025-05-07T19:46:23.0660941Z 2025-05-07T19:46:23.0660944Z 2025-05-07T19:46:23.0660948Z 2025-05-07T19:46:23.0660951Z 2025-05-07T19:46:23.0660955Z 2025-05-07T19:46:23.0660958Z 2025-05-07T19:46:23.0660962Z 2025-05-07T19:46:23.0660978Z 2025-05-07T19:46:23.0660982Z 2025-05-07T19:46:23.0661126Z  2025-05-07T19:46:23.0661338Z 2025-05-07T19:46:23.0661341Z 2025-05-07T19:46:23.0661345Z 2025-05-07T19:46:23.0661348Z 2025-05-07T19:46:23.0661351Z 2025-05-07T19:46:23.0661354Z 2025-05-07T19:46:23.0661358Z 2025-05-07T19:46:23.0661361Z 2025-05-07T19:46:23.0661365Z 2025-05-07T19:46:23.0661385Z 2025-05-07T19:46:23.0661389Z 2025-05-07T19:46:23.0661392Z 2025-05-07T19:46:23.0661396Z 2025-05-07T19:46:23.0661399Z 2025-05-07T19:46:23.0661403Z 2025-05-07T19:46:23.0661406Z 2025-05-07T19:46:23.0661560Z  2025-05-07T19:46:23.0661768Z 2025-05-07T19:46:23.0661772Z 2025-05-07T19:46:23.0661833Z 2025-05-07T19:46:23.0661852Z 2025-05-07T19:46:23.0661855Z 2025-05-07T19:46:23.0661860Z 2025-05-07T19:46:23.0661863Z 2025-05-07T19:46:23.0661866Z 2025-05-07T19:46:23.0661870Z 2025-05-07T19:46:23.0661873Z 2025-05-07T19:46:23.0661877Z 2025-05-07T19:46:23.0661880Z 2025-05-07T19:46:23.0661884Z 2025-05-07T19:46:23.0661887Z 2025-05-07T19:46:23.0661891Z 2025-05-07T19:46:23.0661894Z 2025-05-07T19:46:23.0661901Z 2025-05-07T19:46:23.0662061Z  2025-05-07T19:46:23.0662290Z 2025-05-07T19:46:23.0662293Z 2025-05-07T19:46:23.0662297Z 2025-05-07T19:46:23.0662300Z 2025-05-07T19:46:23.0662303Z 2025-05-07T19:46:23.0662307Z 2025-05-07T19:46:23.0662310Z 2025-05-07T19:46:23.0662314Z 2025-05-07T19:46:23.0662317Z 2025-05-07T19:46:23.0662320Z 2025-05-07T19:46:23.0662324Z 2025-05-07T19:46:23.0662327Z 2025-05-07T19:46:23.0662330Z 2025-05-07T19:46:23.0662334Z 2025-05-07T19:46:23.0662337Z 2025-05-07T19:46:23.0662340Z 2025-05-07T19:46:23.0662348Z 2025-05-07T19:46:23.0662351Z 2025-05-07T19:46:23.0662530Z  2025-05-07T19:46:23.0662747Z 2025-05-07T19:46:23.0662751Z 2025-05-07T19:46:23.0662846Z  2025-05-07T19:46:23.0662967Z 2025-05-07T19:46:23.0662971Z 2025-05-07T19:46:23.0663070Z  2025-05-07T19:46:23.0663177Z 2025-05-07T19:46:23.0663180Z 2025-05-07T19:46:23.0663184Z 2025-05-07T19:46:23.0663315Z  2025-05-07T19:46:23.0663426Z 2025-05-07T19:46:23.0663430Z 2025-05-07T19:46:23.0663433Z 2025-05-07T19:46:23.0663436Z 2025-05-07T19:46:23.0663538Z  2025-05-07T19:46:23.0663670Z 2025-05-07T19:46:23.0663674Z 2025-05-07T19:46:23.0663677Z 2025-05-07T19:46:23.0663680Z 2025-05-07T19:46:23.0663684Z 2025-05-07T19:46:23.0663787Z  2025-05-07T19:46:23.0663912Z 2025-05-07T19:46:23.0663916Z 2025-05-07T19:46:23.0663919Z 2025-05-07T19:46:23.0663923Z 2025-05-07T19:46:23.0663926Z 2025-05-07T19:46:23.0663943Z 2025-05-07T19:46:23.0664051Z  2025-05-07T19:46:23.0664183Z 2025-05-07T19:46:23.0664187Z 2025-05-07T19:46:23.0664190Z 2025-05-07T19:46:23.0664193Z 2025-05-07T19:46:23.0664197Z 2025-05-07T19:46:23.0664200Z 2025-05-07T19:46:23.0664203Z 2025-05-07T19:46:23.0664330Z  2025-05-07T19:46:23.0664470Z 2025-05-07T19:46:23.0664474Z 2025-05-07T19:46:23.0664477Z 2025-05-07T19:46:23.0664481Z 2025-05-07T19:46:23.0664484Z 2025-05-07T19:46:23.0664563Z 2025-05-07T19:46:23.0664567Z 2025-05-07T19:46:23.0664570Z 2025-05-07T19:46:23.0664686Z  2025-05-07T19:46:23.0664853Z 2025-05-07T19:46:23.0664857Z 2025-05-07T19:46:23.0664860Z 2025-05-07T19:46:23.0664863Z 2025-05-07T19:46:23.0664867Z 2025-05-07T19:46:23.0664870Z 2025-05-07T19:46:23.0664873Z 2025-05-07T19:46:23.0664877Z 2025-05-07T19:46:23.0664880Z 2025-05-07T19:46:23.0665001Z  2025-05-07T19:46:23.0665172Z 2025-05-07T19:46:23.0665176Z 2025-05-07T19:46:23.0665180Z 2025-05-07T19:46:23.0665183Z 2025-05-07T19:46:23.0665190Z 2025-05-07T19:46:23.0665194Z 2025-05-07T19:46:23.0665197Z 2025-05-07T19:46:23.0665200Z 2025-05-07T19:46:23.0665204Z 2025-05-07T19:46:23.0665207Z 2025-05-07T19:46:23.0665331Z  2025-05-07T19:46:23.0665511Z 2025-05-07T19:46:23.0665515Z 2025-05-07T19:46:23.0665518Z 2025-05-07T19:46:23.0665522Z 2025-05-07T19:46:23.0665525Z 2025-05-07T19:46:23.0665529Z 2025-05-07T19:46:23.0665536Z 2025-05-07T19:46:23.0665539Z 2025-05-07T19:46:23.0665543Z 2025-05-07T19:46:23.0665546Z 2025-05-07T19:46:23.0665549Z 2025-05-07T19:46:23.0665675Z  2025-05-07T19:46:23.0665866Z 2025-05-07T19:46:23.0665869Z 2025-05-07T19:46:23.0665873Z 2025-05-07T19:46:23.0665876Z 2025-05-07T19:46:23.0665880Z 2025-05-07T19:46:23.0665883Z 2025-05-07T19:46:23.0665887Z 2025-05-07T19:46:23.0665890Z 2025-05-07T19:46:23.0665894Z 2025-05-07T19:46:23.0665897Z 2025-05-07T19:46:23.0665900Z 2025-05-07T19:46:23.0665904Z 2025-05-07T19:46:23.0666104Z  2025-05-07T19:46:23.0666305Z 2025-05-07T19:46:23.0666309Z 2025-05-07T19:46:23.0666313Z 2025-05-07T19:46:23.0666316Z 2025-05-07T19:46:23.0666319Z 2025-05-07T19:46:23.0666323Z 2025-05-07T19:46:23.0666326Z 2025-05-07T19:46:23.0666330Z 2025-05-07T19:46:23.0666333Z 2025-05-07T19:46:23.0666337Z 2025-05-07T19:46:23.0666340Z 2025-05-07T19:46:23.0666343Z 2025-05-07T19:46:23.0666347Z 2025-05-07T19:46:23.0666498Z  2025-05-07T19:46:23.0666690Z 2025-05-07T19:46:23.0666693Z 2025-05-07T19:46:23.0666697Z 2025-05-07T19:46:23.0666700Z 2025-05-07T19:46:23.0666704Z 2025-05-07T19:46:23.0666707Z 2025-05-07T19:46:23.0666710Z 2025-05-07T19:46:23.0666713Z 2025-05-07T19:46:23.0666717Z 2025-05-07T19:46:23.0666720Z 2025-05-07T19:46:23.0666723Z 2025-05-07T19:46:23.0666727Z 2025-05-07T19:46:23.0666730Z 2025-05-07T19:46:23.0666733Z 2025-05-07T19:46:23.0666886Z  2025-05-07T19:46:23.0667083Z 2025-05-07T19:46:23.0667086Z 2025-05-07T19:46:23.0667094Z 2025-05-07T19:46:23.0667097Z 2025-05-07T19:46:23.0667101Z 2025-05-07T19:46:23.0667104Z 2025-05-07T19:46:23.0667107Z 2025-05-07T19:46:23.0667111Z 2025-05-07T19:46:23.0667114Z 2025-05-07T19:46:23.0667118Z 2025-05-07T19:46:23.0667121Z 2025-05-07T19:46:23.0667124Z 2025-05-07T19:46:23.0667128Z 2025-05-07T19:46:23.0667145Z 2025-05-07T19:46:23.0667148Z 2025-05-07T19:46:23.0667292Z  2025-05-07T19:46:23.0667498Z 2025-05-07T19:46:23.0667501Z 2025-05-07T19:46:23.0667505Z 2025-05-07T19:46:23.0667508Z 2025-05-07T19:46:23.0667512Z 2025-05-07T19:46:23.0667515Z 2025-05-07T19:46:23.0667518Z 2025-05-07T19:46:23.0667522Z 2025-05-07T19:46:23.0667525Z 2025-05-07T19:46:23.0667542Z 2025-05-07T19:46:23.0667546Z 2025-05-07T19:46:23.0667549Z 2025-05-07T19:46:23.0667552Z 2025-05-07T19:46:23.0667556Z 2025-05-07T19:46:23.0667559Z 2025-05-07T19:46:23.0667562Z 2025-05-07T19:46:23.0667711Z  2025-05-07T19:46:23.0667923Z 2025-05-07T19:46:23.0667927Z 2025-05-07T19:46:23.0667930Z 2025-05-07T19:46:23.0667950Z 2025-05-07T19:46:23.0667954Z 2025-05-07T19:46:23.0667957Z 2025-05-07T19:46:23.0667960Z 2025-05-07T19:46:23.0667963Z 2025-05-07T19:46:23.0667967Z 2025-05-07T19:46:23.0667970Z 2025-05-07T19:46:23.0667974Z 2025-05-07T19:46:23.0667977Z 2025-05-07T19:46:23.0667981Z 2025-05-07T19:46:23.0667984Z 2025-05-07T19:46:23.0668041Z 2025-05-07T19:46:23.0668044Z 2025-05-07T19:46:23.0668047Z 2025-05-07T19:46:23.0668203Z  2025-05-07T19:46:23.0668432Z 2025-05-07T19:46:23.0668435Z 2025-05-07T19:46:23.0668439Z 2025-05-07T19:46:23.0668442Z 2025-05-07T19:46:23.0668446Z 2025-05-07T19:46:23.0668449Z 2025-05-07T19:46:23.0668452Z 2025-05-07T19:46:23.0668456Z 2025-05-07T19:46:23.0668459Z 2025-05-07T19:46:23.0668462Z 2025-05-07T19:46:23.0668466Z 2025-05-07T19:46:23.0668469Z 2025-05-07T19:46:23.0668472Z 2025-05-07T19:46:23.0668476Z 2025-05-07T19:46:23.0668483Z 2025-05-07T19:46:23.0668486Z 2025-05-07T19:46:23.0668490Z 2025-05-07T19:46:23.0668493Z 2025-05-07T19:46:23.0668667Z  2025-05-07T19:46:23.0668883Z 2025-05-07T19:46:23.0668886Z 2025-05-07T19:46:23.0668981Z  2025-05-07T19:46:23.0669107Z 2025-05-07T19:46:23.0669111Z 2025-05-07T19:46:23.0669218Z  2025-05-07T19:46:23.0669327Z 2025-05-07T19:46:23.0669334Z 2025-05-07T19:46:23.0669338Z 2025-05-07T19:46:23.0669450Z  2025-05-07T19:46:23.0669558Z 2025-05-07T19:46:23.0669561Z 2025-05-07T19:46:23.0669565Z 2025-05-07T19:46:23.0669568Z 2025-05-07T19:46:23.0669669Z  2025-05-07T19:46:23.0669799Z 2025-05-07T19:46:23.0669802Z 2025-05-07T19:46:23.0669806Z 2025-05-07T19:46:23.0669809Z 2025-05-07T19:46:23.0669813Z 2025-05-07T19:46:23.0669916Z  2025-05-07T19:46:23.0670039Z 2025-05-07T19:46:23.0670043Z 2025-05-07T19:46:23.0670047Z 2025-05-07T19:46:23.0670050Z 2025-05-07T19:46:23.0670053Z 2025-05-07T19:46:23.0670121Z 2025-05-07T19:46:23.0670231Z  2025-05-07T19:46:23.0670360Z 2025-05-07T19:46:23.0670364Z 2025-05-07T19:46:23.0670368Z 2025-05-07T19:46:23.0670371Z 2025-05-07T19:46:23.0670374Z 2025-05-07T19:46:23.0670378Z 2025-05-07T19:46:23.0670381Z 2025-05-07T19:46:23.0670521Z  done 2025-05-07T19:46:23.2788386Z Preparing transaction: / - done 2025-05-07T19:46:23.9818429Z Verifying transaction: | / - \ | / - done 2025-05-07T19:46:24.2863593Z Executing transaction: | / - done 2025-05-07T19:46:26.0419420Z [INSTALL] Fixing file placements for CUDA 12.6.3+ ... 2025-05-07T19:46:26.0420845Z [INSTALL] Creating symlinks: libnvToolsExt.so 2025-05-07T19:46:26.0423362Z + ln -sf /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:26.0425066Z 2025-05-07T19:46:26.0430698Z 2025-05-07T19:46:26.0431723Z + ln -sf /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:26.0432507Z 2025-05-07T19:46:26.0448159Z 2025-05-07T19:46:26.0448466Z [INSTALL] Copying nvtx3 headers ... 2025-05-07T19:46:26.0453045Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/include/ 2025-05-07T19:46:26.0457112Z 2025-05-07T19:46:26.0665199Z 2025-05-07T19:46:26.0671599Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/ 2025-05-07T19:46:26.0676019Z 2025-05-07T19:46:26.0687685Z 2025-05-07T19:46:26.0688029Z [INSTALL] Appending libcuda.so path to LD_LIBRARY_PATH ... 2025-05-07T19:46:26.1085577Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs ... 2025-05-07T19:46:27.7415282Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs 2025-05-07T19:46:27.7417602Z 2025-05-07T19:46:28.1527935Z 2025-05-07T19:46:28.1531492Z [INSTALL] Setting environment variable NVML_LIB_PATH ... 2025-05-07T19:46:28.1908577Z + conda env config vars set -n build_binary NVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:28.1910171Z 2025-05-07T19:46:28.5968602Z 2025-05-07T19:46:28.5969867Z [INSTALL] Setting environment variable CUDA_INCLUDE_DIRS ... 2025-05-07T19:46:28.5970879Z + conda env config vars set -n build_binary CUDA_INCLUDE_DIRS="/github/home/miniconda/envs/build_binary/include/:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/" 2025-05-07T19:46:28.5971757Z 2025-05-07T19:46:29.0071672Z 2025-05-07T19:46:30.7319646Z [CHECK] cuda_runtime.h found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/cuda_runtime.h 2025-05-07T19:46:32.4463633Z [CHECK] libcuda.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:46:34.1621616Z [CHECK] libnvToolsExt.so found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:34.1624531Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:35.8782450Z [CHECK] libnvidia-ml.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:37.4553677Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:46:37.4554185Z 2025-05-07T19:46:37.5127925Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:46:40.7526678Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:46:40.7528551Z Target: x86_64-conda-linux-gnu 2025-05-07T19:46:40.7529339Z Thread model: posix 2025-05-07T19:46:40.7529866Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:46:40.7530591Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang.cfg 2025-05-07T19:46:40.7531014Z 2025-05-07T19:46:40.8084500Z [INSTALL] Resetting compiler symlinks to clang ... 2025-05-07T19:46:44.1494629Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:46:44.1495200Z 2025-05-07T19:46:44.1504908Z 2025-05-07T19:46:44.1525588Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:46:44.1527085Z 2025-05-07T19:46:44.1540977Z 2025-05-07T19:46:44.1559582Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:46:44.1560156Z 2025-05-07T19:46:44.1572387Z 2025-05-07T19:46:44.1588326Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:46:44.1588893Z 2025-05-07T19:46:44.1604473Z 2025-05-07T19:46:44.1604980Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:46:44.1605412Z 2025-05-07T19:46:44.1620902Z total 20 2025-05-07T19:46:44.1621216Z drwxr-xr-x. 2 root root 154 May 7 19:46 . 2025-05-07T19:46:44.1621603Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:46:44.1622270Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:46:44.1622775Z -rw-r--r--. 2 root root 136 Mar 27 01:27 libglib_activate.sh 2025-05-07T19:46:44.1623214Z -rw-r--r--. 2 root root 873 Jun 5 2024 libxml2_activate.sh 2025-05-07T19:46:44.1623630Z -rw-r--r--. 2 root root 499 Nov 30 04:26 openjdk_activate.sh 2025-05-07T19:46:44.1624064Z -rw-r--r--. 2 root root 2932 Nov 20 20:32 ~cuda-nvcc_activate.sh 2025-05-07T19:46:44.1624340Z 2025-05-07T19:46:44.1624576Z [INSTALL] Removing the -ccbin=CXX hook from NVCC activation scripts ... 2025-05-07T19:46:44.1625266Z + sed -i /-ccbin=/d /github/home/miniconda/envs/build_binary/etc/conda/activate.d/*cuda-nvcc_activate.sh 2025-05-07T19:46:44.1625714Z 2025-05-07T19:46:44.1637591Z 2025-05-07T19:46:44.1637801Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:46:44.1638090Z 2025-05-07T19:46:45.8189928Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:46:45.8193161Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:46:45.8193755Z 2025-05-07T19:46:45.8193922Z [BUILD] Setting Clang as the NVCC host compiler: 2025-05-07T19:46:47.4483447Z [BUILD] Setting prepend flags for NVCC ... 2025-05-07T19:46:47.4484958Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++" 2025-05-07T19:46:47.4485806Z 2025-05-07T19:46:47.8578358Z 2025-05-07T19:46:47.8579386Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:46:47.8580459Z 2025-05-07T19:46:49.4337612Z -allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:46:49.4339223Z 2025-05-07T19:46:49.4905308Z 2025-05-07T19:46:49.4906392Z [INFO] Printing out all preprocessor defines in nvcc ... 2025-05-07T19:46:49.4907974Z + conda run -n build_binary nvcc --compiler-options -dM -E -x cu - < /dev/null 2025-05-07T19:46:49.4909010Z 2025-05-07T19:46:51.1780894Z #define ADJ_ESTERROR 0x0008 2025-05-07T19:46:51.1781846Z #define ADJ_FREQUENCY 0x0002 2025-05-07T19:46:51.1782639Z #define ADJ_MAXERROR 0x0004 2025-05-07T19:46:51.1783354Z #define ADJ_MICRO 0x1000 2025-05-07T19:46:51.1784058Z #define ADJ_NANO 0x2000 2025-05-07T19:46:51.1784735Z #define ADJ_OFFSET 0x0001 2025-05-07T19:46:51.1785530Z #define ADJ_OFFSET_SINGLESHOT 0x8001 2025-05-07T19:46:51.1786440Z #define ADJ_OFFSET_SS_READ 0xa001 2025-05-07T19:46:51.1787245Z #define ADJ_STATUS 0x0010 2025-05-07T19:46:51.1787941Z #define ADJ_TAI 0x0080 2025-05-07T19:46:51.1788584Z #define ADJ_TICK 0x4000 2025-05-07T19:46:51.1789288Z #define ADJ_TIMECONST 0x0020 2025-05-07T19:46:51.1790051Z #define AIO_PRIO_DELTA_MAX 20 2025-05-07T19:46:51.1790903Z #define BC_BASE_MAX _POSIX2_BC_BASE_MAX 2025-05-07T19:46:51.1791785Z #define BC_DIM_MAX _POSIX2_BC_DIM_MAX 2025-05-07T19:46:51.1792765Z #define BC_SCALE_MAX _POSIX2_BC_SCALE_MAX 2025-05-07T19:46:51.1793112Z #define BC_STRING_MAX _POSIX2_BC_STRING_MAX 2025-05-07T19:46:51.1793445Z #define BIG_ENDIAN __BIG_ENDIAN 2025-05-07T19:46:51.1793728Z #define BUFSIZ _IO_BUFSIZ 2025-05-07T19:46:51.1793991Z #define BYTE_ORDER __BYTE_ORDER 2025-05-07T19:46:51.1794285Z #define CHARCLASS_NAME_MAX 2048 2025-05-07T19:46:51.1794577Z #define CHAR_BIT __CHAR_BIT__ 2025-05-07T19:46:51.1794863Z #define CHAR_MAX __SCHAR_MAX__ 2025-05-07T19:46:51.1795129Z #define CHAR_MIN SCHAR_MIN 2025-05-07T19:46:51.1795822Z #define CLOCKS_PER_SEC 1000000l 2025-05-07T19:46:51.1796097Z #define CLOCK_BOOTTIME 7 2025-05-07T19:46:51.1796380Z #define CLOCK_BOOTTIME_ALARM 9 2025-05-07T19:46:51.1796653Z #define CLOCK_MONOTONIC 1 2025-05-07T19:46:51.1796961Z #define CLOCK_MONOTONIC_COARSE 6 2025-05-07T19:46:51.1797267Z #define CLOCK_MONOTONIC_RAW 4 2025-05-07T19:46:51.1797595Z #define CLOCK_PROCESS_CPUTIME_ID 2 2025-05-07T19:46:51.1797931Z #define CLOCK_REALTIME 0 2025-05-07T19:46:51.1798565Z #define CLOCK_REALTIME_ALARM 8 2025-05-07T19:46:51.1798901Z #define CLOCK_REALTIME_COARSE 5 2025-05-07T19:46:51.1799199Z #define CLOCK_TAI 11 2025-05-07T19:46:51.1799482Z #define CLOCK_THREAD_CPUTIME_ID 3 2025-05-07T19:46:51.1799790Z #define COLL_WEIGHTS_MAX 255 2025-05-07T19:46:51.1800099Z #define CUDARTAPI 2025-05-07T19:46:51.1800342Z #define CUDARTAPI_CDECL 2025-05-07T19:46:51.1800610Z #define CUDART_CB 2025-05-07T19:46:51.1800858Z #define CUDART_DEVICE __device__ 2025-05-07T19:46:51.1801159Z #define CUDART_VERSION 12060 2025-05-07T19:46:51.1801455Z #define CUDA_DOUBLE_MATH_FUNCTIONS 1 2025-05-07T19:46:51.1801765Z #define CUDA_IPC_HANDLE_SIZE 64 2025-05-07T19:46:51.1802059Z #define CU_UUID_HAS_BEEN_DEFINED 2025-05-07T19:46:51.1802353Z #define DELAYTIMER_MAX 2147483647 2025-05-07T19:46:51.1802630Z #define DOMAIN 1 2025-05-07T19:46:51.1802858Z #define EOF (-1) 2025-05-07T19:46:51.1803118Z #define EXIT_FAILURE 1 2025-05-07T19:46:51.1803365Z #define EXIT_SUCCESS 0 2025-05-07T19:46:51.1803680Z #define EXPR_NEST_MAX _POSIX2_EXPR_NEST_MAX 2025-05-07T19:46:51.1804217Z #define FD_CLR(fd,fdsetp) __FD_CLR (fd, fdsetp) 2025-05-07T19:46:51.1804608Z #define FD_ISSET(fd,fdsetp) __FD_ISSET (fd, fdsetp) 2025-05-07T19:46:51.1805029Z #define FD_SET(fd,fdsetp) __FD_SET (fd, fdsetp) 2025-05-07T19:46:51.1805374Z #define FD_SETSIZE __FD_SETSIZE 2025-05-07T19:46:51.1805757Z #define FD_ZERO(fdsetp) __FD_ZERO (fdsetp) 2025-05-07T19:46:51.1806079Z #define FILENAME_MAX 4096 2025-05-07T19:46:51.1806375Z #define FOPEN_MAX 16 2025-05-07T19:46:51.1806656Z #define FP_ILOGB0 (-2147483647 - 1) 2025-05-07T19:46:51.1806987Z #define FP_ILOGBNAN (-2147483647 - 1) 2025-05-07T19:46:51.1828721Z #define FP_INFINITE 1 2025-05-07T19:46:51.1828996Z #define FP_NAN 0 2025-05-07T19:46:51.1829210Z #define FP_NORMAL 4 2025-05-07T19:46:51.1829452Z #define FP_SUBNORMAL 3 2025-05-07T19:46:51.1829692Z #define FP_ZERO 2 2025-05-07T19:46:51.1829907Z #define HOST_NAME_MAX 64 2025-05-07T19:46:51.1830161Z #define HUGE 3.40282347e+38F 2025-05-07T19:46:51.1830418Z #define HUGE_VAL (__builtin_huge_val()) 2025-05-07T19:46:51.1830748Z #define HUGE_VALF (__builtin_huge_valf()) 2025-05-07T19:46:51.1831048Z #define HUGE_VALL (__builtin_huge_vall()) 2025-05-07T19:46:51.1831357Z #define INFINITY (__builtin_inff()) 2025-05-07T19:46:51.1831624Z #define INT_MAX __INT_MAX__ 2025-05-07T19:46:51.1831897Z #define INT_MIN (-__INT_MAX__ -1) 2025-05-07T19:46:51.1832154Z #define IOV_MAX 1024 2025-05-07T19:46:51.1832396Z #define LINE_MAX _POSIX2_LINE_MAX 2025-05-07T19:46:51.1832695Z #define LITTLE_ENDIAN __LITTLE_ENDIAN 2025-05-07T19:46:51.1832976Z #define LLONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:51.1833275Z #define LLONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:46:51.1833562Z #define LOGIN_NAME_MAX 256 2025-05-07T19:46:51.1833815Z #define LONG_BIT 64 2025-05-07T19:46:51.1834041Z #define LONG_LONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:51.1834360Z #define LONG_LONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:46:51.1834663Z #define LONG_MAX __LONG_MAX__ 2025-05-07T19:46:51.1834936Z #define LONG_MIN (-__LONG_MAX__ -1L) 2025-05-07T19:46:51.1835217Z #define L_ctermid 9 2025-05-07T19:46:51.1835424Z #define L_cuserid 9 2025-05-07T19:46:51.1835641Z #define L_tmpnam 20 2025-05-07T19:46:51.1835853Z #define MATH_ERREXCEPT 2 2025-05-07T19:46:51.1836092Z #define MATH_ERRNO 1 2025-05-07T19:46:51.1836306Z #define MAX_CANON 255 2025-05-07T19:46:51.1836536Z #define MAX_INPUT 255 2025-05-07T19:46:51.1836779Z #define MB_CUR_MAX (__ctype_get_mb_cur_max ()) 2025-05-07T19:46:51.1837089Z #define MB_LEN_MAX 16 2025-05-07T19:46:51.1837534Z #define MOD_CLKA ADJ_OFFSET_SINGLESHOT 2025-05-07T19:46:51.1837832Z #define MOD_CLKB ADJ_TICK 2025-05-07T19:46:51.1838111Z #define MOD_ESTERROR ADJ_ESTERROR 2025-05-07T19:46:51.1838396Z #define MOD_FREQUENCY ADJ_FREQUENCY 2025-05-07T19:46:51.1838668Z #define MOD_MAXERROR ADJ_MAXERROR 2025-05-07T19:46:51.1838945Z #define MOD_MICRO ADJ_MICRO 2025-05-07T19:46:51.1839196Z #define MOD_NANO ADJ_NANO 2025-05-07T19:46:51.1839430Z #define MOD_OFFSET ADJ_OFFSET 2025-05-07T19:46:51.1839695Z #define MOD_STATUS ADJ_STATUS 2025-05-07T19:46:51.1839943Z #define MOD_TAI ADJ_TAI 2025-05-07T19:46:51.1840191Z #define MOD_TIMECONST ADJ_TIMECONST 2025-05-07T19:46:51.1840457Z #define MQ_PRIO_MAX 32768 2025-05-07T19:46:51.1840707Z #define M_1_PI 0.31830988618379067154 2025-05-07T19:46:51.1841003Z #define M_1_PIl 0.318309886183790671537767526745028724L 2025-05-07T19:46:51.1841329Z #define M_2_PI 0.63661977236758134308 2025-05-07T19:46:51.1841636Z #define M_2_PIl 0.636619772367581343075535053490057448L 2025-05-07T19:46:51.1841954Z #define M_2_SQRTPI 1.12837916709551257390 2025-05-07T19:46:51.1842290Z #define M_2_SQRTPIl 1.128379167095512573896158903121545172L 2025-05-07T19:46:51.1842613Z #define M_E 2.7182818284590452354 2025-05-07T19:46:51.1842906Z #define M_El 2.718281828459045235360287471352662498L 2025-05-07T19:46:51.1843205Z #define M_LN10 2.30258509299404568402 2025-05-07T19:46:51.1843515Z #define M_LN10l 2.302585092994045684017991454684364208L 2025-05-07T19:46:51.1843818Z #define M_LN2 0.69314718055994530942 2025-05-07T19:46:51.1844225Z #define M_LN2l 0.693147180559945309417232121458176568L 2025-05-07T19:46:51.1844556Z #define M_LOG10E 0.43429448190325182765 2025-05-07T19:46:51.1844866Z #define M_LOG10El 0.434294481903251827651128918916605082L 2025-05-07T19:46:51.1845200Z #define M_LOG2E 1.4426950408889634074 2025-05-07T19:46:51.1845501Z #define M_LOG2El 1.442695040888963407359924681001892137L 2025-05-07T19:46:51.1845828Z #define M_PI 3.14159265358979323846 2025-05-07T19:46:51.1846089Z #define M_PI_2 1.57079632679489661923 2025-05-07T19:46:51.1846404Z #define M_PI_2l 1.570796326794896619231321691639751442L 2025-05-07T19:46:51.1846710Z #define M_PI_4 0.78539816339744830962 2025-05-07T19:46:51.1847018Z #define M_PI_4l 0.785398163397448309615660845819875721L 2025-05-07T19:46:51.1847365Z #define M_PIl 3.141592653589793238462643383279502884L 2025-05-07T19:46:51.1847669Z #define M_SQRT1_2 0.70710678118654752440 2025-05-07T19:46:51.1848172Z #define M_SQRT1_2l 0.707106781186547524400844362104849039L 2025-05-07T19:46:51.1848505Z #define M_SQRT2 1.41421356237309504880 2025-05-07T19:46:51.1848840Z #define M_SQRT2l 1.414213562373095048801688724209698079L 2025-05-07T19:46:51.1849158Z #define NAME_MAX 255 2025-05-07T19:46:51.1849404Z #define NAN (__builtin_nanf ("")) 2025-05-07T19:46:51.1849675Z #define NFDBITS __NFDBITS 2025-05-07T19:46:51.1849931Z #define NGROUPS_MAX 65536 2025-05-07T19:46:51.1850180Z #define NL_ARGMAX _POSIX_ARG_MAX 2025-05-07T19:46:51.1850471Z #define NL_LANGMAX _POSIX2_LINE_MAX 2025-05-07T19:46:51.1850766Z #define NL_MSGMAX INT_MAX 2025-05-07T19:46:51.1851005Z #define NL_NMAX INT_MAX 2025-05-07T19:46:51.1851258Z #define NL_SETMAX INT_MAX 2025-05-07T19:46:51.1851502Z #define NL_TEXTMAX INT_MAX 2025-05-07T19:46:51.1851759Z #define NULL __null 2025-05-07T19:46:51.1851975Z #define NZERO 20 2025-05-07T19:46:51.1852207Z #define OVERFLOW 3 2025-05-07T19:46:51.1852424Z #define PATH_MAX 4096 2025-05-07T19:46:51.1852684Z #define PDP_ENDIAN __PDP_ENDIAN 2025-05-07T19:46:51.1852946Z #define PIPE_BUF 4096 2025-05-07T19:46:51.1853187Z #define PLOSS 6 2025-05-07T19:46:51.1853671Z #define PTHREAD_DESTRUCTOR_ITERATIONS _POSIX_THREAD_DESTRUCTOR_ITERATIONS 2025-05-07T19:46:51.1854088Z #define PTHREAD_KEYS_MAX 1024 2025-05-07T19:46:51.1854359Z #define PTHREAD_STACK_MIN 16384 2025-05-07T19:46:51.1854612Z #define P_tmpdir "/tmp" 2025-05-07T19:46:51.1854854Z #define RAND_MAX 2147483647 2025-05-07T19:46:51.1855092Z #define RE_DUP_MAX (0x7fff) 2025-05-07T19:46:51.1855341Z #define RTSIG_MAX 32 2025-05-07T19:46:51.1855663Z #define SCHAR_MAX __SCHAR_MAX__ 2025-05-07T19:46:51.1855942Z #define SCHAR_MIN (-__SCHAR_MAX__-1) 2025-05-07T19:46:51.1856204Z #define SEEK_CUR 1 2025-05-07T19:46:51.1856429Z #define SEEK_DATA 3 2025-05-07T19:46:51.1856634Z #define SEEK_END 2 2025-05-07T19:46:51.1856859Z #define SEEK_HOLE 4 2025-05-07T19:46:51.1857082Z #define SEEK_SET 0 2025-05-07T19:46:51.1857303Z #define SEM_VALUE_MAX (2147483647) 2025-05-07T19:46:51.1857586Z #define SHRT_MAX __SHRT_MAX__ 2025-05-07T19:46:51.1857841Z #define SHRT_MIN (-__SHRT_MAX__ -1) 2025-05-07T19:46:51.1858111Z #define SING 2 2025-05-07T19:46:51.1858324Z #define SSIZE_MAX LONG_MAX 2025-05-07T19:46:51.1858576Z #define STA_CLK 0x8000 2025-05-07T19:46:51.1858802Z #define STA_CLOCKERR 0x1000 2025-05-07T19:46:51.1859055Z #define STA_DEL 0x0020 2025-05-07T19:46:51.1859273Z #define STA_FLL 0x0008 2025-05-07T19:46:51.1859638Z #define STA_FREQHOLD 0x0080 2025-05-07T19:46:51.1860066Z #define STA_INS 0x0010 2025-05-07T19:46:51.1860306Z #define STA_MODE 0x4000 2025-05-07T19:46:51.1860571Z #define STA_NANO 0x2000 2025-05-07T19:46:51.1860862Z #define STA_PLL 0x0001 2025-05-07T19:46:51.1861117Z #define STA_PPSERROR 0x0800 2025-05-07T19:46:51.1861373Z #define STA_PPSFREQ 0x0002 2025-05-07T19:46:51.1861648Z #define STA_PPSJITTER 0x0200 2025-05-07T19:46:51.1861916Z #define STA_PPSSIGNAL 0x0100 2025-05-07T19:46:51.1862191Z #define STA_PPSTIME 0x0004 2025-05-07T19:46:51.1862443Z #define STA_PPSWANDER 0x0400 2025-05-07T19:46:51.1863021Z #define STA_RONLY (STA_PPSSIGNAL | STA_PPSJITTER | STA_PPSWANDER | STA_PPSERROR | STA_CLOCKERR | STA_NANO | STA_MODE | STA_CLK) 2025-05-07T19:46:51.1863741Z #define STA_UNSYNC 0x0040 2025-05-07T19:46:51.1863995Z #define TIMER_ABSTIME 1 2025-05-07T19:46:51.1864242Z #define TIME_UTC 1 2025-05-07T19:46:51.1864454Z #define TLOSS 5 2025-05-07T19:46:51.1864687Z #define TMP_MAX 238328 2025-05-07T19:46:51.1864920Z #define TTY_NAME_MAX 32 2025-05-07T19:46:51.1865185Z #define UCHAR_MAX (__SCHAR_MAX__*2 +1) 2025-05-07T19:46:51.1865491Z #define UINT_MAX (__INT_MAX__ *2U +1U) 2025-05-07T19:46:51.1865838Z #define ULLONG_MAX (__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:46:51.1866313Z #define ULONG_LONG_MAX (__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:46:51.1866655Z #define ULONG_MAX (__LONG_MAX__ *2UL+1UL) 2025-05-07T19:46:51.1866951Z #define UNDERFLOW 4 2025-05-07T19:46:51.1867169Z #define USHRT_MAX (__SHRT_MAX__ *2 +1) 2025-05-07T19:46:51.1867447Z #define WCONTINUED 8 2025-05-07T19:46:51.1867657Z #define WEXITED 4 2025-05-07T19:46:51.1867963Z #define WEXITSTATUS(status) __WEXITSTATUS (__WAIT_INT (status)) 2025-05-07T19:46:51.1868421Z #define WIFCONTINUED(status) __WIFCONTINUED (__WAIT_INT (status)) 2025-05-07T19:46:51.1868867Z #define WIFEXITED(status) __WIFEXITED (__WAIT_INT (status)) 2025-05-07T19:46:51.1869285Z #define WIFSIGNALED(status) __WIFSIGNALED (__WAIT_INT (status)) 2025-05-07T19:46:51.1869732Z #define WIFSTOPPED(status) __WIFSTOPPED (__WAIT_INT (status)) 2025-05-07T19:46:51.1870090Z #define WNOHANG 1 2025-05-07T19:46:51.1870301Z #define WNOWAIT 0x01000000 2025-05-07T19:46:51.1870548Z #define WORD_BIT 32 2025-05-07T19:46:51.1870753Z #define WSTOPPED 2 2025-05-07T19:46:51.1871038Z #define WSTOPSIG(status) __WSTOPSIG (__WAIT_INT (status)) 2025-05-07T19:46:51.1871427Z #define WTERMSIG(status) __WTERMSIG (__WAIT_INT (status)) 2025-05-07T19:46:51.1871766Z #define WUNTRACED 2 2025-05-07T19:46:51.1871981Z #define XATTR_LIST_MAX 65536 2025-05-07T19:46:51.1872238Z #define XATTR_NAME_MAX 255 2025-05-07T19:46:51.1872473Z #define XATTR_SIZE_MAX 65536 2025-05-07T19:46:51.1872742Z #define X_TLOSS 1.41484755040568800000e+16 2025-05-07T19:46:51.1873024Z #define _ACRTIMP 2025-05-07T19:46:51.1873231Z #define _ALLOCA_H 1 2025-05-07T19:46:51.1873628Z #define _ASSERT_H 1 2025-05-07T19:46:51.1873844Z #define _ATFILE_SOURCE 1 2025-05-07T19:46:51.1874102Z #define _BITS_BYTESWAP_H 1 2025-05-07T19:46:51.1874357Z #define _BITS_POSIX1_LIM_H 1 2025-05-07T19:46:51.1874625Z #define _BITS_POSIX2_LIM_H 1 2025-05-07T19:46:51.1874886Z #define _BITS_PTHREADTYPES_H 1 2025-05-07T19:46:51.1875160Z #define _BITS_TIMEX_H 1 2025-05-07T19:46:51.1875468Z #define _BITS_TIME_H 1 2025-05-07T19:46:51.1875719Z #define _BITS_TYPESIZES_H 1 2025-05-07T19:46:51.1875968Z #define _BITS_TYPES_H 1 2025-05-07T19:46:51.1876212Z #define _BSD_SOURCE 1 2025-05-07T19:46:51.1876460Z #define _CONCEPT_CHECK_H 1 2025-05-07T19:46:51.1876708Z #define _CPP_TYPE_TRAITS_H 1 2025-05-07T19:46:51.1876969Z #define _CRTIMP 2025-05-07T19:46:51.1877176Z #define _CTYPE_H 1 2025-05-07T19:46:51.1877402Z #define _ENDIAN_H 1 2025-05-07T19:46:51.1877626Z #define _EXCEPTION_DEFINES_H 1 2025-05-07T19:46:51.1877902Z #define _EXT_NUMERIC_TRAITS 1 2025-05-07T19:46:51.1878162Z #define _EXT_TYPE_TRAITS 1 2025-05-07T19:46:51.1878416Z #define _FEATURES_H 1 2025-05-07T19:46:51.1878646Z #define _FUNCTEXCEPT_H 1 2025-05-07T19:46:51.1878900Z #define _GCC_LIMITS_H_ 2025-05-07T19:46:51.1879190Z #define _GLIBCXX11_DEPRECATED _GLIBCXX_DEPRECATED 2025-05-07T19:46:51.1879653Z #define _GLIBCXX11_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:51.1880109Z #define _GLIBCXX11_USE_C99_COMPLEX 1 2025-05-07T19:46:51.1880401Z #define _GLIBCXX11_USE_C99_MATH 1 2025-05-07T19:46:51.1880698Z #define _GLIBCXX11_USE_C99_STDIO 1 2025-05-07T19:46:51.1880985Z #define _GLIBCXX11_USE_C99_STDLIB 1 2025-05-07T19:46:51.1881277Z #define _GLIBCXX11_USE_C99_WCHAR 1 2025-05-07T19:46:51.1881560Z #define _GLIBCXX14_CONSTEXPR constexpr 2025-05-07T19:46:51.1881875Z #define _GLIBCXX17_CONSTEXPR constexpr 2025-05-07T19:46:51.1882303Z #define _GLIBCXX17_DEPRECATED [[__deprecated__]] 2025-05-07T19:46:51.1882813Z #define _GLIBCXX17_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:51.1883240Z #define _GLIBCXX17_INLINE inline 2025-05-07T19:46:51.1883502Z #define _GLIBCXX20_CONSTEXPR 2025-05-07T19:46:51.1883782Z #define _GLIBCXX20_DEPRECATED(MSG) 2025-05-07T19:46:51.1884076Z #define _GLIBCXX20_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:51.1884392Z #define _GLIBCXX98_USE_C99_COMPLEX 1 2025-05-07T19:46:51.1884667Z #define _GLIBCXX98_USE_C99_MATH 1 2025-05-07T19:46:51.1884949Z #define _GLIBCXX98_USE_C99_STDIO 1 2025-05-07T19:46:51.1885218Z #define _GLIBCXX98_USE_C99_STDLIB 1 2025-05-07T19:46:51.1885499Z #define _GLIBCXX98_USE_C99_WCHAR 1 2025-05-07T19:46:51.1885841Z #define _GLIBCXX_ABI_TAG_CXX11 __attribute ((__abi_tag__ ("cxx11"))) 2025-05-07T19:46:51.1886229Z #define _GLIBCXX_ATOMIC_BUILTINS 1 2025-05-07T19:46:51.1886530Z #define _GLIBCXX_BEGIN_EXTERN_C extern "C" { 2025-05-07T19:46:51.1886833Z #define _GLIBCXX_BEGIN_NAMESPACE_ALGO 2025-05-07T19:46:51.1887144Z #define _GLIBCXX_BEGIN_NAMESPACE_CONTAINER 2025-05-07T19:46:51.1887498Z #define _GLIBCXX_BEGIN_NAMESPACE_CXX11 namespace __cxx11 { 2025-05-07T19:46:51.1887860Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL 2025-05-07T19:46:51.1888263Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_BEGIN_NAMESPACE_CXX11 2025-05-07T19:46:51.1888700Z #define _GLIBCXX_BEGIN_NAMESPACE_VERSION 2025-05-07T19:46:51.1889008Z #define _GLIBCXX_BITS_SPECFUN_H 1 2025-05-07T19:46:51.1889275Z #define _GLIBCXX_BITS_STD_ABS_H 2025-05-07T19:46:51.1889548Z #define _GLIBCXX_CMATH 1 2025-05-07T19:46:51.1889813Z #define _GLIBCXX_CONST __attribute__ ((__const__)) 2025-05-07T19:46:51.1890148Z #define _GLIBCXX_CONSTEXPR constexpr 2025-05-07T19:46:51.1890427Z #define _GLIBCXX_CPU_DEFINES 1 2025-05-07T19:46:51.1890692Z #define _GLIBCXX_CSTDLIB 1 2025-05-07T19:46:51.1890932Z #define _GLIBCXX_CXX_CONFIG_H 1 2025-05-07T19:46:51.1891217Z #define _GLIBCXX_DARWIN_USE_64_BIT_INODE 1 2025-05-07T19:46:51.1891518Z #define _GLIBCXX_DEBUG_ASSERT(_Condition) 2025-05-07T19:46:51.1891828Z #define _GLIBCXX_DEBUG_ASSERTIONS_H 1 2025-05-07T19:46:51.1892129Z #define _GLIBCXX_DEBUG_MACRO_SWITCH_H 1 2025-05-07T19:46:51.1892418Z #define _GLIBCXX_DEBUG_ONLY(_Statement) 2025-05-07T19:46:51.1892741Z #define _GLIBCXX_DEBUG_PEDASSERT(_Condition) 2025-05-07T19:46:51.1893090Z #define _GLIBCXX_DEFAULT_ABI_TAG _GLIBCXX_ABI_TAG_CXX11 2025-05-07T19:46:51.1893495Z #define _GLIBCXX_DEPRECATED __attribute__ ((__deprecated__)) 2025-05-07T19:46:51.1894031Z #define _GLIBCXX_DEPRECATED_SUGGEST(ALT) __attribute__ ((__deprecated__ ("use '" ALT "' instead"))) 2025-05-07T19:46:51.1894614Z #define _GLIBCXX_DOUBLE_IS_IEEE_BINARY64 1 2025-05-07T19:46:51.1894903Z #define _GLIBCXX_END_EXTERN_C } 2025-05-07T19:46:51.1895181Z #define _GLIBCXX_END_NAMESPACE_ALGO 2025-05-07T19:46:51.1895487Z #define _GLIBCXX_END_NAMESPACE_CONTAINER 2025-05-07T19:46:51.1895784Z #define _GLIBCXX_END_NAMESPACE_CXX11 } 2025-05-07T19:46:51.1896089Z #define _GLIBCXX_END_NAMESPACE_LDBL 2025-05-07T19:46:51.1896467Z #define _GLIBCXX_END_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_END_NAMESPACE_CXX11 2025-05-07T19:46:51.1896893Z #define _GLIBCXX_END_NAMESPACE_VERSION 2025-05-07T19:46:51.1897181Z #define _GLIBCXX_EXTERN_TEMPLATE 1 2025-05-07T19:46:51.1897462Z #define _GLIBCXX_FAST_MATH 0 2025-05-07T19:46:51.1897723Z #define _GLIBCXX_FLOAT_IS_IEEE_BINARY32 1 2025-05-07T19:46:51.1898113Z #define _GLIBCXX_FORWARD(_Tp,__val) std::forward<_Tp>(__val) 2025-05-07T19:46:51.1898487Z #define _GLIBCXX_FULLY_DYNAMIC_STRING 0 2025-05-07T19:46:51.1898778Z #define _GLIBCXX_FWDREF(_Tp) _Tp&& 2025-05-07T19:46:51.1899062Z #define _GLIBCXX_HAS_GTHREADS 1 2025-05-07T19:46:51.1900178Z #define _GLIBCXX_HAS_NESTED_TYPE(_NTYPE) template> struct __has_##_NTYPE : false_type { }; template struct __has_##_NTYPE<_Tp, __void_t> : true_type { }; 2025-05-07T19:46:51.1901132Z #define _GLIBCXX_HAVE_ACOSF 1 2025-05-07T19:46:51.1901426Z #define _GLIBCXX_HAVE_ACOSL 1 2025-05-07T19:46:51.1901709Z #define _GLIBCXX_HAVE_ALIGNED_ALLOC 1 2025-05-07T19:46:51.1902031Z #define _GLIBCXX_HAVE_ARPA_INET_H 1 2025-05-07T19:46:51.1902401Z #define _GLIBCXX_HAVE_ASINF 1 2025-05-07T19:46:51.1902692Z #define _GLIBCXX_HAVE_ASINL 1 2025-05-07T19:46:51.1902984Z #define _GLIBCXX_HAVE_AS_SYMVER_DIRECTIVE 1 2025-05-07T19:46:51.1903319Z #define _GLIBCXX_HAVE_ATAN2F 1 2025-05-07T19:46:51.1903588Z #define _GLIBCXX_HAVE_ATAN2L 1 2025-05-07T19:46:51.1903878Z #define _GLIBCXX_HAVE_ATANF 1 2025-05-07T19:46:51.1904144Z #define _GLIBCXX_HAVE_ATANL 1 2025-05-07T19:46:51.1904449Z #define _GLIBCXX_HAVE_ATOMIC_LOCK_POLICY 1 2025-05-07T19:46:51.1904806Z #define _GLIBCXX_HAVE_ATTRIBUTE_VISIBILITY 1 2025-05-07T19:46:51.1905133Z #define _GLIBCXX_HAVE_AT_QUICK_EXIT 1 2025-05-07T19:46:51.1905477Z #define _GLIBCXX_HAVE_BUILTIN_HAS_UNIQ_OBJ_REP 1 2025-05-07T19:46:51.1905835Z #define _GLIBCXX_HAVE_BUILTIN_IS_AGGREGATE 1 2025-05-07T19:46:51.1906217Z #define _GLIBCXX_HAVE_BUILTIN_IS_CONSTANT_EVALUATED 1 2025-05-07T19:46:51.1906574Z #define _GLIBCXX_HAVE_BUILTIN_IS_SAME 1 2025-05-07T19:46:51.1906899Z #define _GLIBCXX_HAVE_BUILTIN_LAUNDER 1 2025-05-07T19:46:51.1907204Z #define _GLIBCXX_HAVE_CEILF 1 2025-05-07T19:46:51.1907495Z #define _GLIBCXX_HAVE_CEILL 1 2025-05-07T19:46:51.1907784Z #define _GLIBCXX_HAVE_COMPLEX_H 1 2025-05-07T19:46:51.1908068Z #define _GLIBCXX_HAVE_COSF 1 2025-05-07T19:46:51.1908359Z #define _GLIBCXX_HAVE_COSHF 1 2025-05-07T19:46:51.1908624Z #define _GLIBCXX_HAVE_COSHL 1 2025-05-07T19:46:51.1908906Z #define _GLIBCXX_HAVE_COSL 1 2025-05-07T19:46:51.1909176Z #define _GLIBCXX_HAVE_DIRENT_H 1 2025-05-07T19:46:51.1909467Z #define _GLIBCXX_HAVE_DLFCN_H 1 2025-05-07T19:46:51.1909739Z #define _GLIBCXX_HAVE_ENDIAN_H 1 2025-05-07T19:46:51.1910064Z #define _GLIBCXX_HAVE_EXCEPTION_PTR_SINCE_GCC46 1 2025-05-07T19:46:51.1910406Z #define _GLIBCXX_HAVE_EXECINFO_H 1 2025-05-07T19:46:51.1910706Z #define _GLIBCXX_HAVE_EXPF 1 2025-05-07T19:46:51.1910982Z #define _GLIBCXX_HAVE_EXPL 1 2025-05-07T19:46:51.1911243Z #define _GLIBCXX_HAVE_FABSF 1 2025-05-07T19:46:51.1911519Z #define _GLIBCXX_HAVE_FABSL 1 2025-05-07T19:46:51.1911788Z #define _GLIBCXX_HAVE_FCNTL_H 1 2025-05-07T19:46:51.1912072Z #define _GLIBCXX_HAVE_FENV_H 1 2025-05-07T19:46:51.1912457Z #define _GLIBCXX_HAVE_FINITE 1 2025-05-07T19:46:51.1912736Z #define _GLIBCXX_HAVE_FINITEF 1 2025-05-07T19:46:51.1912998Z #define _GLIBCXX_HAVE_FINITEL 1 2025-05-07T19:46:51.1913271Z #define _GLIBCXX_HAVE_FLOAT_H 1 2025-05-07T19:46:51.1913644Z #define _GLIBCXX_HAVE_FLOORF 1 2025-05-07T19:46:51.1913908Z #define _GLIBCXX_HAVE_FLOORL 1 2025-05-07T19:46:51.1914251Z #define _GLIBCXX_HAVE_FMODF 1 2025-05-07T19:46:51.1914498Z #define _GLIBCXX_HAVE_FMODL 1 2025-05-07T19:46:51.1914763Z #define _GLIBCXX_HAVE_FREXPF 1 2025-05-07T19:46:51.1915012Z #define _GLIBCXX_HAVE_FREXPL 1 2025-05-07T19:46:51.1915281Z #define _GLIBCXX_HAVE_GETIPINFO 1 2025-05-07T19:46:51.1915542Z #define _GLIBCXX_HAVE_GETS 1 2025-05-07T19:46:51.1915804Z #define _GLIBCXX_HAVE_HYPOT 1 2025-05-07T19:46:51.1916055Z #define _GLIBCXX_HAVE_HYPOTF 1 2025-05-07T19:46:51.1916319Z #define _GLIBCXX_HAVE_HYPOTL 1 2025-05-07T19:46:51.1916573Z #define _GLIBCXX_HAVE_ICONV 1 2025-05-07T19:46:51.1916836Z #define _GLIBCXX_HAVE_INT64_T 1 2025-05-07T19:46:51.1917108Z #define _GLIBCXX_HAVE_INT64_T_LONG 1 2025-05-07T19:46:51.1917386Z #define _GLIBCXX_HAVE_INTTYPES_H 1 2025-05-07T19:46:51.1917664Z #define _GLIBCXX_HAVE_ISINF 1 2025-05-07T19:46:51.1917911Z #define _GLIBCXX_HAVE_ISINFF 1 2025-05-07T19:46:51.1918173Z #define _GLIBCXX_HAVE_ISINFL 1 2025-05-07T19:46:51.1918420Z #define _GLIBCXX_HAVE_ISNAN 1 2025-05-07T19:46:51.1918684Z #define _GLIBCXX_HAVE_ISNANF 1 2025-05-07T19:46:51.1918930Z #define _GLIBCXX_HAVE_ISNANL 1 2025-05-07T19:46:51.1919196Z #define _GLIBCXX_HAVE_ISWBLANK 1 2025-05-07T19:46:51.1919459Z #define _GLIBCXX_HAVE_LC_MESSAGES 1 2025-05-07T19:46:51.1919742Z #define _GLIBCXX_HAVE_LDEXPF 1 2025-05-07T19:46:51.1920006Z #define _GLIBCXX_HAVE_LDEXPL 1 2025-05-07T19:46:51.1920260Z #define _GLIBCXX_HAVE_LIMIT_AS 1 2025-05-07T19:46:51.1920534Z #define _GLIBCXX_HAVE_LIMIT_DATA 1 2025-05-07T19:46:51.1920799Z #define _GLIBCXX_HAVE_LIMIT_FSIZE 1 2025-05-07T19:46:51.1921147Z #define _GLIBCXX_HAVE_LIMIT_RSS 1 2025-05-07T19:46:51.1921415Z #define _GLIBCXX_HAVE_LIMIT_VMEM 0 2025-05-07T19:46:51.1921696Z #define _GLIBCXX_HAVE_LINK 1 2025-05-07T19:46:51.1922111Z #define _GLIBCXX_HAVE_LINUX_FUTEX 1 2025-05-07T19:46:51.1922579Z #define _GLIBCXX_HAVE_LINUX_RANDOM_H 1 2025-05-07T19:46:51.1922888Z #define _GLIBCXX_HAVE_LINUX_TYPES_H 1 2025-05-07T19:46:51.1923252Z #define _GLIBCXX_HAVE_LOCALE_H 1 2025-05-07T19:46:51.1923552Z #define _GLIBCXX_HAVE_LOG10F 1 2025-05-07T19:46:51.1923868Z #define _GLIBCXX_HAVE_LOG10L 1 2025-05-07T19:46:51.1924147Z #define _GLIBCXX_HAVE_LOGF 1 2025-05-07T19:46:51.1924405Z #define _GLIBCXX_HAVE_LOGL 1 2025-05-07T19:46:51.1924683Z #define _GLIBCXX_HAVE_MBSTATE_T 1 2025-05-07T19:46:51.1924965Z #define _GLIBCXX_HAVE_MEMALIGN 1 2025-05-07T19:46:51.1925254Z #define _GLIBCXX_HAVE_MEMORY_H 1 2025-05-07T19:46:51.1925708Z #define _GLIBCXX_HAVE_MODF 1 2025-05-07T19:46:51.1925985Z #define _GLIBCXX_HAVE_MODFF 1 2025-05-07T19:46:51.1926249Z #define _GLIBCXX_HAVE_MODFL 1 2025-05-07T19:46:51.1926532Z #define _GLIBCXX_HAVE_NETDB_H 1 2025-05-07T19:46:51.1926823Z #define _GLIBCXX_HAVE_NETINET_IN_H 1 2025-05-07T19:46:51.1927126Z #define _GLIBCXX_HAVE_NETINET_TCP_H 1 2025-05-07T19:46:51.1927445Z #define _GLIBCXX_HAVE_OBSOLETE_ISINF 1 2025-05-07T19:46:51.1927751Z #define _GLIBCXX_HAVE_OBSOLETE_ISNAN 1 2025-05-07T19:46:51.1928064Z #define _GLIBCXX_HAVE_POLL 1 2025-05-07T19:46:51.1928329Z #define _GLIBCXX_HAVE_POLL_H 1 2025-05-07T19:46:51.1928619Z #define _GLIBCXX_HAVE_POSIX_MEMALIGN 1 2025-05-07T19:46:51.1928924Z #define _GLIBCXX_HAVE_POSIX_SEMAPHORE 1 2025-05-07T19:46:51.1929241Z #define _GLIBCXX_HAVE_POWF 1 2025-05-07T19:46:51.1929518Z #define _GLIBCXX_HAVE_POWL 1 2025-05-07T19:46:51.1929785Z #define _GLIBCXX_HAVE_QUICK_EXIT 1 2025-05-07T19:46:51.1930084Z #define _GLIBCXX_HAVE_READLINK 1 2025-05-07T19:46:51.1930357Z #define _GLIBCXX_HAVE_SETENV 1 2025-05-07T19:46:51.1930636Z #define _GLIBCXX_HAVE_SINCOS 1 2025-05-07T19:46:51.1930904Z #define _GLIBCXX_HAVE_SINCOSF 1 2025-05-07T19:46:51.1931191Z #define _GLIBCXX_HAVE_SINCOSL 1 2025-05-07T19:46:51.1931459Z #define _GLIBCXX_HAVE_SINF 1 2025-05-07T19:46:51.1931736Z #define _GLIBCXX_HAVE_SINHF 1 2025-05-07T19:46:51.1932001Z #define _GLIBCXX_HAVE_SINHL 1 2025-05-07T19:46:51.1932280Z #define _GLIBCXX_HAVE_SINL 1 2025-05-07T19:46:51.1932562Z #define _GLIBCXX_HAVE_SOCKATMARK 1 2025-05-07T19:46:51.1932856Z #define _GLIBCXX_HAVE_SQRTF 1 2025-05-07T19:46:51.1933135Z #define _GLIBCXX_HAVE_SQRTL 1 2025-05-07T19:46:51.1933546Z #define _GLIBCXX_HAVE_STDALIGN_H 1 2025-05-07T19:46:51.1933866Z #define _GLIBCXX_HAVE_STDBOOL_H 1 2025-05-07T19:46:51.1934156Z #define _GLIBCXX_HAVE_STDINT_H 1 2025-05-07T19:46:51.1934456Z #define _GLIBCXX_HAVE_STDLIB_H 1 2025-05-07T19:46:51.1934746Z #define _GLIBCXX_HAVE_STRERROR_L 1 2025-05-07T19:46:51.1935070Z #define _GLIBCXX_HAVE_STRERROR_R 1 2025-05-07T19:46:51.1935484Z #define _GLIBCXX_HAVE_STRINGS_H 1 2025-05-07T19:46:51.1935769Z #define _GLIBCXX_HAVE_STRING_H 1 2025-05-07T19:46:51.1936069Z #define _GLIBCXX_HAVE_STRTOF 1 2025-05-07T19:46:51.1936341Z #define _GLIBCXX_HAVE_STRTOLD 1 2025-05-07T19:46:51.1936654Z #define _GLIBCXX_HAVE_STRUCT_DIRENT_D_TYPE 1 2025-05-07T19:46:51.1936970Z #define _GLIBCXX_HAVE_STRXFRM_L 1 2025-05-07T19:46:51.1937264Z #define _GLIBCXX_HAVE_SYMLINK 1 2025-05-07T19:46:51.1937608Z #define _GLIBCXX_HAVE_SYMVER_SYMBOL_RENAMING_RUNTIME_SUPPORT 1 2025-05-07T19:46:51.1938008Z #define _GLIBCXX_HAVE_SYS_IOCTL_H 1 2025-05-07T19:46:51.1938300Z #define _GLIBCXX_HAVE_SYS_IPC_H 1 2025-05-07T19:46:51.1938600Z #define _GLIBCXX_HAVE_SYS_PARAM_H 1 2025-05-07T19:46:51.1938910Z #define _GLIBCXX_HAVE_SYS_RESOURCE_H 1 2025-05-07T19:46:51.1939202Z #define _GLIBCXX_HAVE_SYS_SEM_H 1 2025-05-07T19:46:51.1939573Z #define _GLIBCXX_HAVE_SYS_SOCKET_H 1 2025-05-07T19:46:51.1939874Z #define _GLIBCXX_HAVE_SYS_STATVFS_H 1 2025-05-07T19:46:51.1940362Z #define _GLIBCXX_HAVE_SYS_STAT_H 1 2025-05-07T19:46:51.1940654Z #define _GLIBCXX_HAVE_SYS_SYSINFO_H 1 2025-05-07T19:46:51.1940968Z #define _GLIBCXX_HAVE_SYS_TIME_H 1 2025-05-07T19:46:51.1941372Z #define _GLIBCXX_HAVE_SYS_TYPES_H 1 2025-05-07T19:46:51.1941677Z #define _GLIBCXX_HAVE_SYS_UIO_H 1 2025-05-07T19:46:51.1941954Z #define _GLIBCXX_HAVE_S_ISREG 1 2025-05-07T19:46:51.1942235Z #define _GLIBCXX_HAVE_TANF 1 2025-05-07T19:46:51.1942515Z #define _GLIBCXX_HAVE_TANHF 1 2025-05-07T19:46:51.1942778Z #define _GLIBCXX_HAVE_TANHL 1 2025-05-07T19:46:51.1943051Z #define _GLIBCXX_HAVE_TANL 1 2025-05-07T19:46:51.1943315Z #define _GLIBCXX_HAVE_TGMATH_H 1 2025-05-07T19:46:51.1943624Z #define _GLIBCXX_HAVE_TLS 1 2025-05-07T19:46:51.1943886Z #define _GLIBCXX_HAVE_TRUNCATE 1 2025-05-07T19:46:51.1944176Z #define _GLIBCXX_HAVE_UNISTD_H 1 2025-05-07T19:46:51.1944452Z #define _GLIBCXX_HAVE_USELOCALE 1 2025-05-07T19:46:51.1944745Z #define _GLIBCXX_HAVE_UTIME_H 1 2025-05-07T19:46:51.1945018Z #define _GLIBCXX_HAVE_VFWSCANF 1 2025-05-07T19:46:51.1945309Z #define _GLIBCXX_HAVE_VSWSCANF 1 2025-05-07T19:46:51.1945605Z #define _GLIBCXX_HAVE_VWSCANF 1 2025-05-07T19:46:51.1945871Z #define _GLIBCXX_HAVE_WCHAR_H 1 2025-05-07T19:46:51.1946158Z #define _GLIBCXX_HAVE_WCSTOF 1 2025-05-07T19:46:51.1946428Z #define _GLIBCXX_HAVE_WCTYPE_H 1 2025-05-07T19:46:51.1946715Z #define _GLIBCXX_HAVE_WRITEV 1 2025-05-07T19:46:51.1946982Z #define _GLIBCXX_HAVE_XLOCALE_H 1 2025-05-07T19:46:51.1947266Z #define _GLIBCXX_HOSTED 1 2025-05-07T19:46:51.1947521Z #define _GLIBCXX_ICONV_CONST 2025-05-07T19:46:51.1947802Z #define _GLIBCXX_INLINE_VERSION 0 2025-05-07T19:46:51.1948088Z #define _GLIBCXX_LT_OBJDIR ".libs/" 2025-05-07T19:46:51.1948610Z #define _GLIBCXX_MAKE_MOVE_IF_NOEXCEPT_ITERATOR(_Iter) std::__make_move_if_noexcept_iterator(_Iter) 2025-05-07T19:46:51.1949264Z #define _GLIBCXX_MAKE_MOVE_ITERATOR(_Iter) std::make_move_iterator(_Iter) 2025-05-07T19:46:51.1949690Z #define _GLIBCXX_MANGLE_SIZE_T m 2025-05-07T19:46:51.1949978Z #define _GLIBCXX_MATH_H 1 2025-05-07T19:46:51.1950255Z #define _GLIBCXX_MOVE(__val) std::move(__val) 2025-05-07T19:46:51.1950656Z #define _GLIBCXX_MOVE3(_Tp,_Up,_Vp) std::move(_Tp, _Up, _Vp) 2025-05-07T19:46:51.1951160Z #define _GLIBCXX_MOVE_BACKWARD3(_Tp,_Up,_Vp) std::move_backward(_Tp, _Up, _Vp) 2025-05-07T19:46:51.1951630Z #define _GLIBCXX_NAMESPACE_CXX11 __cxx11:: 2025-05-07T19:46:51.1951947Z #define _GLIBCXX_NAMESPACE_LDBL 2025-05-07T19:46:51.1952417Z #define _GLIBCXX_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_NAMESPACE_CXX11 2025-05-07T19:46:51.1952959Z #define _GLIBCXX_NATIVE_THREAD_ID (__gthread_active_p() ? __gthread_self() : (__gthread_t)1) 2025-05-07T19:46:51.1953491Z #define _GLIBCXX_NODISCARD [[__nodiscard__]] 2025-05-07T19:46:51.1953812Z #define _GLIBCXX_NOEXCEPT noexcept 2025-05-07T19:46:51.1954121Z #define _GLIBCXX_NOEXCEPT_IF(...) noexcept(__VA_ARGS__) 2025-05-07T19:46:51.1954457Z #define _GLIBCXX_NOEXCEPT_PARM , bool _NE 2025-05-07T19:46:51.1954755Z #define _GLIBCXX_NOEXCEPT_QUAL noexcept (_NE) 2025-05-07T19:46:51.1955102Z #define _GLIBCXX_NORETURN __attribute__ ((__noreturn__)) 2025-05-07T19:46:51.1955455Z #define _GLIBCXX_NOTHROW _GLIBCXX_USE_NOEXCEPT 2025-05-07T19:46:51.1955850Z #define _GLIBCXX_NO_OBSOLETE_ISINF_ISNAN_DYNAMIC __GLIBC_PREREQ(2,23) 2025-05-07T19:46:51.1956226Z #define _GLIBCXX_NUMERIC_LIMITS 1 2025-05-07T19:46:51.1956471Z #define _GLIBCXX_OS_DEFINES 1 2025-05-07T19:46:51.1956723Z #define _GLIBCXX_PACKAGE_BUGREPORT "" 2025-05-07T19:46:51.1957017Z #define _GLIBCXX_PACKAGE_NAME "package-unused" 2025-05-07T19:46:51.1957400Z #define _GLIBCXX_PACKAGE_STRING "package-unused version-unused" 2025-05-07T19:46:51.1957773Z #define _GLIBCXX_PACKAGE_TARNAME "libstdc++" 2025-05-07T19:46:51.1958068Z #define _GLIBCXX_PACKAGE_URL "" 2025-05-07T19:46:51.1958389Z #define _GLIBCXX_PACKAGE__GLIBCXX_VERSION "version-unused" 2025-05-07T19:46:51.1958728Z #define _GLIBCXX_PREDEFINED_OPS_H 1 2025-05-07T19:46:51.1959011Z #define _GLIBCXX_PSEUDO_VISIBILITY(V) 2025-05-07T19:46:51.1959296Z #define _GLIBCXX_PURE __attribute__ ((__pure__)) 2025-05-07T19:46:51.1959591Z #define _GLIBCXX_RELEASE 11 2025-05-07T19:46:51.1959821Z #define _GLIBCXX_RES_LIMITS 1 2025-05-07T19:46:51.1960064Z #define _GLIBCXX_STDC_HEADERS 1 2025-05-07T19:46:51.1960363Z #define _GLIBCXX_STDIO_EOF -1 2025-05-07T19:46:51.1960612Z #define _GLIBCXX_STDIO_SEEK_CUR 1 2025-05-07T19:46:51.1960869Z #define _GLIBCXX_STDIO_SEEK_END 2 2025-05-07T19:46:51.1961109Z #define _GLIBCXX_STDLIB_H 1 2025-05-07T19:46:51.1961339Z #define _GLIBCXX_STD_A std 2025-05-07T19:46:51.1961555Z #define _GLIBCXX_STD_C std 2025-05-07T19:46:51.1961783Z #define _GLIBCXX_SYMVER 1 2025-05-07T19:46:51.1962006Z #define _GLIBCXX_SYMVER_GNU 1 2025-05-07T19:46:51.1962293Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_AFTER(A) 2025-05-07T19:46:51.1962636Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_BEFORE(A) 2025-05-07T19:46:51.1962949Z #define _GLIBCXX_THROW(_EXC) 2025-05-07T19:46:51.1963226Z #define _GLIBCXX_THROW_OR_ABORT(_EXC) (throw (_EXC)) 2025-05-07T19:46:51.1963554Z #define _GLIBCXX_TR1_BESSEL_FUNCTION_TCC 1 2025-05-07T19:46:51.1963855Z #define _GLIBCXX_TR1_BETA_FUNCTION_TCC 1 2025-05-07T19:46:51.1964131Z #define _GLIBCXX_TR1_ELL_INTEGRAL_TCC 1 2025-05-07T19:46:51.1964414Z #define _GLIBCXX_TR1_EXP_INTEGRAL_TCC 1 2025-05-07T19:46:51.1964682Z #define _GLIBCXX_TR1_GAMMA_TCC 1 2025-05-07T19:46:51.1964946Z #define _GLIBCXX_TR1_HYPERGEOMETRIC_TCC 1 2025-05-07T19:46:51.1965241Z #define _GLIBCXX_TR1_LEGENDRE_FUNCTION_TCC 1 2025-05-07T19:46:51.1965555Z #define _GLIBCXX_TR1_MODIFIED_BESSEL_FUNC_TCC 1 2025-05-07T19:46:51.1965858Z #define _GLIBCXX_TR1_POLY_HERMITE_TCC 1 2025-05-07T19:46:51.1966161Z #define _GLIBCXX_TR1_POLY_LAGUERRE_TCC 1 2025-05-07T19:46:51.1966468Z #define _GLIBCXX_TR1_RIEMANN_ZETA_TCC 1 2025-05-07T19:46:51.1966771Z #define _GLIBCXX_TR1_SPECIAL_FUNCTION_UTIL_H 1 2025-05-07T19:46:51.1967260Z #define _GLIBCXX_TXN_SAFE 2025-05-07T19:46:51.1967508Z #define _GLIBCXX_TXN_SAFE_DYN 2025-05-07T19:46:51.1967772Z #define _GLIBCXX_TYPE_TRAITS 1 2025-05-07T19:46:51.1968033Z #define _GLIBCXX_USE_ALLOCATOR_NEW 1 2025-05-07T19:46:51.1968314Z #define _GLIBCXX_USE_C99 1 2025-05-07T19:46:51.1968615Z #define _GLIBCXX_USE_C99_COMPLEX _GLIBCXX11_USE_C99_COMPLEX 2025-05-07T19:46:51.1968987Z #define _GLIBCXX_USE_C99_COMPLEX_TR1 1 2025-05-07T19:46:51.1969277Z #define _GLIBCXX_USE_C99_CTYPE_TR1 1 2025-05-07T19:46:51.1969573Z #define _GLIBCXX_USE_C99_FENV_TR1 1 2025-05-07T19:46:51.1969868Z #define _GLIBCXX_USE_C99_INTTYPES_TR1 1 2025-05-07T19:46:51.1970179Z #define _GLIBCXX_USE_C99_INTTYPES_WCHAR_T_TR1 1 2025-05-07T19:46:51.1970540Z #define _GLIBCXX_USE_C99_MATH _GLIBCXX11_USE_C99_MATH 2025-05-07T19:46:51.1970871Z #define _GLIBCXX_USE_C99_MATH_TR1 1 2025-05-07T19:46:51.1973656Z #define _GLIBCXX_USE_C99_STDINT_TR1 1 2025-05-07T19:46:51.1973985Z #define _GLIBCXX_USE_C99_STDIO _GLIBCXX11_USE_C99_STDIO 2025-05-07T19:46:51.1974385Z #define _GLIBCXX_USE_C99_STDLIB _GLIBCXX11_USE_C99_STDLIB 2025-05-07T19:46:51.1974771Z #define _GLIBCXX_USE_C99_WCHAR _GLIBCXX11_USE_C99_WCHAR 2025-05-07T19:46:51.1975118Z #define _GLIBCXX_USE_CLOCK_MONOTONIC 1 2025-05-07T19:46:51.1975412Z #define _GLIBCXX_USE_CLOCK_REALTIME 1 2025-05-07T19:46:51.1975703Z #define _GLIBCXX_USE_CONSTEXPR constexpr 2025-05-07T19:46:51.1975997Z #define _GLIBCXX_USE_CXX11_ABI 1 2025-05-07T19:46:51.1976268Z #define _GLIBCXX_USE_DECIMAL_FLOAT 1 2025-05-07T19:46:51.1976561Z #define _GLIBCXX_USE_DEPRECATED 1 2025-05-07T19:46:51.1976832Z #define _GLIBCXX_USE_DEV_RANDOM 1 2025-05-07T19:46:51.1977111Z #define _GLIBCXX_USE_DUAL_ABI 1 2025-05-07T19:46:51.1977368Z #define _GLIBCXX_USE_FCHMOD 1 2025-05-07T19:46:51.1977626Z #define _GLIBCXX_USE_FCHMODAT 1 2025-05-07T19:46:51.1977883Z #define _GLIBCXX_USE_FLOAT128 1 2025-05-07T19:46:51.1978144Z #define _GLIBCXX_USE_GETTIMEOFDAY 1 2025-05-07T19:46:51.1978424Z #define _GLIBCXX_USE_GET_NPROCS 1 2025-05-07T19:46:51.1978679Z #define _GLIBCXX_USE_INT128 1 2025-05-07T19:46:51.1978936Z #define _GLIBCXX_USE_LFS 1 2025-05-07T19:46:51.1979188Z #define _GLIBCXX_USE_LONG_LONG 1 2025-05-07T19:46:51.1979451Z #define _GLIBCXX_USE_LSTAT 1 2025-05-07T19:46:51.1979809Z #define _GLIBCXX_USE_NANOSLEEP 1 2025-05-07T19:46:51.1980282Z #define _GLIBCXX_USE_NOEXCEPT noexcept 2025-05-07T19:46:51.1980611Z #define _GLIBCXX_USE_PTHREAD_RWLOCK_T 1 2025-05-07T19:46:51.1981067Z #define _GLIBCXX_USE_RANDOM_TR1 1 2025-05-07T19:46:51.1981397Z #define _GLIBCXX_USE_REALPATH 1 2025-05-07T19:46:51.1981687Z #define _GLIBCXX_USE_SCHED_YIELD 1 2025-05-07T19:46:51.1982040Z #define _GLIBCXX_USE_SC_NPROCESSORS_ONLN 1 2025-05-07T19:46:51.1982375Z #define _GLIBCXX_USE_SENDFILE 1 2025-05-07T19:46:51.1982702Z #define _GLIBCXX_USE_STD_SPEC_FUNCS 1 2025-05-07T19:46:51.1983016Z #define _GLIBCXX_USE_ST_MTIM 1 2025-05-07T19:46:51.1983407Z #define _GLIBCXX_USE_TBB_PAR_BACKEND __has_include() 2025-05-07T19:46:51.1983809Z #define _GLIBCXX_USE_TMPNAM 1 2025-05-07T19:46:51.1984115Z #define _GLIBCXX_USE_UTIME 1 2025-05-07T19:46:51.1984401Z #define _GLIBCXX_USE_UTIMENSAT 1 2025-05-07T19:46:51.1984724Z #define _GLIBCXX_USE_WCHAR_T 1 2025-05-07T19:46:51.1985048Z #define _GLIBCXX_USE_WEAK_REF __GXX_WEAK__ 2025-05-07T19:46:51.1985369Z #define _GLIBCXX_UTILITY 1 2025-05-07T19:46:51.1985658Z #define _GLIBCXX_VERBOSE 1 2025-05-07T19:46:51.1986030Z #define _GLIBCXX_VISIBILITY(V) __attribute__ ((__visibility__ (#V))) 2025-05-07T19:46:51.1986485Z #define _GLIBCXX_WEAK_DEFINITION 2025-05-07T19:46:51.1986779Z #define _GLIBCXX_X86_RDRAND 1 2025-05-07T19:46:51.1987080Z #define _GLIBCXX_X86_RDSEED 1 2025-05-07T19:46:51.1987352Z #define _GNU_SOURCE 1 2025-05-07T19:46:51.1987636Z #define _GTHREAD_USE_MUTEX_TIMEDLOCK 1 2025-05-07T19:46:51.1987938Z #define _G_BUFSIZ 8192 2025-05-07T19:46:51.1988209Z #define _G_HAVE_MMAP 1 2025-05-07T19:46:51.1988492Z #define _G_HAVE_MREMAP 1 2025-05-07T19:46:51.1988812Z #define _G_HAVE_ST_BLKSIZE defined (_STATBUF_ST_BLKSIZE) 2025-05-07T19:46:51.1989214Z #define _G_IO_IO_FILE_VERSION 0x20001 2025-05-07T19:46:51.1989511Z #define _G_config_h 1 2025-05-07T19:46:51.1989783Z #define _G_va_list __gnuc_va_list 2025-05-07T19:46:51.1990070Z #define _INITIALIZER_LIST 2025-05-07T19:46:51.1990351Z #define _IOFBF 0 2025-05-07T19:46:51.1990575Z #define _IOLBF 1 2025-05-07T19:46:51.1990814Z #define _IONBF 2 2025-05-07T19:46:51.1991044Z #define _IOS_APPEND 8 2025-05-07T19:46:51.1991313Z #define _IOS_ATEND 4 2025-05-07T19:46:51.1991574Z #define _IOS_BIN 128 2025-05-07T19:46:51.1991808Z #define _IOS_INPUT 1 2025-05-07T19:46:51.1992070Z #define _IOS_NOCREATE 32 2025-05-07T19:46:51.1992461Z #define _IOS_NOREPLACE 64 2025-05-07T19:46:51.1992891Z #define _IOS_OUTPUT 2 2025-05-07T19:46:51.1993132Z #define _IOS_TRUNC 16 2025-05-07T19:46:51.1993401Z #define _IO_BAD_SEEN 0x4000 2025-05-07T19:46:51.1993718Z #define _IO_BE(expr,res) __builtin_expect ((expr), res) 2025-05-07T19:46:51.1994175Z #define _IO_BOOLALPHA 0200000 2025-05-07T19:46:51.1994456Z #define _IO_BUFSIZ _G_BUFSIZ 2025-05-07T19:46:51.1994774Z #define _IO_CURRENTLY_PUTTING 0x800 2025-05-07T19:46:51.1995087Z #define _IO_DEC 020 2025-05-07T19:46:51.1995333Z #define _IO_DELETE_DONT_CLOSE 0x40 2025-05-07T19:46:51.1995659Z #define _IO_DONT_CLOSE 0100000 2025-05-07T19:46:51.1995932Z #define _IO_EOF_SEEN 0x10 2025-05-07T19:46:51.1996220Z #define _IO_ERR_SEEN 0x20 2025-05-07T19:46:51.1996474Z #define _IO_FIXED 010000 2025-05-07T19:46:51.1996748Z #define _IO_FLAGS2_MMAP 1 2025-05-07T19:46:51.1997012Z #define _IO_FLAGS2_NOTCANCEL 2 2025-05-07T19:46:51.1997310Z #define _IO_FLAGS2_USER_WBUF 8 2025-05-07T19:46:51.1997612Z #define _IO_HAVE_ST_BLKSIZE _G_HAVE_ST_BLKSIZE 2025-05-07T19:46:51.1997961Z #define _IO_HEX 0100 2025-05-07T19:46:51.1998208Z #define _IO_INTERNAL 010 2025-05-07T19:46:51.1998499Z #define _IO_IN_BACKUP 0x100 2025-05-07T19:46:51.1998800Z #define _IO_IS_APPENDING 0x1000 2025-05-07T19:46:51.1999085Z #define _IO_IS_FILEBUF 0x2000 2025-05-07T19:46:51.1999370Z #define _IO_LEFT 02 2025-05-07T19:46:51.1999610Z #define _IO_LINE_BUF 0x200 2025-05-07T19:46:51.1999891Z #define _IO_LINKED 0x80 2025-05-07T19:46:51.2000144Z #define _IO_MAGIC 0xFBAD0000 2025-05-07T19:46:51.2000446Z #define _IO_MAGIC_MASK 0xFFFF0000 2025-05-07T19:46:51.2000729Z #define _IO_NO_READS 4 2025-05-07T19:46:51.2001004Z #define _IO_NO_WRITES 8 2025-05-07T19:46:51.2001248Z #define _IO_OCT 040 2025-05-07T19:46:51.2001652Z #define _IO_PENDING_OUTPUT_COUNT(_fp) ((_fp)->_IO_write_ptr - (_fp)->_IO_write_base) 2025-05-07T19:46:51.2002181Z #define _IO_RIGHT 04 2025-05-07T19:46:51.2002430Z #define _IO_SCIENTIFIC 04000 2025-05-07T19:46:51.2002730Z #define _IO_SHOWBASE 0200 2025-05-07T19:46:51.2002994Z #define _IO_SHOWPOINT 0400 2025-05-07T19:46:51.2003289Z #define _IO_SHOWPOS 02000 2025-05-07T19:46:51.2003543Z #define _IO_SKIPWS 01 2025-05-07T19:46:51.2003821Z #define _IO_STDIO 040000 2025-05-07T19:46:51.2004077Z #define _IO_STDIO_H 2025-05-07T19:46:51.2004362Z #define _IO_TIED_PUT_GET 0x400 2025-05-07T19:46:51.2004643Z #define _IO_UNBUFFERED 2 2025-05-07T19:46:51.2005041Z #define _IO_UNIFIED_JUMPTABLES 1 2025-05-07T19:46:51.2005338Z #define _IO_UNITBUF 020000 2025-05-07T19:46:51.2005584Z #define _IO_UPPERCASE 01000 2025-05-07T19:46:51.2005866Z #define _IO_USER_BUF 1 2025-05-07T19:46:51.2006099Z #define _IO_USER_LOCK 0x8000 2025-05-07T19:46:51.2006378Z #define _IO_cleanup_region_end(_Doit) 2025-05-07T19:46:51.2006672Z #define _IO_cleanup_region_start(_fct,_fp) 2025-05-07T19:46:51.2007061Z #define _IO_feof_unlocked(__fp) (((__fp)->_flags & _IO_EOF_SEEN) != 0) 2025-05-07T19:46:51.2007520Z #define _IO_ferror_unlocked(__fp) (((__fp)->_flags & _IO_ERR_SEEN) != 0) 2025-05-07T19:46:51.2007907Z #define _IO_file_flags _flags 2025-05-07T19:46:51.2008155Z #define _IO_flockfile(_fp) 2025-05-07T19:46:51.2008410Z #define _IO_fpos64_t _G_fpos64_t 2025-05-07T19:46:51.2008678Z #define _IO_fpos_t _G_fpos_t 2025-05-07T19:46:51.2008928Z #define _IO_ftrylockfile(_fp) 2025-05-07T19:46:51.2009204Z #define _IO_funlockfile(_fp) 2025-05-07T19:46:51.2009711Z #define _IO_getc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) ? __uflow (_fp) : *(unsigned char *) (_fp)->_IO_read_ptr++) 2025-05-07T19:46:51.2010278Z #define _IO_iconv_t _G_iconv_t 2025-05-07T19:46:51.2010547Z #define _IO_off64_t __off64_t 2025-05-07T19:46:51.2010830Z #define _IO_off_t __off_t 2025-05-07T19:46:51.2011112Z #define _IO_peekc(_fp) _IO_peekc_unlocked (_fp) 2025-05-07T19:46:51.2011748Z #define _IO_peekc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) && __underflow (_fp) == EOF ? EOF : *(unsigned char *) (_fp)->_IO_read_ptr) 2025-05-07T19:46:51.2012329Z #define _IO_pid_t __pid_t 2025-05-07T19:46:51.2012921Z #define _IO_putc_unlocked(_ch,_fp) (_IO_BE ((_fp)->_IO_write_ptr >= (_fp)->_IO_write_end, 0) ? __overflow (_fp, (unsigned char) (_ch)) : (unsigned char) (*(_fp)->_IO_write_ptr++ = (_ch))) 2025-05-07T19:46:51.2013564Z #define _IO_size_t size_t 2025-05-07T19:46:51.2013891Z #define _IO_ssize_t __ssize_t 2025-05-07T19:46:51.2014185Z #define _IO_stderr ((_IO_FILE*)(&_IO_2_1_stderr_)) 2025-05-07T19:46:51.2014533Z #define _IO_stdin ((_IO_FILE*)(&_IO_2_1_stdin_)) 2025-05-07T19:46:51.2014862Z #define _IO_stdout ((_IO_FILE*)(&_IO_2_1_stdout_)) 2025-05-07T19:46:51.2015180Z #define _IO_uid_t __uid_t 2025-05-07T19:46:51.2015423Z #define _IO_va_list __gnuc_va_list 2025-05-07T19:46:51.2015711Z #define _IO_wint_t wint_t 2025-05-07T19:46:51.2015937Z #define _ISOC11_SOURCE 1 2025-05-07T19:46:51.2016183Z #define _ISOC95_SOURCE 1 2025-05-07T19:46:51.2016419Z #define _ISOC99_SOURCE 1 2025-05-07T19:46:51.2016760Z #define _ISbit(bit) ((bit) < 8 ? ((1 << (bit)) << 8) : ((1 << (bit)) >> 8)) 2025-05-07T19:46:51.2017122Z #define _LARGEFILE64_SOURCE 1 2025-05-07T19:46:51.2017383Z #define _LARGEFILE_SOURCE 1 2025-05-07T19:46:51.2017634Z #define _LIBC_LIMITS_H_ 1 2025-05-07T19:46:51.2017861Z #define _LINUX_LIMITS_H 2025-05-07T19:46:51.2018095Z #define _LP64 1 2025-05-07T19:46:51.2018293Z #define _MATH_H 1 2025-05-07T19:46:51.2018511Z #define _MATH_H_MATHDEF 1 2025-05-07T19:46:51.2018733Z #define _MOVE_H 1 2025-05-07T19:46:51.2018949Z #define _Mfloat_ float 2025-05-07T19:46:51.2019178Z #define _Mlong_double_ long double 2025-05-07T19:46:51.2019442Z #define _NEW 2025-05-07T19:46:51.2019733Z #define _OLD_STDIO_MAGIC 0xFABC0000 2025-05-07T19:46:51.2020200Z #define _POSIX2_BC_BASE_MAX 99 2025-05-07T19:46:51.2020471Z #define _POSIX2_BC_DIM_MAX 2048 2025-05-07T19:46:51.2020755Z #define _POSIX2_BC_SCALE_MAX 99 2025-05-07T19:46:51.2021140Z #define _POSIX2_BC_STRING_MAX 1000 2025-05-07T19:46:51.2021504Z #define _POSIX2_CHARCLASS_NAME_MAX 14 2025-05-07T19:46:51.2021820Z #define _POSIX2_COLL_WEIGHTS_MAX 2 2025-05-07T19:46:51.2022276Z #define _POSIX2_EXPR_NEST_MAX 32 2025-05-07T19:46:51.2022567Z #define _POSIX2_LINE_MAX 2048 2025-05-07T19:46:51.2022830Z #define _POSIX2_RE_DUP_MAX 255 2025-05-07T19:46:51.2023112Z #define _POSIX_AIO_LISTIO_MAX 2 2025-05-07T19:46:51.2023372Z #define _POSIX_AIO_MAX 1 2025-05-07T19:46:51.2023637Z #define _POSIX_ARG_MAX 4096 2025-05-07T19:46:51.2023893Z #define _POSIX_CHILD_MAX 25 2025-05-07T19:46:51.2024170Z #define _POSIX_CLOCKRES_MIN 20000000 2025-05-07T19:46:51.2024474Z #define _POSIX_C_SOURCE 200809L 2025-05-07T19:46:51.2024746Z #define _POSIX_DELAYTIMER_MAX 32 2025-05-07T19:46:51.2025053Z #define _POSIX_FD_SETSIZE _POSIX_OPEN_MAX 2025-05-07T19:46:51.2025372Z #define _POSIX_HIWAT _POSIX_PIPE_BUF 2025-05-07T19:46:51.2025685Z #define _POSIX_HOST_NAME_MAX 255 2025-05-07T19:46:51.2025959Z #define _POSIX_LINK_MAX 8 2025-05-07T19:46:51.2026235Z #define _POSIX_LOGIN_NAME_MAX 9 2025-05-07T19:46:51.2026501Z #define _POSIX_MAX_CANON 255 2025-05-07T19:46:51.2026961Z #define _POSIX_MAX_INPUT 255 2025-05-07T19:46:51.2027220Z #define _POSIX_MQ_OPEN_MAX 8 2025-05-07T19:46:51.2027494Z #define _POSIX_MQ_PRIO_MAX 32 2025-05-07T19:46:51.2027773Z #define _POSIX_NAME_MAX 14 2025-05-07T19:46:51.2028025Z #define _POSIX_NGROUPS_MAX 8 2025-05-07T19:46:51.2028298Z #define _POSIX_OPEN_MAX 20 2025-05-07T19:46:51.2028553Z #define _POSIX_PATH_MAX 256 2025-05-07T19:46:51.2028818Z #define _POSIX_PIPE_BUF 512 2025-05-07T19:46:51.2029069Z #define _POSIX_QLIMIT 1 2025-05-07T19:46:51.2029327Z #define _POSIX_RE_DUP_MAX 255 2025-05-07T19:46:51.2029591Z #define _POSIX_RTSIG_MAX 8 2025-05-07T19:46:51.2029863Z #define _POSIX_SEM_NSEMS_MAX 256 2025-05-07T19:46:51.2030145Z #define _POSIX_SEM_VALUE_MAX 32767 2025-05-07T19:46:51.2030447Z #define _POSIX_SIGQUEUE_MAX 32 2025-05-07T19:46:51.2030726Z #define _POSIX_SOURCE 1 2025-05-07T19:46:51.2030970Z #define _POSIX_SSIZE_MAX 32767 2025-05-07T19:46:51.2031253Z #define _POSIX_STREAM_MAX 8 2025-05-07T19:46:51.2031512Z #define _POSIX_SYMLINK_MAX 255 2025-05-07T19:46:51.2031803Z #define _POSIX_SYMLOOP_MAX 8 2025-05-07T19:46:51.2032101Z #define _POSIX_THREAD_DESTRUCTOR_ITERATIONS 4 2025-05-07T19:46:51.2032444Z #define _POSIX_THREAD_KEYS_MAX 128 2025-05-07T19:46:51.2032735Z #define _POSIX_THREAD_THREADS_MAX 64 2025-05-07T19:46:51.2033048Z #define _POSIX_TIMER_MAX 32 2025-05-07T19:46:51.2033429Z #define _POSIX_TTY_NAME_MAX 9 2025-05-07T19:46:51.2033714Z #define _POSIX_TZNAME_MAX 6 2025-05-07T19:46:51.2034137Z #define _POSIX_UIO_MAXIOV 16 2025-05-07T19:46:51.2034574Z #define _PSTL_ASSERT(_Condition) __glibcxx_assert(_Condition) 2025-05-07T19:46:51.2035055Z #define _PSTL_ASSERT_MSG(_Condition,_Message) __glibcxx_assert(_Condition) 2025-05-07T19:46:51.2035628Z #define _PSTL_CLANG_VERSION (__clang_major__ * 10000 + __clang_minor__ * 100 + __clang_patchlevel__) 2025-05-07T19:46:51.2036103Z #define _PSTL_CONFIG_H 2025-05-07T19:46:51.2036543Z #define _PSTL_CPP11_STD_ROTATE_BROKEN ((__GLIBCXX__ && __GLIBCXX__ < 20150716) || (_MSC_VER && _MSC_VER < 1800)) 2025-05-07T19:46:51.2037361Z #define _PSTL_CPP14_2RANGE_MISMATCH_EQUAL_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201300L || __cpp_lib_robust_nonmodifying_seq_ops == 201304) 2025-05-07T19:46:51.2038130Z #define _PSTL_CPP14_INTEGER_SEQUENCE_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L) 2025-05-07T19:46:51.2038859Z #define _PSTL_CPP14_MAKE_REVERSE_ITERATOR_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L || __cpp_lib_make_reverse_iterator == 201402) 2025-05-07T19:46:51.2039789Z #define _PSTL_CPP14_VARIABLE_TEMPLATES_PRESENT (!__INTEL_COMPILER || __INTEL_COMPILER >= 1700) && (_MSC_FULL_VER >= 190023918 || __cplusplus >= 201402L) 2025-05-07T19:46:51.2040480Z #define _PSTL_CPP17_EXECUTION_POLICIES_PRESENT (_MSC_VER >= 1912) 2025-05-07T19:46:51.2040913Z #define _PSTL_EARLYEXIT_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:51.2041470Z #define _PSTL_GCC_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:46:51.2041894Z #define _PSTL_HIDE_FROM_ABI_POP 2025-05-07T19:46:51.2042169Z #define _PSTL_HIDE_FROM_ABI_PUSH 2025-05-07T19:46:51.2042494Z #define _PSTL_ICC_18_OMP_SIMD_BROKEN (__INTEL_COMPILER == 1800) 2025-05-07T19:46:51.2042915Z #define _PSTL_MONOTONIC_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:51.2043253Z #define _PSTL_PAR_BACKEND_SERIAL 2025-05-07T19:46:51.2043535Z #define _PSTL_PRAGMA(x) _Pragma(# x) 2025-05-07T19:46:51.2044161Z #define _PSTL_PRAGMA_DECLARE_REDUCTION(NAME,OP) _PSTL_PRAGMA(omp declare reduction(NAME:OP : omp_out(omp_in)) initializer(omp_priv = omp_orig)) 2025-05-07T19:46:51.2044854Z #define _PSTL_PRAGMA_DECLARE_SIMD _PSTL_PRAGMA(omp declare simd) 2025-05-07T19:46:51.2045237Z #define _PSTL_PRAGMA_FORCEINLINE 2025-05-07T19:46:51.2045562Z #define _PSTL_PRAGMA_LOCATION " [Parallel STL message]: " 2025-05-07T19:46:51.2045918Z #define _PSTL_PRAGMA_MESSAGE(x) 2025-05-07T19:46:51.2046394Z #define _PSTL_PRAGMA_MESSAGE_IMPL(x) _PSTL_PRAGMA(message(_PSTL_STRING_CONCAT(_PSTL_PRAGMA_LOCATION, x))) 2025-05-07T19:46:51.2046919Z #define _PSTL_PRAGMA_MESSAGE_POLICIES(x) 2025-05-07T19:46:51.2047251Z #define _PSTL_PRAGMA_SIMD _PSTL_PRAGMA(omp simd) 2025-05-07T19:46:51.2047564Z #define _PSTL_PRAGMA_SIMD_EARLYEXIT 2025-05-07T19:46:51.2047874Z #define _PSTL_PRAGMA_SIMD_EXCLUSIVE_SCAN(PRM) 2025-05-07T19:46:51.2048201Z #define _PSTL_PRAGMA_SIMD_INCLUSIVE_SCAN(PRM) 2025-05-07T19:46:51.2048550Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC(PRM) 2025-05-07T19:46:51.2048930Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC_2ARGS(PRM1,PRM2) 2025-05-07T19:46:51.2049418Z #define _PSTL_PRAGMA_SIMD_REDUCTION(PRM) _PSTL_PRAGMA(omp simd reduction(PRM)) 2025-05-07T19:46:51.2049837Z #define _PSTL_PRAGMA_SIMD_SCAN(PRM) 2025-05-07T19:46:51.2050134Z #define _PSTL_PRAGMA_VECTOR_UNALIGNED 2025-05-07T19:46:51.2050448Z #define _PSTL_STRING(x) _PSTL_STRING_AUX(x) 2025-05-07T19:46:51.2050740Z #define _PSTL_STRING_AUX(x) #x 2025-05-07T19:46:51.2051190Z #define _PSTL_STRING_CONCAT(x,y) x #y 2025-05-07T19:46:51.2051477Z #define _PSTL_UDR_PRESENT 0 2025-05-07T19:46:51.2051915Z #define _PSTL_UDS_PRESENT (__INTEL_COMPILER >= 1900 && __INTEL_COMPILER_BUILD_DATE >= 20180626) 2025-05-07T19:46:51.2052384Z #define _PSTL_USAGE_WARNINGS 0 2025-05-07T19:46:51.2052694Z #define _PSTL_USE_NONTEMPORAL_STORES_IF_ALLOWED 2025-05-07T19:46:51.2053019Z #define _PSTL_VERSION 12000 2025-05-07T19:46:51.2053321Z #define _PSTL_VERSION_MAJOR (_PSTL_VERSION / 1000) 2025-05-07T19:46:51.2053788Z #define _PSTL_VERSION_MINOR ((_PSTL_VERSION % 1000) / 10) 2025-05-07T19:46:51.2054170Z #define _PSTL_VERSION_PATCH (_PSTL_VERSION % 10) 2025-05-07T19:46:51.2054498Z #define _PTRDIFF_T 2025-05-07T19:46:51.2054719Z #define _PTR_TRAITS_H 1 2025-05-07T19:46:51.2054970Z #define _SIGSET_H_types 1 2025-05-07T19:46:51.2055290Z #define _SIGSET_NWORDS (1024 / (8 * sizeof (unsigned long int))) 2025-05-07T19:46:51.2055662Z #define _SIZE_T 2025-05-07T19:46:51.2055876Z #define _STDC_PREDEF_H 1 2025-05-07T19:46:51.2056123Z #define _STDIO_H 1 2025-05-07T19:46:51.2056350Z #define _STDIO_USES_IOSTREAM 2025-05-07T19:46:51.2056615Z #define _STDLIB_H 1 2025-05-07T19:46:51.2056858Z #define _STL_ALGOBASE_H 1 2025-05-07T19:46:51.2057105Z #define _STL_ITERATOR_BASE_FUNCS_H 1 2025-05-07T19:46:51.2057405Z #define _STL_ITERATOR_BASE_TYPES_H 1 2025-05-07T19:46:51.2057682Z #define _STL_ITERATOR_H 1 2025-05-07T19:46:51.2057928Z #define _STL_PAIR_H 1 2025-05-07T19:46:51.2058151Z #define _STL_RELOPS_H 1 2025-05-07T19:46:51.2058392Z #define _STRING_H 1 2025-05-07T19:46:51.2058608Z #define _STRUCT_TIMEVAL 1 2025-05-07T19:46:51.2058854Z #define _SVID_SOURCE 1 2025-05-07T19:46:51.2059078Z #define _SYS_CDEFS_H 1 2025-05-07T19:46:51.2059314Z #define _SYS_SELECT_H 1 2025-05-07T19:46:51.2059626Z #define _SYS_SYSMACROS_H 1 2025-05-07T19:46:51.2059883Z #define _SYS_TYPES_H 1 2025-05-07T19:46:51.2060293Z #define _TIME_H 1 2025-05-07T19:46:51.2060512Z #define _VA_LIST_DEFINED 2025-05-07T19:46:51.2060770Z #define _XLOCALE_H 1 2025-05-07T19:46:51.2061117Z #define _XOPEN_IOV_MAX _POSIX_UIO_MAXIOV 2025-05-07T19:46:51.2061434Z #define _XOPEN_LIM_H 1 2025-05-07T19:46:51.2061669Z #define _XOPEN_SOURCE 700 2025-05-07T19:46:51.2061942Z #define _XOPEN_SOURCE_EXTENDED 1 2025-05-07T19:46:51.2062306Z #define __ASMNAME(cname) __ASMNAME2 (__USER_LABEL_PREFIX__, cname) 2025-05-07T19:46:51.2062782Z #define __ASMNAME2(prefix,cname) __STRING (prefix) cname 2025-05-07T19:46:51.2063162Z #define __ASSERT_FUNCTION __PRETTY_FUNCTION__ 2025-05-07T19:46:51.2063516Z #define __ASSERT_VOID_CAST static_cast 2025-05-07T19:46:51.2063835Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:46:51.2064085Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:46:51.2064344Z #define __ATOMIC_CONSUME 1 2025-05-07T19:46:51.2064601Z #define __ATOMIC_RELAXED 0 2025-05-07T19:46:51.2064861Z #define __ATOMIC_RELEASE 3 2025-05-07T19:46:51.2065111Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:46:51.2065387Z #define __BEGIN_DECLS extern "C" { 2025-05-07T19:46:51.2065677Z #define __BEGIN_NAMESPACE_C99 2025-05-07T19:46:51.2065969Z #define __BEGIN_NAMESPACE_STD 2025-05-07T19:46:51.2066244Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:46:51.2066532Z #define __BIG_ENDIAN 4321 2025-05-07T19:46:51.2066805Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:46:51.2067094Z #define __BIT_TYPES_DEFINED__ 1 2025-05-07T19:46:51.2067391Z #define __BLKCNT64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:51.2067711Z #define __BLKCNT_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.2068067Z #define __BLKSIZE_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.2068384Z #define __BOOL_WIDTH__ 8 2025-05-07T19:46:51.2068659Z #define __BYTE_ORDER __LITTLE_ENDIAN 2025-05-07T19:46:51.2068977Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:46:51.2069320Z #define __CHANNEL_DESCRIPTOR_H__ 2025-05-07T19:46:51.2069630Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:46:51.2069933Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:46:51.2070232Z #define __CHAR_BIT__ 8 2025-05-07T19:46:51.2070485Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:46:51.2070823Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:46:51.2071153Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:46:51.2071489Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:46:51.2071800Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:46:51.2072229Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:46:51.2072519Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:46:51.2072826Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:46:51.2073203Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:46:51.2073489Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:46:51.2073781Z #define __CLANG_LIMITS_H 2025-05-07T19:46:51.2074020Z #define __CLANG_MAX_ALIGN_T_DEFINED 2025-05-07T19:46:51.2074304Z #define __CLOCKID_T_TYPE __S32_TYPE 2025-05-07T19:46:51.2074583Z #define __CLOCK_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.2074883Z #define __COMMON_FUNCTIONS_H__ 2025-05-07T19:46:51.2075127Z #define __COMPAR_FN_T 2025-05-07T19:46:51.2075362Z #define __CONCAT(x,y) x ## y 2025-05-07T19:46:51.2075613Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:46:51.2075886Z #define __CUDACC_VER_BUILD__ 85 2025-05-07T19:46:51.2076146Z #define __CUDACC_VER_MAJOR__ 12 2025-05-07T19:46:51.2076394Z #define __CUDACC_VER_MINOR__ 6 2025-05-07T19:46:51.2076978Z #define __CUDACC_VER__ "__CUDACC_VER__ is no longer supported. Use __CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__, and __CUDACC_VER_BUILD__ instead." 2025-05-07T19:46:51.2077560Z #define __CUDACC__ 1 2025-05-07T19:46:51.2077800Z #define __CUDART_API_PTDS(api) api 2025-05-07T19:46:51.2078070Z #define __CUDART_API_PTSZ(api) api 2025-05-07T19:46:51.2078500Z #define __CUDART_API_VERSION ((__CUDA_API_VER_MAJOR__ * 1000) + (__CUDA_API_VER_MINOR__ * 10)) 2025-05-07T19:46:51.2078952Z #define __CUDA_API_VER_MAJOR__ 12 2025-05-07T19:46:51.2079214Z #define __CUDA_API_VER_MINOR__ 6 2025-05-07T19:46:51.2079554Z #define __CUDA_ARCH_HAS_FEATURE__(_FEAT) __CUDA_ARCH_FEAT_##_FEAT 2025-05-07T19:46:51.2079907Z #define __CUDA_ARCH_LIST__ 520 2025-05-07T19:46:51.2080227Z #define __CUDA_ARCH__ 520 2025-05-07T19:46:51.2080473Z #define __CUDA_DEVICE_RUNTIME_API_H__ 2025-05-07T19:46:51.2080760Z #define __CUDA_MATH_CRTIMP 2025-05-07T19:46:51.2081006Z #define __CUDA_RUNTIME_API_H__ 2025-05-07T19:46:51.2081274Z #define __CUDA_RUNTIME_H__ 2025-05-07T19:46:51.2081517Z #define __DADDR_T_TYPE __S32_TYPE 2025-05-07T19:46:51.2081792Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:46:51.2082082Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:46:51.2082377Z #define __DBL_DIG__ 15 2025-05-07T19:46:51.2082627Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:46:51.2082913Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:46:51.2083169Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:46:51.2083414Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:51.2083672Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:46:51.2083909Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:46:51.2084332Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:46:51.2084595Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:46:51.2084900Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:46:51.2085179Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:46:51.2085445Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:46:51.2085768Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:46:51.2086071Z #define __DELETE_THROW throw() 2025-05-07T19:46:51.2086338Z #define __DEPRECATED 1 2025-05-07T19:46:51.2086585Z #define __DEVICE_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:51.2086905Z #define __DEVICE_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:51.2087202Z #define __DEVICE_DOUBLE_FUNCTIONS_HPP__ 2025-05-07T19:46:51.2087518Z #define __DEVICE_DOUBLE_FUNCTIONS_H__ 2025-05-07T19:46:51.2087823Z #define __DEVICE_FUNCTIONS_HPP__ 2025-05-07T19:46:51.2088095Z #define __DEVICE_FUNCTIONS_H__ 2025-05-07T19:46:51.2088383Z #define __DEVICE_LAUNCH_PARAMETERS_H__ 2025-05-07T19:46:51.2088671Z #define __DEVICE_TYPES_H__ 2025-05-07T19:46:51.2088935Z #define __DEV_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:51.2089207Z #define __DRIVER_FUNCTIONS_H__ 2025-05-07T19:46:51.2089481Z #define __DRIVER_TYPES_H__ 2025-05-07T19:46:51.2089717Z #define __ELF__ 1 2025-05-07T19:46:51.2089938Z #define __END_DECLS } 2025-05-07T19:46:51.2090169Z #define __END_NAMESPACE_C99 2025-05-07T19:46:51.2090441Z #define __END_NAMESPACE_STD 2025-05-07T19:46:51.2090711Z #define __EXCEPTIONS 1 2025-05-07T19:46:51.2090944Z #define __EXCEPTION_H 1 2025-05-07T19:46:51.2091208Z #define __FDS_BITS(set) ((set)->fds_bits) 2025-05-07T19:46:51.2091712Z #define __FD_CLR(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] &= ~__FD_MASK (d))) 2025-05-07T19:46:51.2092140Z #define __FD_ELT(d) ((d) / __NFDBITS) 2025-05-07T19:46:51.2092525Z #define __FD_ISSET(d,set) ((__FDS_BITS (set)[__FD_ELT (d)] & __FD_MASK (d)) != 0) 2025-05-07T19:46:51.2092983Z #define __FD_MASK(d) ((__fd_mask) 1 << ((d) % __NFDBITS)) 2025-05-07T19:46:51.2093422Z #define __FD_SET(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] |= __FD_MASK (d))) 2025-05-07T19:46:51.2093832Z #define __FD_SETSIZE 1024 2025-05-07T19:46:51.2094511Z #define __FD_ZERO(fdsp) do { int __d0, __d1; __asm__ __volatile__ ("cld; rep; " __FD_ZERO_STOS : "=c" (__d0), "=D" (__d1) : "a" (0), "0" (sizeof (fd_set) / sizeof (__fd_mask)), "1" (&__FDS_BITS (fdsp)[0]) : "memory"); } while (0) 2025-05-07T19:46:51.2095216Z #define __FD_ZERO_STOS "stosq" 2025-05-07T19:46:51.2095491Z #define __FILE_defined 1 2025-05-07T19:46:51.2095735Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:46:51.2096119Z #define __FLOAT128__ 1 2025-05-07T19:46:51.2096355Z #define __FLOAT_WORD_ORDER __BYTE_ORDER 2025-05-07T19:46:51.2096648Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:46:51.2096947Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:46:51.2097246Z #define __FLT16_DIG__ 3 2025-05-07T19:46:51.2097495Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:46:51.2097773Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:46:51.2098042Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:46:51.2098301Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:46:51.2098635Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:46:51.2098881Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:46:51.2099147Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:46:51.2099637Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:46:51.2100099Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:46:51.2100394Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:46:51.2100707Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:46:51.2101025Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:46:51.2101303Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:46:51.2101616Z #define __FLT_DIG__ 6 2025-05-07T19:46:51.2101861Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:46:51.2102172Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:46:51.2102437Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:46:51.2102721Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:46:51.2102984Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:46:51.2103253Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:46:51.2103528Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:46:51.2103782Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:46:51.2104077Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:46:51.2104350Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:46:51.2104625Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:46:51.2104896Z #define __FLT_RADIX__ 2 2025-05-07T19:46:51.2105161Z #define __FSBLKCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:51.2105487Z #define __FSBLKCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:51.2105833Z #define __FSFILCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:51.2106163Z #define __FSFILCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:51.2106514Z #define __FSID_T_TYPE struct { int __val[2]; } 2025-05-07T19:46:51.2106863Z #define __FSWORD_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.2107159Z #define __FXSR__ 1 2025-05-07T19:46:51.2107410Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:46:51.2107704Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:46:51.2108021Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:46:51.2108341Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:46:51.2108662Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:46:51.2108955Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:46:51.2123777Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:46:51.2124161Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:46:51.2124476Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:46:51.2124790Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:46:51.2125305Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:46:51.2125636Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:46:51.2125930Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:46:51.2126236Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:46:51.2126559Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:46:51.2126885Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:46:51.2127202Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:46:51.2127510Z #define __GID_T_TYPE __U32_TYPE 2025-05-07T19:46:51.2127798Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:46:51.2128082Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:46:51.2128380Z #define __GLIBCXX__ 20230528 2025-05-07T19:46:51.2128634Z #define __GLIBC_HAVE_LONG_LONG 1 2025-05-07T19:46:51.2128903Z #define __GLIBC_MINOR__ 17 2025-05-07T19:46:51.2129303Z #define __GLIBC_PREREQ(maj,min) ((__GLIBC__ << 16) + __GLIBC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:51.2129935Z #define __GLIBC__ 2 2025-05-07T19:46:51.2130155Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:46:51.2130412Z #define __GNUC_MINOR__ 2 2025-05-07T19:46:51.2130663Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:46:51.2131051Z #define __GNUC_PREREQ(maj,min) ((__GNUC__ << 16) + __GNUC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:51.2131486Z #define __GNUC_VA_LIST 2025-05-07T19:46:51.2131706Z #define __GNUC__ 4 2025-05-07T19:46:51.2131916Z #define __GNUG__ 4 2025-05-07T19:46:51.2132122Z #define __GNU_LIBRARY__ 6 2025-05-07T19:46:51.2132380Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:46:51.2132727Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:46:51.2133015Z #define __GXX_RTTI 1 2025-05-07T19:46:51.2133228Z #define __GXX_WEAK__ 1 2025-05-07T19:46:51.2133459Z #define __HAVE_COLUMN 2025-05-07T19:46:51.2133693Z #define __HOST_CONFIG_H__ 2025-05-07T19:46:51.2133929Z #define __HOST_DEFINES_H__ 2025-05-07T19:46:51.2134178Z #define __ID_T_TYPE __U32_TYPE 2025-05-07T19:46:51.2134440Z #define __INO64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:51.2134834Z #define __INO_T_MATCHES_INO64_T 1 2025-05-07T19:46:51.2135105Z #define __INO_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:51.2135515Z #define __INT16_C_SUFFIX__ 2025-05-07T19:46:51.2135746Z #define __INT16_FMTd__ "hd" 2025-05-07T19:46:51.2135975Z #define __INT16_FMTi__ "hi" 2025-05-07T19:46:51.2136196Z #define __INT16_MAX__ 32767 2025-05-07T19:46:51.2136426Z #define __INT16_TYPE__ short 2025-05-07T19:46:51.2136656Z #define __INT32_C_SUFFIX__ 2025-05-07T19:46:51.2136881Z #define __INT32_FMTd__ "d" 2025-05-07T19:46:51.2137110Z #define __INT32_FMTi__ "i" 2025-05-07T19:46:51.2137332Z #define __INT32_MAX__ 2147483647 2025-05-07T19:46:51.2137573Z #define __INT32_TYPE__ int 2025-05-07T19:46:51.2137794Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:46:51.2138027Z #define __INT64_FMTd__ "ld" 2025-05-07T19:46:51.2138247Z #define __INT64_FMTi__ "li" 2025-05-07T19:46:51.2138486Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:46:51.2138754Z #define __INT64_TYPE__ long int 2025-05-07T19:46:51.2138996Z #define __INT8_C_SUFFIX__ 2025-05-07T19:46:51.2139217Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:46:51.2139443Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:46:51.2139756Z #define __INT8_MAX__ 127 2025-05-07T19:46:51.2140159Z #define __INT8_TYPE__ signed char 2025-05-07T19:46:51.2140435Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:46:51.2140689Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:46:51.2140944Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:46:51.2141202Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:46:51.2141505Z #define __INTMAX_TYPE__ long int 2025-05-07T19:46:51.2141761Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:46:51.2142014Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:46:51.2142259Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:46:51.2142527Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:46:51.2142818Z #define __INTPTR_TYPE__ long int 2025-05-07T19:46:51.2143092Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:46:51.2143346Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:46:51.2143691Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:46:51.2143958Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:46:51.2144053Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:46:51.2144145Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:46:51.2144246Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:46:51.2144337Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:46:51.2144434Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:46:51.2144523Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:46:51.2144624Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:46:51.2144720Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:46:51.2144814Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:46:51.2144942Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:46:51.2145045Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:46:51.2145136Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:46:51.2145228Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:46:51.2145329Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:46:51.2145424Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:46:51.2145525Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:46:51.2145623Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:46:51.2145714Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:46:51.2145807Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:46:51.2145897Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:46:51.2145996Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:46:51.2146088Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:46:51.2146176Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:46:51.2146329Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:46:51.2146423Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:46:51.2146514Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:46:51.2146603Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:46:51.2146701Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:46:51.2146792Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:46:51.2146907Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:46:51.2147017Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:46:51.2147105Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:46:51.2147197Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:46:51.2147287Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:46:51.2147390Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:46:51.2147488Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:46:51.2147582Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:46:51.2147678Z #define __INT_MAX__ 2147483647 2025-05-07T19:46:51.2147764Z #define __INT_WIDTH__ 32 2025-05-07T19:46:51.2147859Z #define __KERNEL_STRICT_NAMES 2025-05-07T19:46:51.2147952Z #define __KEY_T_TYPE __S32_TYPE 2025-05-07T19:46:51.2148052Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:46:51.2148188Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:46:51.2148271Z #define __LDBL_DIG__ 18 2025-05-07T19:46:51.2148404Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:46:51.2148493Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:46:51.2148585Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:46:51.2148687Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:51.2148776Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:46:51.2148864Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:46:51.2148950Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:46:51.2149069Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:46:51.2149160Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:46:51.2149249Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:46:51.2149369Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:46:51.2149488Z #define __LDBL_REDIR(name,proto) name proto 2025-05-07T19:46:51.2149618Z #define __LDBL_REDIR1(name,proto,alias) name proto 2025-05-07T19:46:51.2149789Z #define __LDBL_REDIR1_NTH(name,proto,alias) name proto __THROW 2025-05-07T19:46:51.2149896Z #define __LDBL_REDIR_DECL(name) 2025-05-07T19:46:51.2150041Z #define __LDBL_REDIR_NTH(name,proto) name proto __THROW 2025-05-07T19:46:51.2150119Z #define __LEAF 2025-05-07T19:46:51.2150265Z #define __LEAF_ATTR 2025-05-07T19:46:51.2150358Z #define __LIBRARY_TYPES_H__ 2025-05-07T19:46:51.2150447Z #define __LITTLE_ENDIAN 1234 2025-05-07T19:46:51.2150534Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:46:51.2150632Z #define __LLONG_WIDTH__ 64 2025-05-07T19:46:51.2150744Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:46:51.2150842Z #define __LONG_LONG_PAIR(HI,LO) LO, HI 2025-05-07T19:46:51.2150948Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:46:51.2151034Z #define __LONG_WIDTH__ 64 2025-05-07T19:46:51.2151114Z #define __LP64__ 1 2025-05-07T19:46:51.2151449Z #define __MATHCALLX(function,suffix,args,attrib) __MATHDECLX (_Mdouble_,function,suffix, args, attrib) 2025-05-07T19:46:51.2152120Z #define __MATHDECLX(type,function,suffix,args,attrib) __MATHDECL_1(type, function,suffix, args) __attribute__ (attrib); __MATHDECL_1(type, __CONCAT(__,function),suffix, args) __attribute__ (attrib) 2025-05-07T19:46:51.2152330Z #define __MATH_DECLARE_LDOUBLE 1 2025-05-07T19:46:51.2152424Z #define __MATH_FUNCTIONS_HPP__ 2025-05-07T19:46:51.2152521Z #define __MATH_FUNCTIONS_H__ 2025-05-07T19:46:51.2152595Z #define __MMX__ 1 2025-05-07T19:46:51.2152685Z #define __MODE_T_TYPE __U32_TYPE 2025-05-07T19:46:51.2152773Z #define __N(msgid) (msgid) 2025-05-07T19:46:51.2152886Z #define __NFDBITS (8 * (int) sizeof (__fd_mask)) 2025-05-07T19:46:51.2152995Z #define __NLINK_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:51.2153075Z #define __NO_CTYPE 1 2025-05-07T19:46:51.2153273Z #define __NO_INLINE__ 1 2025-05-07T19:46:51.2153355Z #define __NO_MATH_INLINES 1 2025-05-07T19:46:51.2153522Z #define __NTH(fct) __LEAF_ATTR fct throw () 2025-05-07T19:46:51.2153626Z #define __NVCC_DIAG_PRAGMA_SUPPORT__ 1 2025-05-07T19:46:51.2153698Z #define __NVCC__ 1 2025-05-07T19:46:51.2153791Z #define __NV_GLIBCXX_VERSION 40800 2025-05-07T19:46:51.2153877Z #define __NV_LEGACY_LAUNCH 1 2025-05-07T19:46:51.2153977Z #define __NV_NO_HOST_COMPILER_CHECK 1 2025-05-07T19:46:51.2154062Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:46:51.2154156Z #define __OFF64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:51.2154254Z #define __OFF_T_MATCHES_OFF64_T 1 2025-05-07T19:46:51.2154349Z #define __OFF_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.2154463Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:46:51.2154559Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:46:51.2154664Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:46:51.2154762Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:46:51.2154861Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:46:51.2154960Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:46:51.2155049Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:46:51.2155134Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:46:51.2155211Z #define __P(args) args 2025-05-07T19:46:51.2155299Z #define __PDP_ENDIAN 3412 2025-05-07T19:46:51.2155370Z #define __PIC__ 2 2025-05-07T19:46:51.2155455Z #define __PID_T_TYPE __S32_TYPE 2025-05-07T19:46:51.2155533Z #define __PIE__ 2 2025-05-07T19:46:51.2155616Z #define __PMT(args) args 2025-05-07T19:46:51.2155697Z #define __POINTER_WIDTH__ 64 2025-05-07T19:46:51.2155787Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:46:51.2155886Z #define __PTHREAD_MUTEX_HAVE_PREV 1 2025-05-07T19:46:51.2155984Z #define __PTHREAD_RWLOCK_INT_FLAGS_SHARED 1 2025-05-07T19:46:51.2156066Z #define __PTHREAD_SPINS 0, 0 2025-05-07T19:46:51.2156165Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:46:51.2156249Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:46:51.2156343Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:46:51.2156442Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:46:51.2156526Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:46:51.2156732Z #define __REDIRECT(name,proto,alias) name proto __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:51.2156929Z #define __REDIRECT_LDBL(name,proto,alias) __REDIRECT (name, proto, alias) 2025-05-07T19:46:51.2157178Z #define __REDIRECT_NTH(name,proto,alias) name proto __THROW __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:51.2157480Z #define __REDIRECT_NTHNL(name,proto,alias) name proto __THROWNL __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:51.2157698Z #define __REDIRECT_NTH_LDBL(name,proto,alias) __REDIRECT_NTH (name, proto, alias) 2025-05-07T19:46:51.2157796Z #define __REGISTER_PREFIX__ 2025-05-07T19:46:51.2157887Z #define __RLIM64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:51.2157984Z #define __RLIM_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:51.2158075Z #define __S16_TYPE short int 2025-05-07T19:46:51.2158154Z #define __S32_TYPE int 2025-05-07T19:46:51.2158235Z #define __S64_TYPE long int 2025-05-07T19:46:51.2158318Z #define __SCHAR_MAX__ 127 2025-05-07T19:46:51.2158404Z #define __SEG_FS 1 2025-05-07T19:46:51.2158478Z #define __SEG_GS 1 2025-05-07T19:46:51.2158558Z #define __SHRT_MAX__ 32767 2025-05-07T19:46:51.2158637Z #define __SHRT_WIDTH__ 16 2025-05-07T19:46:51.2158740Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:46:51.2158825Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:46:51.2158905Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:46:51.2159005Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:46:51.2159085Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:46:51.2159168Z #define __SIZEOF_INT128__ 16 2025-05-07T19:46:51.2159248Z #define __SIZEOF_INT__ 4 2025-05-07T19:46:51.2159345Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:46:51.2159432Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:46:51.2159511Z #define __SIZEOF_LONG__ 8 2025-05-07T19:46:51.2159603Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:46:51.2159696Z #define __SIZEOF_PTHREAD_ATTR_T 56 2025-05-07T19:46:51.2159793Z #define __SIZEOF_PTHREAD_BARRIERATTR_T 4 2025-05-07T19:46:51.2159950Z #define __SIZEOF_PTHREAD_BARRIER_T 32 2025-05-07T19:46:51.2160044Z #define __SIZEOF_PTHREAD_CONDATTR_T 4 2025-05-07T19:46:51.2160134Z #define __SIZEOF_PTHREAD_COND_T 48 2025-05-07T19:46:51.2160228Z #define __SIZEOF_PTHREAD_MUTEXATTR_T 4 2025-05-07T19:46:51.2160325Z #define __SIZEOF_PTHREAD_MUTEX_T 40 2025-05-07T19:46:51.2160423Z #define __SIZEOF_PTHREAD_RWLOCKATTR_T 8 2025-05-07T19:46:51.2160513Z #define __SIZEOF_PTHREAD_RWLOCK_T 56 2025-05-07T19:46:51.2160612Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:46:51.2160695Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:46:51.2160778Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:46:51.2160859Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:46:51.2160949Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:46:51.2161031Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:46:51.2161113Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:46:51.2161202Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:46:51.2161282Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:46:51.2161380Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.2161475Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:46:51.2161564Z #define __SIZE_WIDTH__ 64 2025-05-07T19:46:51.2161646Z #define __SLONG32_TYPE int 2025-05-07T19:46:51.2161739Z #define __SLONGWORD_TYPE long int 2025-05-07T19:46:51.2161843Z #define __SM_20_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:51.2161934Z #define __SM_20_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:51.2162026Z #define __SM_20_INTRINSICS_HPP__ 2025-05-07T19:46:51.2162115Z #define __SM_20_INTRINSICS_H__ 2025-05-07T19:46:51.2162209Z #define __SM_30_INTRINSICS_HPP__ 2025-05-07T19:46:51.2162295Z #define __SM_30_INTRINSICS_H__ 2025-05-07T19:46:51.2162389Z #define __SM_32_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:51.2162490Z #define __SM_32_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:51.2162576Z #define __SM_32_INTRINSICS_HPP__ 2025-05-07T19:46:51.2162659Z #define __SM_32_INTRINSICS_H__ 2025-05-07T19:46:51.2162745Z #define __SM_35_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:51.2162839Z #define __SM_35_INTRINSICS_H__ 2025-05-07T19:46:51.2162931Z #define __SM_60_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:51.2163020Z #define __SM_60_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:51.2163115Z #define __SM_61_INTRINSICS_HPP__ 2025-05-07T19:46:51.2163198Z #define __SM_61_INTRINSICS_H__ 2025-05-07T19:46:51.2163277Z #define __SM_70_RT_HPP__ 2025-05-07T19:46:51.2163351Z #define __SM_70_RT_H__ 2025-05-07T19:46:51.2163435Z #define __SM_80_RT_HPP__ 2025-05-07T19:46:51.2163563Z #define __SM_80_RT_H__ 2025-05-07T19:46:51.2163639Z #define __SM_90_RT_HPP__ 2025-05-07T19:46:51.2163726Z #define __SM_90_RT_H__ 2025-05-07T19:46:51.2163810Z #define __SQUAD_TYPE long int 2025-05-07T19:46:51.2163885Z #define __SSE2_MATH__ 1 2025-05-07T19:46:51.2163957Z #define __SSE2__ 1 2025-05-07T19:46:51.2164045Z #define __SSE_MATH__ 1 2025-05-07T19:46:51.2164118Z #define __SSE__ 1 2025-05-07T19:46:51.2164209Z #define __SSIZE_T_TYPE __SWORD_TYPE 2025-05-07T19:46:51.2164335Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:46:51.2164441Z #define __STDCPP_MATH_SPEC_FUNCS__ 201003L 2025-05-07T19:46:51.2164528Z #define __STDCPP_THREADS__ 1 2025-05-07T19:46:51.2164608Z #define __STDC_HOSTED__ 1 2025-05-07T19:46:51.2164707Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:46:51.2164786Z #define __STDC_IEC_559__ 1 2025-05-07T19:46:51.2164868Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:46:51.2164963Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:46:51.2165044Z #define __STDC_UTF_16__ 1 2025-05-07T19:46:51.2165122Z #define __STDC_UTF_32__ 1 2025-05-07T19:46:51.2165196Z #define __STDC__ 1 2025-05-07T19:46:51.2165282Z #define __STDDEF_H 2025-05-07T19:46:51.2165356Z #define __STRING(x) #x 2025-05-07T19:46:51.2165454Z #define __SURFACE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:51.2165550Z #define __SURFACE_TYPES_H__ 2025-05-07T19:46:51.2165664Z #define __SUSECONDS_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.2165747Z #define __SWORD_TYPE long int 2025-05-07T19:46:51.2165853Z #define __SYSCALL_SLONG_TYPE __SLONGWORD_TYPE 2025-05-07T19:46:51.2166021Z #define __SYSCALL_ULONG_TYPE __ULONGWORD_TYPE 2025-05-07T19:46:51.2166108Z #define __SYSCALL_WORDSIZE 64 2025-05-07T19:46:51.2166203Z #define __TEXTURE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:51.2166289Z #define __TEXTURE_TYPES_H__ 2025-05-07T19:46:51.2166383Z #define __THROW throw () 2025-05-07T19:46:51.2166464Z #define __THROWNL throw () 2025-05-07T19:46:51.2166547Z #define __TIMER_T_TYPE void * 2025-05-07T19:46:51.2166661Z #define __TIME_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.2166754Z #define __U16_TYPE unsigned short int 2025-05-07T19:46:51.2166838Z #define __U32_TYPE unsigned int 2025-05-07T19:46:51.2166928Z #define __U64_TYPE unsigned long int 2025-05-07T19:46:51.2167023Z #define __UID_T_TYPE __U32_TYPE 2025-05-07T19:46:51.2167105Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:46:51.2167185Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:46:51.2167273Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:46:51.2167354Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:46:51.2167433Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:46:51.2167516Z #define __UINT16_MAX__ 65535 2025-05-07T19:46:51.2167616Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:46:51.2167697Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:46:51.2167776Z #define __UINT32_FMTX__ "X" 2025-05-07T19:46:51.2167863Z #define __UINT32_FMTo__ "o" 2025-05-07T19:46:51.2167942Z #define __UINT32_FMTu__ "u" 2025-05-07T19:46:51.2168021Z #define __UINT32_FMTx__ "x" 2025-05-07T19:46:51.2168109Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:46:51.2168203Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:46:51.2168285Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:46:51.2168364Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:46:51.2168451Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:46:51.2168534Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:46:51.2168613Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:46:51.2168708Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.2168812Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:46:51.2168898Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:46:51.2168976Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:46:51.2169066Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:46:51.2169146Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:46:51.2169228Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:46:51.2169305Z #define __UINT8_MAX__ 255 2025-05-07T19:46:51.2169404Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:46:51.2169490Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:46:51.2169631Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:46:51.2169723Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:46:51.2169807Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:46:51.2169889Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:46:51.2169988Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.2170095Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:46:51.2170180Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:46:51.2170263Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:46:51.2170354Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:46:51.2170439Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:46:51.2170524Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:46:51.2170621Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.2170730Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:46:51.2170815Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:46:51.2170902Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:46:51.2170998Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:46:51.2171086Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:46:51.2171170Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:46:51.2171254Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:46:51.2171363Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:46:51.2171450Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:46:51.2171535Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:46:51.2171628Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:46:51.2171710Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:46:51.2171800Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:46:51.2172305Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:46:51.2172390Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:46:51.2172474Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:46:51.2172557Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:46:51.2172648Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:46:51.2172757Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.2172866Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:46:51.2172962Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:46:51.2173045Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:46:51.2173129Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:46:51.2173213Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:46:51.2173306Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:46:51.2173568Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:46:51.2173665Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:46:51.2173763Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:46:51.2173854Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:46:51.2173947Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:46:51.2174036Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:46:51.2174153Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:46:51.2174241Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:46:51.2174328Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:46:51.2174422Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:46:51.2174507Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:46:51.2174605Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:46:51.2174707Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:46:51.2174807Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:46:51.2174895Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:46:51.2174981Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:46:51.2175076Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:46:51.2175194Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.2175311Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:46:51.2175417Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:46:51.2175504Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:46:51.2175592Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:46:51.2175679Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:46:51.2175776Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:46:51.2175879Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:46:51.2175971Z #define __ULONG32_TYPE unsigned int 2025-05-07T19:46:51.2176149Z #define __ULONGWORD_TYPE unsigned long int 2025-05-07T19:46:51.2176244Z #define __UQUAD_TYPE unsigned long int 2025-05-07T19:46:51.2176336Z #define __USECONDS_T_TYPE __U32_TYPE 2025-05-07T19:46:51.2176426Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:46:51.2176513Z #define __USE_ANSI 1 2025-05-07T19:46:51.2176592Z #define __USE_ATFILE 1 2025-05-07T19:46:51.2176666Z #define __USE_BSD 1 2025-05-07T19:46:51.2176767Z #define __USE_FORTIFY_LEVEL 0 2025-05-07T19:46:51.2176845Z #define __USE_GNU 1 2025-05-07T19:46:51.2176924Z #define __USE_ISOC11 1 2025-05-07T19:46:51.2177004Z #define __USE_ISOC95 1 2025-05-07T19:46:51.2177087Z #define __USE_ISOC99 1 2025-05-07T19:46:51.2177169Z #define __USE_ISOCXX11 1 2025-05-07T19:46:51.2177252Z #define __USE_LARGEFILE 1 2025-05-07T19:46:51.2177351Z #define __USE_LARGEFILE64 1 2025-05-07T19:46:51.2177427Z #define __USE_MISC 1 2025-05-07T19:46:51.2177507Z #define __USE_POSIX 1 2025-05-07T19:46:51.2177592Z #define __USE_POSIX199309 1 2025-05-07T19:46:51.2177688Z #define __USE_POSIX199506 1 2025-05-07T19:46:51.2177767Z #define __USE_POSIX2 1 2025-05-07T19:46:51.2177965Z #define __USE_SVID 1 2025-05-07T19:46:51.2178053Z #define __USE_UNIX98 1 2025-05-07T19:46:51.2178138Z #define __USE_XOPEN 1 2025-05-07T19:46:51.2178222Z #define __USE_XOPEN2K 1 2025-05-07T19:46:51.2178306Z #define __USE_XOPEN2K8 1 2025-05-07T19:46:51.2178402Z #define __USE_XOPEN2K8XSI 1 2025-05-07T19:46:51.2178489Z #define __USE_XOPEN2KXSI 1 2025-05-07T19:46:51.2178581Z #define __USE_XOPEN_EXTENDED 1 2025-05-07T19:46:51.2178683Z #define __USING_NAMESPACE_C99(name) 2025-05-07T19:46:51.2178843Z #define __USING_NAMESPACE_STD(name) 2025-05-07T19:46:51.2178944Z #define __UWORD_TYPE unsigned long int 2025-05-07T19:46:51.2179034Z #define __VECTOR_FUNCTIONS_HPP__ 2025-05-07T19:46:51.2179134Z #define __VECTOR_FUNCTIONS_H__ 2025-05-07T19:46:51.2179224Z #define __VECTOR_TYPES_H__ 2025-05-07T19:46:51.2179745Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:46:51.2179880Z #define __WAIT_INT(status) (*(int *) &(status)) 2025-05-07T19:46:51.2180143Z #define __WAIT_STATUS void * 2025-05-07T19:46:51.2180239Z #define __WAIT_STATUS_DEFN void * 2025-05-07T19:46:51.2180327Z #define __WALL 0x40000000 2025-05-07T19:46:51.2180429Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:46:51.2180518Z #define __WCHAR_TYPE__ int 2025-05-07T19:46:51.2180609Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:46:51.2180704Z #define __WCLONE 0x80000000 2025-05-07T19:46:51.2180840Z #define __WCOREDUMP(status) ((status) & __WCOREFLAG) 2025-05-07T19:46:51.2180931Z #define __WCOREFLAG 0x80 2025-05-07T19:46:51.2181078Z #define __WEXITSTATUS(status) (((status) & 0xff00) >> 8) 2025-05-07T19:46:51.2181237Z #define __WIFCONTINUED(status) ((status) == __W_CONTINUED) 2025-05-07T19:46:51.2181376Z #define __WIFEXITED(status) (__WTERMSIG(status) == 0) 2025-05-07T19:46:51.2181598Z #define __WIFSIGNALED(status) (((signed char) (((status) & 0x7f) + 1) >> 1) > 0) 2025-05-07T19:46:51.2181754Z #define __WIFSTOPPED(status) (((status) & 0xff) == 0x7f) 2025-05-07T19:46:51.2181843Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:46:51.2181937Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:46:51.2182039Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:46:51.2182124Z #define __WINT_WIDTH__ 32 2025-05-07T19:46:51.2182214Z #define __WNOTHREAD 0x20000000 2025-05-07T19:46:51.2182300Z #define __WORDSIZE 64 2025-05-07T19:46:51.2182410Z #define __WORDSIZE_TIME64_COMPAT32 1 2025-05-07T19:46:51.2182534Z #define __WSTOPSIG(status) __WEXITSTATUS(status) 2025-05-07T19:46:51.2182650Z #define __WTERMSIG(status) ((status) & 0x7f) 2025-05-07T19:46:51.2182754Z #define __W_CONTINUED 0xffff 2025-05-07T19:46:51.2182876Z #define __W_EXITCODE(ret,sig) ((ret) << 8 | (sig)) 2025-05-07T19:46:51.2182984Z #define __W_STOPCODE(sig) ((sig) << 8 | 0x7f) 2025-05-07T19:46:51.2183073Z #define ____FILE_defined 1 2025-05-07T19:46:51.2183180Z #define ____mbstate_t_defined 1 2025-05-07T19:46:51.2183298Z #define __align__(n) __attribute__((aligned(n))) 2025-05-07T19:46:51.2183546Z #define __always_inline __inline __attribute__ ((__always_inline__)) 2025-05-07T19:46:51.2183639Z #define __amd64 1 2025-05-07T19:46:51.2183719Z #define __amd64__ 1 2025-05-07T19:46:51.2183824Z #define __annotate__(a) __attribute__((a)) 2025-05-07T19:46:51.2183920Z #define __attribute_artificial__ 2025-05-07T19:46:51.2184074Z #define __attribute_const__ __attribute__ ((__const__)) 2025-05-07T19:46:51.2184252Z #define __attribute_deprecated__ __attribute__ ((__deprecated__)) 2025-05-07T19:46:51.2184457Z #define __attribute_format_arg__(x) __attribute__ ((__format_arg__ (x))) 2025-05-07T19:46:51.2184726Z #define __attribute_format_strfmon__(a,b) __attribute__ ((__format__ (__strfmon__, a, b))) 2025-05-07T19:46:51.2184876Z #define __attribute_malloc__ __attribute__ ((__malloc__)) 2025-05-07T19:46:51.2185034Z #define __attribute_noinline__ __attribute__ ((__noinline__)) 2025-05-07T19:46:51.2185179Z #define __attribute_pure__ __attribute__ ((__pure__)) 2025-05-07T19:46:51.2185314Z #define __attribute_used__ __attribute__ ((__used__)) 2025-05-07T19:46:51.2185549Z #define __attribute_warn_unused_result__ __attribute__ ((__warn_unused_result__)) 2025-05-07T19:46:51.2185638Z #define __blkcnt_t_defined 2025-05-07T19:46:51.2185738Z #define __blksize_t_defined 2025-05-07T19:46:51.2185931Z #define __bos(ptr) __builtin_object_size (ptr, __USE_FORTIFY_LEVEL > 1) 2025-05-07T19:46:51.2186062Z #define __bos0(ptr) __builtin_object_size (ptr, 0) 2025-05-07T19:46:51.2186149Z #define __bounded 2025-05-07T19:46:51.2186819Z #define __bswap_16(x) (__extension__ ({ unsigned short int __v, __x = (unsigned short int) (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_16 (__x); else __asm__ ("rorw $8, %w0" : "=r" (__v) : "0" (__x) : "cc"); __v; })) 2025-05-07T19:46:51.2187316Z #define __bswap_32(x) (__extension__ ({ unsigned int __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_32 (__x); else __asm__ ("bswap %0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:46:51.2187802Z #define __bswap_64(x) (__extension__ ({ __uint64_t __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_64 (__x); else __asm__ ("bswap %q0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:46:51.2188062Z #define __bswap_constant_16(x) ((unsigned short int) ((((x) >> 8) & 0xff) | (((x) & 0xff) << 8))) 2025-05-07T19:46:51.2188395Z #define __bswap_constant_32(x) ((((x) & 0xff000000) >> 24) | (((x) & 0x00ff0000) >> 8) | (((x) & 0x0000ff00) << 8) | (((x) & 0x000000ff) << 24)) 2025-05-07T19:46:51.2189384Z #define __bswap_constant_64(x) (__extension__ ((((x) & 0xff00000000000000ull) >> 56) | (((x) & 0x00ff000000000000ull) >> 40) | (((x) & 0x0000ff0000000000ull) >> 24) | (((x) & 0x000000ff00000000ull) >> 8) | (((x) & 0x00000000ff000000ull) << 8) | (((x) & 0x0000000000ff0000ull) << 24) | (((x) & 0x000000000000ff00ull) << 40) | (((x) & 0x00000000000000ffull) << 56))) 2025-05-07T19:46:51.2189489Z #define __builtin_align__(a) __align__(a) 2025-05-07T19:46:51.2189584Z #define __catch(X) catch(X) 2025-05-07T19:46:51.2189670Z #define __cdecl 2025-05-07T19:46:51.2189746Z #define __clang__ 1 2025-05-07T19:46:51.2189853Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:46:51.2189950Z #define __clang_major__ 16 2025-05-07T19:46:51.2190032Z #define __clang_minor__ 0 2025-05-07T19:46:51.2190125Z #define __clang_patchlevel__ 6 2025-05-07T19:46:51.2190550Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:46:51.2190689Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:46:51.2190775Z #define __clock_t_defined 1 2025-05-07T19:46:51.2190865Z #define __clockid_t_defined 1 2025-05-07T19:46:51.2191067Z #define __cluster_dims__(...) __attribute__((cluster_dims(__VA_ARGS__))) 2025-05-07T19:46:51.2191160Z #define __code_model_small__ 1 2025-05-07T19:46:51.2191267Z #define __constant__ __location__(constant) 2025-05-07T19:46:51.2191366Z #define __cplusplus 201703L 2025-05-07T19:46:51.2191520Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:46:51.2191619Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:46:51.2191716Z #define __cpp_alias_templates 200704L 2025-05-07T19:46:51.2191821Z #define __cpp_aligned_new 201606L 2025-05-07T19:46:51.2191917Z #define __cpp_attributes 200809L 2025-05-07T19:46:51.2192017Z #define __cpp_binary_literals 201304L 2025-05-07T19:46:51.2192236Z #define __cpp_capture_star_this 201603L 2025-05-07T19:46:51.2192329Z #define __cpp_constexpr 201603L 2025-05-07T19:46:51.2192431Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:46:51.2192523Z #define __cpp_decltype 200707L 2025-05-07T19:46:51.2192625Z #define __cpp_decltype_auto 201304L 2025-05-07T19:46:51.2192721Z #define __cpp_deduction_guides 201703L 2025-05-07T19:46:51.2192833Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:46:51.2192938Z #define __cpp_digit_separators 201309L 2025-05-07T19:46:51.2193041Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:46:51.2193136Z #define __cpp_exceptions 199711L 2025-05-07T19:46:51.2193227Z #define __cpp_fold_expressions 201603L 2025-05-07T19:46:51.2193327Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:46:51.2193439Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:46:51.2193525Z #define __cpp_hex_float 201603L 2025-05-07T19:46:51.2193626Z #define __cpp_if_constexpr 201606L 2025-05-07T19:46:51.2193732Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:46:51.2193846Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:46:51.2193945Z #define __cpp_init_captures 201304L 2025-05-07T19:46:51.2194090Z #define __cpp_initializer_lists 200806L 2025-05-07T19:46:51.2194185Z #define __cpp_inline_variables 201606L 2025-05-07T19:46:51.2194269Z #define __cpp_lambdas 200907L 2025-05-07T19:46:51.2194383Z #define __cpp_lib_addressof_constexpr 201603 2025-05-07T19:46:51.2194483Z #define __cpp_lib_array_constexpr 201803L 2025-05-07T19:46:51.2194572Z #define __cpp_lib_as_const 201510 2025-05-07T19:46:51.2194669Z #define __cpp_lib_bool_constant 201505 2025-05-07T19:46:51.2194773Z #define __cpp_lib_exchange_function 201304 2025-05-07T19:46:51.2194924Z #define __cpp_lib_has_unique_object_representations 201606 2025-05-07T19:46:51.2195014Z #define __cpp_lib_hypot 201603 2025-05-07T19:46:51.2195117Z #define __cpp_lib_integer_sequence 201304 2025-05-07T19:46:51.2195240Z #define __cpp_lib_integral_constant_callable 201304 2025-05-07T19:46:51.2195335Z #define __cpp_lib_is_aggregate 201703 2025-05-07T19:46:51.2195433Z #define __cpp_lib_is_final 201402L 2025-05-07T19:46:51.2195527Z #define __cpp_lib_is_invocable 201703 2025-05-07T19:46:51.2195621Z #define __cpp_lib_is_null_pointer 201309 2025-05-07T19:46:51.2195712Z #define __cpp_lib_is_swappable 201603 2025-05-07T19:46:51.2195807Z #define __cpp_lib_launder 201606 2025-05-07T19:46:51.2195899Z #define __cpp_lib_logical_traits 201510 2025-05-07T19:46:51.2196012Z #define __cpp_lib_make_reverse_iterator 201402 2025-05-07T19:46:51.2196137Z #define __cpp_lib_math_special_functions 201603L 2025-05-07T19:46:51.2196235Z #define __cpp_lib_result_of_sfinae 201210 2025-05-07T19:46:51.2196360Z #define __cpp_lib_robust_nonmodifying_seq_ops 201304 2025-05-07T19:46:51.2196497Z #define __cpp_lib_transformation_trait_aliases 201304 2025-05-07T19:46:51.2196594Z #define __cpp_lib_tuple_element_t 201402L 2025-05-07T19:46:51.2196684Z #define __cpp_lib_tuples_by_type 201304 2025-05-07T19:46:51.2196821Z #define __cpp_lib_type_trait_variable_templates 201510L 2025-05-07T19:46:51.2196914Z #define __cpp_lib_void_t 201411 2025-05-07T19:46:51.2197019Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:46:51.2197124Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:46:51.2197255Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:46:51.2197358Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:46:51.2197458Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:46:51.2197587Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:46:51.2197682Z #define __cpp_nsdmi 200809L 2025-05-07T19:46:51.2197823Z #define __cpp_range_based_for 201603L 2025-05-07T19:46:51.2197907Z #define __cpp_raw_strings 200710L 2025-05-07T19:46:51.2198010Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:46:51.2198110Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:46:51.2198193Z #define __cpp_rtti 199711L 2025-05-07T19:46:51.2198287Z #define __cpp_rvalue_references 200610L 2025-05-07T19:46:51.2198388Z #define __cpp_static_assert 201411L 2025-05-07T19:46:51.2198487Z #define __cpp_static_call_operator 202207L 2025-05-07T19:46:51.2198586Z #define __cpp_structured_bindings 201606L 2025-05-07T19:46:51.2198686Z #define __cpp_template_auto 201606L 2025-05-07T19:46:51.2198791Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:46:51.2198886Z #define __cpp_unicode_characters 200704L 2025-05-07T19:46:51.2198991Z #define __cpp_unicode_literals 200710L 2025-05-07T19:46:51.2199093Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:46:51.2199188Z #define __cpp_variable_templates 201304L 2025-05-07T19:46:51.2199287Z #define __cpp_variadic_templates 200704L 2025-05-07T19:46:51.2199555Z #define __cpp_variadic_using 201611L 2025-05-07T19:46:51.2199664Z #define __cudaCDP2DeviceGetAttribute 2025-05-07T19:46:51.2199771Z #define __cudaCDP2DeviceGetCacheConfig 2025-05-07T19:46:51.2199880Z #define __cudaCDP2DeviceGetLimit 2025-05-07T19:46:51.2200001Z #define __cudaCDP2DeviceGetSharedMemConfig 2025-05-07T19:46:51.2200110Z #define __cudaCDP2EventCreateWithFlags 2025-05-07T19:46:51.2200206Z #define __cudaCDP2EventDestroy 2025-05-07T19:46:51.2200363Z #define __cudaCDP2EventRecord 2025-05-07T19:46:51.2200474Z #define __cudaCDP2EventRecordWithFlags 2025-05-07T19:46:51.2200595Z #define __cudaCDP2EventRecordWithFlags_ptsz 2025-05-07T19:46:51.2200701Z #define __cudaCDP2EventRecord_ptsz 2025-05-07T19:46:51.2200784Z #define __cudaCDP2Free 2025-05-07T19:46:51.2200888Z #define __cudaCDP2FuncGetAttributes 2025-05-07T19:46:51.2200979Z #define __cudaCDP2GetDevice 2025-05-07T19:46:51.2201085Z #define __cudaCDP2GetDeviceCount 2025-05-07T19:46:51.2201182Z #define __cudaCDP2GetErrorName 2025-05-07T19:46:51.2201279Z #define __cudaCDP2GetErrorString 2025-05-07T19:46:51.2201378Z #define __cudaCDP2GetLastError 2025-05-07T19:46:51.2201486Z #define __cudaCDP2GetParameterBuffer 2025-05-07T19:46:51.2201597Z #define __cudaCDP2GetParameterBufferV2 2025-05-07T19:46:51.2201690Z #define __cudaCDP2LaunchDevice 2025-05-07T19:46:51.2201793Z #define __cudaCDP2LaunchDeviceV2 2025-05-07T19:46:51.2201900Z #define __cudaCDP2LaunchDeviceV2_ptsz 2025-05-07T19:46:51.2202005Z #define __cudaCDP2LaunchDevice_ptsz 2025-05-07T19:46:51.2202100Z #define __cudaCDP2Malloc 2025-05-07T19:46:51.2202201Z #define __cudaCDP2Memcpy2DAsync 2025-05-07T19:46:51.2202305Z #define __cudaCDP2Memcpy2DAsync_ptsz 2025-05-07T19:46:51.2202402Z #define __cudaCDP2Memcpy3DAsync 2025-05-07T19:46:51.2202513Z #define __cudaCDP2Memcpy3DAsync_ptsz 2025-05-07T19:46:51.2202610Z #define __cudaCDP2MemcpyAsync 2025-05-07T19:46:51.2202712Z #define __cudaCDP2MemcpyAsync_ptsz 2025-05-07T19:46:51.2202821Z #define __cudaCDP2Memset2DAsync 2025-05-07T19:46:51.2202924Z #define __cudaCDP2Memset2DAsync_ptsz 2025-05-07T19:46:51.2203021Z #define __cudaCDP2Memset3DAsync 2025-05-07T19:46:51.2203134Z #define __cudaCDP2Memset3DAsync_ptsz 2025-05-07T19:46:51.2203228Z #define __cudaCDP2MemsetAsync 2025-05-07T19:46:51.2203330Z #define __cudaCDP2MemsetAsync_ptsz 2025-05-07T19:46:51.2203532Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessor 2025-05-07T19:46:51.2203788Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessorWithFlags 2025-05-07T19:46:51.2203894Z #define __cudaCDP2PeekAtLastError 2025-05-07T19:46:51.2203997Z #define __cudaCDP2RuntimeGetVersion 2025-05-07T19:46:51.2204121Z #define __cudaCDP2StreamCreateWithFlags 2025-05-07T19:46:51.2204214Z #define __cudaCDP2StreamDestroy 2025-05-07T19:46:51.2204327Z #define __cudaCDP2StreamWaitEvent 2025-05-07T19:46:51.2204442Z #define __cudaCDP2StreamWaitEvent_ptsz 2025-05-07T19:46:51.2204535Z #define __cudaGet_blockDim() blockDim 2025-05-07T19:46:51.2204700Z #define __cudaGet_blockIdx() blockIdx 2025-05-07T19:46:51.2204795Z #define __cudaGet_gridDim() gridDim 2025-05-07T19:46:51.2204902Z #define __cudaGet_threadIdx() threadIdx 2025-05-07T19:46:51.2204997Z #define __cudaGet_warpSize() warpSize 2025-05-07T19:46:51.2205137Z #define __cudart_builtin__ __location__(cudart_builtin) 2025-05-07T19:46:51.2205232Z #define __daddr_t_defined 2025-05-07T19:46:51.2205313Z #define __dev_t_defined 2025-05-07T19:46:51.2205405Z #define __device__ __location__(device) 2025-05-07T19:46:51.2205543Z #define __device_builtin__ __location__(device_builtin) 2025-05-07T19:46:51.2205781Z #define __device_builtin_surface_type__ __location__(device_builtin_surface_type) 2025-05-07T19:46:51.2206006Z #define __device_builtin_texture_type__ __location__(device_builtin_texture_type) 2025-05-07T19:46:51.2206139Z #define __errordecl(name,msg) extern void name (void) 2025-05-07T19:46:51.2206283Z #define __exctype(name) extern int name (int) __THROW 2025-05-07T19:46:51.2206461Z #define __exctype_l(name) extern int name (int, __locale_t) __THROW 2025-05-07T19:46:51.2206541Z #define __export__ 2025-05-07T19:46:51.2206800Z #define __extern_always_inline extern __always_inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:51.2206996Z #define __extern_inline extern __inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:51.2207078Z #define __flexarr [] 2025-05-07T19:46:51.2207251Z #define __forceinline__ __inline__ __attribute__((always_inline)) 2025-05-07T19:46:51.2207521Z #define __fortify_function __extern_always_inline __attribute_artificial__ 2025-05-07T19:46:51.2207612Z #define __fsblkcnt_t_defined 2025-05-07T19:46:51.2207699Z #define __fsfilcnt_t_defined 2025-05-07T19:46:51.2207793Z #define __gid_t_defined 2025-05-07T19:46:51.2207939Z #define __glibc_likely(cond) __builtin_expect((cond), 1) 2025-05-07T19:46:51.2208088Z #define __glibc_unlikely(cond) __builtin_expect((cond), 0) 2025-05-07T19:46:51.2208322Z #define __glibcxx_assert(cond) do { __glibcxx_constexpr_assert(cond); } while (false) 2025-05-07T19:46:51.2208441Z #define __glibcxx_class_requires(_a,_b) 2025-05-07T19:46:51.2208549Z #define __glibcxx_class_requires2(_a,_b,_c) 2025-05-07T19:46:51.2208667Z #define __glibcxx_class_requires3(_a,_b,_c,_d) 2025-05-07T19:46:51.2208797Z #define __glibcxx_class_requires4(_a,_b,_c,_d,_e) 2025-05-07T19:46:51.2209157Z #define __glibcxx_constexpr_assert(cond) if (__builtin_is_constant_evaluated() && !bool(cond)) __builtin_unreachable() 2025-05-07T19:46:51.2209355Z #define __glibcxx_digits10_b(T,B) (__glibcxx_digits_b (T,B) * 643L / 2136) 2025-05-07T19:46:51.2209530Z #define __glibcxx_digits_b(T,B) (B - __glibcxx_signed_b (T,B)) 2025-05-07T19:46:51.2209637Z #define __glibcxx_function_requires(...) 2025-05-07T19:46:51.2209738Z #define __glibcxx_integral_traps true 2025-05-07T19:46:51.2210047Z #define __glibcxx_max_b(T,B) (__glibcxx_signed_b (T,B) ? (((((T)1 << (__glibcxx_digits_b (T,B) - 1)) - 1) << 1) + 1) : ~(T)0) 2025-05-07T19:46:51.2210294Z #define __glibcxx_min_b(T,B) (__glibcxx_signed_b (T,B) ? -__glibcxx_max_b (T,B) - 1 : (T)0) 2025-05-07T19:46:51.2210493Z #define __glibcxx_requires_can_decrement_range(_First1,_Last1,_First2) 2025-05-07T19:46:51.2210637Z #define __glibcxx_requires_can_increment(_First,_Size) 2025-05-07T19:46:51.2210838Z #define __glibcxx_requires_can_increment_range(_First1,_Last1,_First2) 2025-05-07T19:46:51.2210950Z #define __glibcxx_requires_cond(_Cond,_Msg) 2025-05-07T19:46:51.2211067Z #define __glibcxx_requires_heap(_First,_Last) 2025-05-07T19:46:51.2211225Z #define __glibcxx_requires_heap_pred(_First,_Last,_Pred) 2025-05-07T19:46:51.2211358Z #define __glibcxx_requires_irreflexive(_First,_Last) 2025-05-07T19:46:51.2211497Z #define __glibcxx_requires_irreflexive2(_First,_Last) 2025-05-07T19:46:51.2211679Z #define __glibcxx_requires_irreflexive_pred(_First,_Last,_Pred) 2025-05-07T19:46:51.2211960Z #define __glibcxx_requires_irreflexive_pred2(_First,_Last,_Pred) 2025-05-07T19:46:51.2212100Z #define __glibcxx_requires_non_empty_range(_First,_Last) 2025-05-07T19:46:51.2212248Z #define __glibcxx_requires_nonempty() 2025-05-07T19:46:51.2212428Z #define __glibcxx_requires_partitioned_lower(_First,_Last,_Value) 2025-05-07T19:46:51.2212636Z #define __glibcxx_requires_partitioned_lower_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:51.2212809Z #define __glibcxx_requires_partitioned_upper(_First,_Last,_Value) 2025-05-07T19:46:51.2213023Z #define __glibcxx_requires_partitioned_upper_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:51.2213135Z #define __glibcxx_requires_sorted(_First,_Last) 2025-05-07T19:46:51.2213285Z #define __glibcxx_requires_sorted_pred(_First,_Last,_Pred) 2025-05-07T19:46:51.2213447Z #define __glibcxx_requires_sorted_set(_First1,_Last1,_First2) 2025-05-07T19:46:51.2213637Z #define __glibcxx_requires_sorted_set_pred(_First1,_Last1,_First2,_Pred) 2025-05-07T19:46:51.2213736Z #define __glibcxx_requires_string(_String) 2025-05-07T19:46:51.2213857Z #define __glibcxx_requires_string_len(_String,_Len) 2025-05-07T19:46:51.2213966Z #define __glibcxx_requires_subscript(_N) 2025-05-07T19:46:51.2214087Z #define __glibcxx_requires_valid_range(_First,_Last) 2025-05-07T19:46:51.2214188Z #define __glibcxx_signed_b(T,B) ((T)(-1) < 0) 2025-05-07T19:46:51.2214285Z #define __global__ __location__(global) 2025-05-07T19:46:51.2214361Z #define __gnu_linux__ 1 2025-05-07T19:46:51.2214482Z #define __grid_constant__ __location__(grid_constant) 2025-05-07T19:46:51.2214585Z #define __have_pthread_attr_t 1 2025-05-07T19:46:51.2214671Z #define __host__ __location__(host) 2025-05-07T19:46:51.2214797Z #define __id_t_defined 2025-05-07T19:46:51.2214870Z #define __import__ 2025-05-07T19:46:51.2215009Z #define __inline_hint__ __attribute__((nv_inline_hint)) 2025-05-07T19:46:51.2215088Z #define __ino64_t_defined 2025-05-07T19:46:51.2215165Z #define __ino_t_defined 2025-05-07T19:46:51.2215257Z #define __int8_t_defined 2025-05-07T19:46:51.2215469Z #define __intN_t(N,MODE) typedef int int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:51.2215605Z #define __isalnum_l(c,l) __isctype_l((c), _ISalnum, (l)) 2025-05-07T19:46:51.2215736Z #define __isalpha_l(c,l) __isctype_l((c), _ISalpha, (l)) 2025-05-07T19:46:51.2215838Z #define __isascii(c) (((c) & ~0x7f) == 0) 2025-05-07T19:46:51.2215941Z #define __isascii_l(c,l) ((l), __isascii (c)) 2025-05-07T19:46:51.2216072Z #define __isblank_l(c,l) __isctype_l((c), _ISblank, (l)) 2025-05-07T19:46:51.2216209Z #define __iscntrl_l(c,l) __isctype_l((c), _IScntrl, (l)) 2025-05-07T19:46:51.2216465Z #define __isctype_l(c,type,locale) ((locale)->__ctype_b[(int) (c)] & (unsigned short int) type) 2025-05-07T19:46:51.2216596Z #define __isdigit_l(c,l) __isctype_l((c), _ISdigit, (l)) 2025-05-07T19:46:51.2216734Z #define __isgraph_l(c,l) __isctype_l((c), _ISgraph, (l)) 2025-05-07T19:46:51.2216917Z #define __isleap(year) ((year) % 4 == 0 && ((year) % 100 != 0 || (year) % 400 == 0)) 2025-05-07T19:46:51.2217048Z #define __islower_l(c,l) __isctype_l((c), _ISlower, (l)) 2025-05-07T19:46:51.2217180Z #define __isprint_l(c,l) __isctype_l((c), _ISprint, (l)) 2025-05-07T19:46:51.2217316Z #define __ispunct_l(c,l) __isctype_l((c), _ISpunct, (l)) 2025-05-07T19:46:51.2217449Z #define __isspace_l(c,l) __isctype_l((c), _ISspace, (l)) 2025-05-07T19:46:51.2217577Z #define __isupper_l(c,l) __isctype_l((c), _ISupper, (l)) 2025-05-07T19:46:51.2217718Z #define __isxdigit_l(c,l) __isctype_l((c), _ISxdigit, (l)) 2025-05-07T19:46:51.2217793Z #define __k8 1 2025-05-07T19:46:51.2217870Z #define __k8__ 1 2025-05-07T19:46:51.2217946Z #define __key_t_defined 2025-05-07T19:46:51.2218138Z #define __launch_bounds__(...) __annotate__(launch_bounds(__VA_ARGS__)) 2025-05-07T19:46:51.2218221Z #define __ldiv_t_defined 1 2025-05-07T19:46:51.2218295Z #define __linux 1 2025-05-07T19:46:51.2218371Z #define __linux__ 1 2025-05-07T19:46:51.2218455Z #define __lldiv_t_defined 1 2025-05-07T19:46:51.2218527Z #define __llvm__ 1 2025-05-07T19:46:51.2218620Z #define __location__(a) __annotate__(a) 2025-05-07T19:46:51.2218713Z #define __long_double_t long double 2025-05-07T19:46:51.2218851Z #define __malloc_and_calloc_defined 2025-05-07T19:46:51.2218946Z #define __managed__ __location__(managed) 2025-05-07T19:46:51.2219065Z #define __maxnreg__(a) __attribute__((maxnreg(a))) 2025-05-07T19:46:51.2219142Z #define __mode_t_defined 2025-05-07T19:46:51.2219216Z #define __need_IOV_MAX 2025-05-07T19:46:51.2219301Z #define __need_clockid_t 2025-05-07T19:46:51.2219381Z #define __nlink_t_defined 2025-05-07T19:46:51.2219567Z #define __no_return__ __attribute__((noreturn)) 2025-05-07T19:46:51.2219684Z #define __noinline__ __attribute__((noinline)) 2025-05-07T19:46:51.2219851Z #define __nonnull(params) __attribute__ ((__nonnull__ params)) 2025-05-07T19:46:51.2220122Z #define __nv_pure__ __location__(nv_pure) 2025-05-07T19:46:51.2220210Z #define __off64_t_defined 2025-05-07T19:46:51.2220308Z #define __off_t_defined 2025-05-07T19:46:51.2220387Z #define __pic__ 2 2025-05-07T19:46:51.2220472Z #define __pid_t_defined 2025-05-07T19:46:51.2220554Z #define __pie__ 2 2025-05-07T19:46:51.2220666Z #define __private_extern__ extern 2025-05-07T19:46:51.2220750Z #define __ptr_t void * 2025-05-07T19:46:51.2220906Z #define __ptrvalue 2025-05-07T19:46:51.2220998Z #define __restrict_arr 2025-05-07T19:46:51.2221127Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:46:51.2221254Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:46:51.2221354Z #define __shared__ __location__(shared) 2025-05-07T19:46:51.2221448Z #define __sigset_t_defined 2025-05-07T19:46:51.2221549Z #define __specialization_static 2025-05-07T19:46:51.2221691Z #define __ssize_t_defined 2025-05-07T19:46:51.2221787Z #define __stub_bdflush 2025-05-07T19:46:51.2221871Z #define __stub_chflags 2025-05-07T19:46:51.2222136Z #define __stub_fattach 2025-05-07T19:46:51.2222223Z #define __stub_fchflags 2025-05-07T19:46:51.2222316Z #define __stub_fdetach 2025-05-07T19:46:51.2222401Z #define __stub_getmsg 2025-05-07T19:46:51.2222480Z #define __stub_gtty 2025-05-07T19:46:51.2222576Z #define __stub_lchmod 2025-05-07T19:46:51.2222657Z #define __stub_putmsg 2025-05-07T19:46:51.2222738Z #define __stub_revoke 2025-05-07T19:46:51.2222823Z #define __stub_setlogin 2025-05-07T19:46:51.2222919Z #define __stub_sigreturn 2025-05-07T19:46:51.2222999Z #define __stub_sstk 2025-05-07T19:46:51.2223081Z #define __stub_stty 2025-05-07T19:46:51.2223188Z #define __suseconds_t_defined 2025-05-07T19:46:51.2223274Z #define __thread__ __thread 2025-05-07T19:46:51.2223376Z #define __throw_exception_again throw 2025-05-07T19:46:51.2223466Z #define __time_t_defined 1 2025-05-07T19:46:51.2223569Z #define __timer_t_defined 1 2025-05-07T19:46:51.2223661Z #define __timespec_defined 1 2025-05-07T19:46:51.2223750Z #define __toascii(c) ((c) & 0x7f) 2025-05-07T19:46:51.2223874Z #define __toascii_l(c,l) ((l), __toascii (c)) 2025-05-07T19:46:51.2224440Z #define __tobody(c,f,a,args) (__extension__ ({ int __res; if (sizeof (c) > 1) { if (__builtin_constant_p (c)) { int __c = (c); __res = __c < -128 || __c > 255 ? __c : (a)[__c]; } else __res = f args; } else __res = (a)[(int) (c)]; __res; })) 2025-05-07T19:46:51.2224521Z #define __try try 2025-05-07T19:46:51.2224599Z #define __tune_k8__ 1 2025-05-07T19:46:51.2224696Z #define __u_char_defined 2025-05-07T19:46:51.2224967Z #define __u_intN_t(N,MODE) typedef unsigned int u_int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:51.2225050Z #define __uid_t_defined 2025-05-07T19:46:51.2225141Z #define __unbounded 2025-05-07T19:46:51.2225217Z #define __unix 1 2025-05-07T19:46:51.2225293Z #define __unix__ 1 2025-05-07T19:46:51.2225384Z #define __useconds_t_defined 2025-05-07T19:46:51.2225483Z #define __warnattr(msg) 2025-05-07T19:46:51.2225617Z #define __warndecl(name,msg) extern void name (void) 2025-05-07T19:46:51.2225690Z #define __wur 2025-05-07T19:46:51.2225777Z #define __x86_64 1 2025-05-07T19:46:51.2225862Z #define __x86_64__ 1 2025-05-07T19:46:51.2226029Z #define _tolower(c) ((int) (*__ctype_tolower_loc ())[(int) (c)]) 2025-05-07T19:46:51.2226204Z #define _toupper(c) ((int) (*__ctype_toupper_loc ())[(int) (c)]) 2025-05-07T19:46:51.2226416Z #define alloca(size) __builtin_alloca (size) 2025-05-07T19:46:51.2226767Z #define assert(expr) ((expr) ? __ASSERT_VOID_CAST (0) : __assert_fail (__STRING(expr), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:51.2227357Z #define assert_perror(errnum) (!(errnum) ? __ASSERT_VOID_CAST (0) : __assert_perror_fail ((errnum), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:51.2227464Z #define be16toh(x) __bswap_16 (x) 2025-05-07T19:46:51.2227556Z #define be32toh(x) __bswap_32 (x) 2025-05-07T19:46:51.2227648Z #define be64toh(x) __bswap_64 (x) 2025-05-07T19:46:51.2227769Z #define cudaArrayColorAttachment 0x20 2025-05-07T19:46:51.2227866Z #define cudaArrayCubemap 0x04 2025-05-07T19:46:51.2227966Z #define cudaArrayDefault 0x00 2025-05-07T19:46:51.2228074Z #define cudaArrayDeferredMapping 0x80 2025-05-07T19:46:51.2228173Z #define cudaArrayLayered 0x01 2025-05-07T19:46:51.2228268Z #define cudaArraySparse 0x40 2025-05-07T19:46:51.2228429Z #define cudaArraySparsePropertiesSingleMipTail 0x1 2025-05-07T19:46:51.2228548Z #define cudaArraySurfaceLoadStore 0x02 2025-05-07T19:46:51.2228654Z #define cudaArrayTextureGather 0x08 2025-05-07T19:46:51.2228839Z #define cudaCooperativeLaunchMultiDeviceNoPostSync 0x02 2025-05-07T19:46:51.2229019Z #define cudaCooperativeLaunchMultiDeviceNoPreSync 0x01 2025-05-07T19:46:51.2229115Z #define cudaCpuDeviceId ((int)-1) 2025-05-07T19:46:51.2229216Z #define cudaDeviceBlockingSync 0x04 2025-05-07T19:46:51.2229326Z #define cudaDeviceLmemResizeToMax 0x10 2025-05-07T19:46:51.2229539Z #define cudaDeviceMapHost 0x08 2025-05-07T19:46:51.2229629Z #define cudaDeviceMask 0xff 2025-05-07T19:46:51.2229735Z #define cudaDeviceScheduleAuto 0x00 2025-05-07T19:46:51.2229867Z #define cudaDeviceScheduleBlockingSync 0x04 2025-05-07T19:46:51.2229972Z #define cudaDeviceScheduleMask 0x07 2025-05-07T19:46:51.2230074Z #define cudaDeviceScheduleSpin 0x01 2025-05-07T19:46:51.2230179Z #define cudaDeviceScheduleYield 0x02 2025-05-07T19:46:51.2230292Z #define cudaDeviceSyncMemops 0x80 2025-05-07T19:46:51.2230393Z #define cudaEventBlockingSync 0x01 2025-05-07T19:46:51.2230487Z #define cudaEventDefault 0x00 2025-05-07T19:46:51.2230595Z #define cudaEventDisableTiming 0x02 2025-05-07T19:46:51.2230695Z #define cudaEventInterprocess 0x04 2025-05-07T19:46:51.2230798Z #define cudaEventRecordDefault 0x00 2025-05-07T19:46:51.2230903Z #define cudaEventRecordExternal 0x01 2025-05-07T19:46:51.2231010Z #define cudaEventWaitDefault 0x00 2025-05-07T19:46:51.2231113Z #define cudaEventWaitExternal 0x01 2025-05-07T19:46:51.2231236Z #define cudaExternalMemoryDedicated 0x1 2025-05-07T19:46:51.2231445Z #define cudaExternalSemaphoreSignalSkipNvSciBufMemSync 0x01 2025-05-07T19:46:51.2231631Z #define cudaExternalSemaphoreWaitSkipNvSciBufMemSync 0x02 2025-05-07T19:46:51.2231811Z #define cudaGetDeviceProperties cudaGetDeviceProperties_v2 2025-05-07T19:46:51.2231941Z #define cudaGraphKernelNodePortDefault 0 2025-05-07T19:46:51.2232094Z #define cudaGraphKernelNodePortLaunchCompletion 2 2025-05-07T19:46:51.2232236Z #define cudaGraphKernelNodePortProgrammatic 1 2025-05-07T19:46:51.2232338Z #define cudaHostAllocDefault 0x00 2025-05-07T19:46:51.2232451Z #define cudaHostAllocMapped 0x02 2025-05-07T19:46:51.2232554Z #define cudaHostAllocPortable 0x01 2025-05-07T19:46:51.2232666Z #define cudaHostAllocWriteCombined 0x04 2025-05-07T19:46:51.2232778Z #define cudaHostRegisterDefault 0x00 2025-05-07T19:46:51.2232884Z #define cudaHostRegisterIoMemory 0x04 2025-05-07T19:46:51.2232989Z #define cudaHostRegisterMapped 0x02 2025-05-07T19:46:51.2233103Z #define cudaHostRegisterPortable 0x01 2025-05-07T19:46:51.2233219Z #define cudaHostRegisterReadOnly 0x08 2025-05-07T19:46:51.2233336Z #define cudaInitDeviceFlagsAreValid 0x01 2025-05-07T19:46:51.2233439Z #define cudaInvalidDeviceId ((int)-2) 2025-05-07T19:46:51.2233570Z #define cudaIpcMemLazyEnablePeerAccess 0x01 2025-05-07T19:46:51.2233710Z #define cudaKernelNodeAttrID cudaLaunchAttributeID 2025-05-07T19:46:51.2233878Z #define cudaKernelNodeAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:51.2234410Z #define cudaKernelNodeAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:51.2234722Z #define cudaKernelNodeAttributeClusterDimension cudaLaunchAttributeClusterDimension 2025-05-07T19:46:51.2235237Z #define cudaKernelNodeAttributeClusterSchedulingPolicyPreference cudaLaunchAttributeClusterSchedulingPolicyPreference 2025-05-07T19:46:51.2235490Z #define cudaKernelNodeAttributeCooperative cudaLaunchAttributeCooperative 2025-05-07T19:46:51.2235913Z #define cudaKernelNodeAttributeDeviceUpdatableKernelNode cudaLaunchAttributeDeviceUpdatableKernelNode 2025-05-07T19:46:51.2236186Z #define cudaKernelNodeAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:51.2236492Z #define cudaKernelNodeAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 2025-05-07T19:46:51.2236958Z #define cudaKernelNodeAttributePreferredSharedMemoryCarveout cudaLaunchAttributePreferredSharedMemoryCarveout 2025-05-07T19:46:51.2237185Z #define cudaKernelNodeAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:51.2237284Z #define cudaMemAttachGlobal 0x01 2025-05-07T19:46:51.2237386Z #define cudaMemAttachHost 0x02 2025-05-07T19:46:51.2237485Z #define cudaMemAttachSingle 0x04 2025-05-07T19:46:51.2237587Z #define cudaNvSciSyncAttrSignal 0x1 2025-05-07T19:46:51.2237686Z #define cudaNvSciSyncAttrWait 0x2 2025-05-07T19:46:51.2237792Z #define cudaOccupancyDefault 0x00 2025-05-07T19:46:51.2237935Z #define cudaOccupancyDisableCachingOverride 0x01 2025-05-07T19:46:51.2238036Z #define cudaPeerAccessDefault 0x00 2025-05-07T19:46:51.2238449Z #define cudaSignalExternalSemaphoresAsync __CUDART_API_PTSZ(cudaSignalExternalSemaphoresAsync_v2) 2025-05-07T19:46:51.2238580Z #define cudaStreamAttrID cudaLaunchAttributeID 2025-05-07T19:46:51.2238728Z #define cudaStreamAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:51.2239040Z #define cudaStreamAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:51.2239296Z #define cudaStreamAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:51.2239580Z #define cudaStreamAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 2025-05-07T19:46:51.2239779Z #define cudaStreamAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:51.2240126Z #define cudaStreamAttributeSynchronizationPolicy cudaLaunchAttributeSynchronizationPolicy 2025-05-07T19:46:51.2240222Z #define cudaStreamDefault 0x00 2025-05-07T19:46:51.2240357Z #define cudaStreamFireAndForget ((cudaStream_t)0x4) 2025-05-07T19:46:51.2240630Z #define cudaStreamGetCaptureInfo __CUDART_API_PTSZ(cudaStreamGetCaptureInfo_v2) 2025-05-07T19:46:51.2240841Z #define cudaStreamGraphFireAndForget (cudaStream_t)0x0200000000000000 2025-05-07T19:46:51.2241098Z #define cudaStreamGraphFireAndForgetAsSibling (cudaStream_t)0x0300000000000000 2025-05-07T19:46:51.2241302Z #define cudaStreamGraphTailLaunch (cudaStream_t)0x0100000000000000 2025-05-07T19:46:51.2241417Z #define cudaStreamLegacy ((cudaStream_t)0x1) 2025-05-07T19:46:51.2241522Z #define cudaStreamNonBlocking 0x01 2025-05-07T19:46:51.2241648Z #define cudaStreamPerThread ((cudaStream_t)0x2) 2025-05-07T19:46:51.2241778Z #define cudaStreamTailLaunch ((cudaStream_t)0x3) 2025-05-07T19:46:51.2241874Z #define cudaSurfaceType1D 0x01 2025-05-07T19:46:51.2241980Z #define cudaSurfaceType1DLayered 0xF1 2025-05-07T19:46:51.2242082Z #define cudaSurfaceType2D 0x02 2025-05-07T19:46:51.2242186Z #define cudaSurfaceType2DLayered 0xF2 2025-05-07T19:46:51.2242282Z #define cudaSurfaceType3D 0x03 2025-05-07T19:46:51.2242392Z #define cudaSurfaceTypeCubemap 0x0C 2025-05-07T19:46:51.2242514Z #define cudaSurfaceTypeCubemapLayered 0xFC 2025-05-07T19:46:51.2242610Z #define cudaTextureType1D 0x01 2025-05-07T19:46:51.2242713Z #define cudaTextureType1DLayered 0xF1 2025-05-07T19:46:51.2242813Z #define cudaTextureType2D 0x02 2025-05-07T19:46:51.2242914Z #define cudaTextureType2DLayered 0xF2 2025-05-07T19:46:51.2243005Z #define cudaTextureType3D 0x03 2025-05-07T19:46:51.2243116Z #define cudaTextureTypeCubemap 0x0C 2025-05-07T19:46:51.2243286Z #define cudaTextureTypeCubemapLayered 0xFC 2025-05-07T19:46:51.2243614Z #define cudaWaitExternalSemaphoresAsync __CUDART_API_PTSZ(cudaWaitExternalSemaphoresAsync_v2) 2025-05-07T19:46:51.2243706Z #define getc(_fp) _IO_getc (_fp) 2025-05-07T19:46:51.2243801Z #define htobe16(x) __bswap_16 (x) 2025-05-07T19:46:51.2243887Z #define htobe32(x) __bswap_32 (x) 2025-05-07T19:46:51.2243972Z #define htobe64(x) __bswap_64 (x) 2025-05-07T19:46:51.2244056Z #define htole16(x) (x) 2025-05-07T19:46:51.2244132Z #define htole32(x) (x) 2025-05-07T19:46:51.2244212Z #define htole64(x) (x) 2025-05-07T19:46:51.2244324Z #define isalnum_l(c,l) __isalnum_l ((c), (l)) 2025-05-07T19:46:51.2244440Z #define isalpha_l(c,l) __isalpha_l ((c), (l)) 2025-05-07T19:46:51.2244526Z #define isascii(c) __isascii (c) 2025-05-07T19:46:51.2244637Z #define isascii_l(c,l) __isascii_l ((c), (l)) 2025-05-07T19:46:51.2244759Z #define isblank_l(c,l) __isblank_l ((c), (l)) 2025-05-07T19:46:51.2244867Z #define iscntrl_l(c,l) __iscntrl_l ((c), (l)) 2025-05-07T19:46:51.2244980Z #define isdigit_l(c,l) __isdigit_l ((c), (l)) 2025-05-07T19:46:51.2245090Z #define isgraph_l(c,l) __isgraph_l ((c), (l)) 2025-05-07T19:46:51.2245212Z #define islower_l(c,l) __islower_l ((c), (l)) 2025-05-07T19:46:51.2245322Z #define isprint_l(c,l) __isprint_l ((c), (l)) 2025-05-07T19:46:51.2245433Z #define ispunct_l(c,l) __ispunct_l ((c), (l)) 2025-05-07T19:46:51.2245664Z #define isspace_l(c,l) __isspace_l ((c), (l)) 2025-05-07T19:46:51.2245772Z #define isupper_l(c,l) __isupper_l ((c), (l)) 2025-05-07T19:46:51.2245939Z #define isxdigit_l(c,l) __isxdigit_l ((c), (l)) 2025-05-07T19:46:51.2246039Z #define le16toh(x) (x) 2025-05-07T19:46:51.2246120Z #define le32toh(x) (x) 2025-05-07T19:46:51.2246202Z #define le64toh(x) (x) 2025-05-07T19:46:51.2246281Z #define linux 1 2025-05-07T19:46:51.2246395Z #define major(dev) gnu_dev_major (dev) 2025-05-07T19:46:51.2246519Z #define makedev(maj,min) gnu_dev_makedev (maj, min) 2025-05-07T19:46:51.2246658Z #define math_errhandling (MATH_ERRNO | MATH_ERREXCEPT) 2025-05-07T19:46:51.2246773Z #define minor(dev) gnu_dev_minor (dev) 2025-05-07T19:46:51.2246888Z #define offsetof(t,d) __builtin_offsetof(t, d) 2025-05-07T19:46:51.2246995Z #define putc(_ch,_fp) _IO_putc (_ch, _fp) 2025-05-07T19:46:51.2247083Z #define stderr stderr 2025-05-07T19:46:51.2247180Z #define stdin stdin 2025-05-07T19:46:51.2247264Z #define stdout stdout 2025-05-07T19:46:51.2247735Z #define strdupa(s) (__extension__ ({ const char *__old = (s); size_t __len = strlen (__old) + 1; char *__new = (char *) __builtin_alloca (__len); (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:51.2248261Z #define strndupa(s,n) (__extension__ ({ const char *__old = (s); size_t __len = strnlen (__old, (n)); char *__new = (char *) __builtin_alloca (__len + 1); __new[__len] = '\0'; (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:51.2248354Z #define toascii(c) __toascii (c) 2025-05-07T19:46:51.2248462Z #define toascii_l(c,l) __toascii_l ((c), (l)) 2025-05-07T19:46:51.2248556Z #define unix 1 2025-05-07T19:46:51.2248680Z #define w_coredump __wait_terminated.__w_coredump 2025-05-07T19:46:51.2248799Z #define w_retcode __wait_terminated.__w_retcode 2025-05-07T19:46:51.2248909Z #define w_stopsig __wait_stopped.__w_stopsig 2025-05-07T19:46:51.2249028Z #define w_stopval __wait_stopped.__w_stopval 2025-05-07T19:46:51.2249143Z #define w_termsig __wait_terminated.__w_termsig 2025-05-07T19:46:51.2249150Z 2025-05-07T19:46:51.2347504Z 2025-05-07T19:46:51.2348041Z + conda run -n build_binary nvcc --version 2025-05-07T19:46:51.2348061Z 2025-05-07T19:46:52.8159023Z nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:46:52.8160033Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:46:52.8160615Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:46:52.8160937Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:46:52.8161292Z Build cuda_12.6.r12.6/compiler.35059454_0 2025-05-07T19:46:52.8161503Z 2025-05-07T19:46:52.8728496Z 2025-05-07T19:46:52.8737705Z which: no nvidia-smi in (CONDA=/github/home/miniconda:/github/home/miniconda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:46:52.8740071Z [CHECK] nvidia-smi not found 2025-05-07T19:46:52.8741032Z [INSTALL] Successfully installed CUDA 12.6.3 2025-05-07T19:46:52.8830304Z ##[group]Run . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:52.8830953Z . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:52.8831607Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:46:52.8831968Z env: 2025-05-07T19:46:52.8832232Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:46:52.8832553Z BUILD_ENV: build_binary 2025-05-07T19:46:52.8832837Z BUILD_TARGET: default 2025-05-07T19:46:52.8833080Z BUILD_VARIANT: cuda 2025-05-07T19:46:52.8833350Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:46:52.8833622Z ##[endgroup] 2025-05-07T19:46:53.3076379Z ################################################################################ 2025-05-07T19:46:53.3077513Z # Install PyTorch (PIP) 2025-05-07T19:46:53.3078201Z # 2025-05-07T19:46:53.3092054Z # [2025-05-07T19:46:53.308Z] + install_pytorch_pip build_binary nightly cuda/12.6.3 2025-05-07T19:46:53.3093384Z ################################################################################ 2025-05-07T19:46:53.3093622Z 2025-05-07T19:46:53.3119567Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y numpy 2025-05-07T19:46:54.2198518Z Channels: 2025-05-07T19:46:54.2199172Z - conda-forge 2025-05-07T19:46:54.2199839Z Platform: linux-64 2025-05-07T19:46:57.3096644Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:46:58.9944107Z Solving environment: \ | / - done 2025-05-07T19:46:59.3053357Z 2025-05-07T19:46:59.3053770Z ## Package Plan ## 2025-05-07T19:46:59.3054225Z 2025-05-07T19:46:59.3054815Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:46:59.3055755Z 2025-05-07T19:46:59.3056066Z added / updated specs: 2025-05-07T19:46:59.3056775Z - numpy 2025-05-07T19:46:59.3057108Z 2025-05-07T19:46:59.3057120Z 2025-05-07T19:46:59.3057461Z The following packages will be downloaded: 2025-05-07T19:46:59.3058130Z 2025-05-07T19:46:59.3058473Z package | build 2025-05-07T19:46:59.3059398Z ---------------------------|----------------- 2025-05-07T19:46:59.3060780Z libblas-3.9.0 |31_h59b9bed_openblas 16 KB conda-forge 2025-05-07T19:46:59.3061502Z libcblas-3.9.0 |31_he106b2a_openblas 16 KB conda-forge 2025-05-07T19:46:59.3061983Z liblapack-3.9.0 |31_h7ac8fdf_openblas 16 KB conda-forge 2025-05-07T19:46:59.3062460Z numpy-2.2.5 | py310hefbff90_0 7.6 MB conda-forge 2025-05-07T19:46:59.3062869Z ------------------------------------------------------------ 2025-05-07T19:46:59.3063244Z Total: 7.6 MB 2025-05-07T19:46:59.3063475Z 2025-05-07T19:46:59.3063608Z The following NEW packages will be INSTALLED: 2025-05-07T19:46:59.3063880Z 2025-05-07T19:46:59.3064117Z libblas conda-forge/linux-64::libblas-3.9.0-31_h59b9bed_openblas 2025-05-07T19:46:59.3064683Z libcblas conda-forge/linux-64::libcblas-3.9.0-31_he106b2a_openblas 2025-05-07T19:46:59.3065234Z liblapack conda-forge/linux-64::liblapack-3.9.0-31_h7ac8fdf_openblas 2025-05-07T19:46:59.3065766Z numpy conda-forge/linux-64::numpy-2.2.5-py310hefbff90_0 2025-05-07T19:46:59.3066052Z 2025-05-07T19:46:59.3066056Z 2025-05-07T19:46:59.3066060Z 2025-05-07T19:46:59.3066336Z Downloading and Extracting Packages: ...working... 2025-05-07T19:46:59.3066692Z numpy-2.2.5 | 7.6 MB | | 0% 2025-05-07T19:46:59.3066930Z 2025-05-07T19:46:59.3067221Z libblas-3.9.0 | 16 KB | | 0%  2025-05-07T19:46:59.3067449Z 2025-05-07T19:46:59.3067452Z 2025-05-07T19:46:59.3070033Z libcblas-3.9.0 | 16 KB | | 0%  2025-05-07T19:46:59.3070819Z 2025-05-07T19:46:59.3070831Z 2025-05-07T19:46:59.3070841Z 2025-05-07T19:46:59.6721238Z liblapack-3.9.0 | 16 KB | | 0%  2025-05-07T19:46:59.6721569Z 2025-05-07T19:46:59.6721924Z 2025-05-07T19:46:59.6722427Z libcblas-3.9.0 | 16 KB | #########7 | 98%  2025-05-07T19:46:59.6722691Z 2025-05-07T19:46:59.6726177Z 2025-05-07T19:46:59.6783216Z libcblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:59.6816385Z numpy-2.2.5 | 7.6 MB | | 0% 2025-05-07T19:46:59.6816758Z 2025-05-07T19:46:59.6816854Z 2025-05-07T19:46:59.6816868Z 2025-05-07T19:46:59.6830289Z liblapack-3.9.0 | 16 KB | #########7 | 98%  2025-05-07T19:46:59.6830601Z 2025-05-07T19:46:59.6830605Z 2025-05-07T19:46:59.6830609Z 2025-05-07T19:46:59.7088635Z liblapack-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:59.7089495Z 2025-05-07T19:46:59.7089548Z 2025-05-07T19:46:59.7238972Z libcblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:59.7239792Z 2025-05-07T19:46:59.7239806Z 2025-05-07T19:46:59.7239817Z 2025-05-07T19:46:59.7302893Z liblapack-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:59.7303215Z 2025-05-07T19:46:59.7305211Z libblas-3.9.0 | 16 KB | #########7 | 97%  2025-05-07T19:46:59.7305483Z 2025-05-07T19:46:59.7570911Z libblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:59.7571195Z 2025-05-07T19:46:59.7798634Z libblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:59.8186351Z numpy-2.2.5 | 7.6 MB | ######3 | 64% 2025-05-07T19:47:00.1780922Z numpy-2.2.5 | 7.6 MB | ########## | 100% 2025-05-07T19:47:00.1786632Z numpy-2.2.5 | 7.6 MB | ########## | 100% 2025-05-07T19:47:00.1787631Z 2025-05-07T19:47:00.1788229Z 2025-05-07T19:47:00.1788924Z  2025-05-07T19:47:00.1789559Z 2025-05-07T19:47:00.1789572Z 2025-05-07T19:47:00.1790078Z  2025-05-07T19:47:00.1790700Z 2025-05-07T19:47:00.1790711Z 2025-05-07T19:47:00.1790743Z 2025-05-07T19:47:00.1791272Z  done 2025-05-07T19:47:00.2800415Z Preparing transaction: | done 2025-05-07T19:47:00.4811058Z Verifying transaction: - \ done 2025-05-07T19:47:00.5822944Z Executing transaction: / done 2025-05-07T19:47:00.6894077Z ################################################################################ 2025-05-07T19:47:00.6895161Z # Install Package From PyTorch PIP: torch 2025-05-07T19:47:00.6896029Z # 2025-05-07T19:47:00.6910785Z # [2025-05-07T19:47:00.690Z] + install_from_pytorch_pip build_binary torch nightly cuda/12.6.3 2025-05-07T19:47:00.6912311Z ################################################################################ 2025-05-07T19:47:00.6913009Z 2025-05-07T19:47:00.6927048Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:47:00.7808928Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:47:00.7810052Z ################################################################################ 2025-05-07T19:47:00.7811080Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:47:00.7811884Z # 2025-05-07T19:47:00.7823085Z # [2025-05-07T19:47:00.781Z] + __prepare_pip_arguments torch nightly cuda/12.6.3 2025-05-07T19:47:00.7824364Z ################################################################################ 2025-05-07T19:47:00.7825040Z 2025-05-07T19:47:00.7846811Z [INSTALL] Extracted package (channel, version): (nightly, LATEST) 2025-05-07T19:47:00.7880682Z [INSTALL] Extracted package variant: cu126 2025-05-07T19:47:00.7898843Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:47:00.7900114Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:47:00.7905002Z [INSTALL] Extracted the full PIP package: --pre torch 2025-05-07T19:47:00.7919466Z [INSTALL] Attempting to install [torch, LATEST] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/cu126/ ... 2025-05-07T19:47:00.7942714Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:35.9306030Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:48:35.9307652Z 2025-05-07T19:48:35.9310741Z Looking in indexes: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:35.9311178Z Collecting torch 2025-05-07T19:48:35.9311872Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp310-cp310-manylinux_2_28_x86_64.whl.metadata (30 kB) 2025-05-07T19:48:35.9312678Z Collecting filelock (from torch) 2025-05-07T19:48:35.9313224Z Downloading https://download.pytorch.org/whl/nightly/filelock-3.16.1-py3-none-any.whl (16 kB) 2025-05-07T19:48:35.9314240Z Requirement already satisfied: typing-extensions>=4.10.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from torch) (4.13.2) 2025-05-07T19:48:35.9315000Z Collecting sympy>=1.13.3 (from torch) 2025-05-07T19:48:35.9315579Z Downloading https://download.pytorch.org/whl/nightly/sympy-1.13.3-py3-none-any.whl (6.2 MB) 2025-05-07T19:48:35.9316483Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.2/6.2 MB 37.3 MB/s eta 0:00:00 2025-05-07T19:48:35.9316860Z Collecting networkx (from torch) 2025-05-07T19:48:35.9317379Z Downloading https://download.pytorch.org/whl/nightly/networkx-3.4.2-py3-none-any.whl (1.7 MB) 2025-05-07T19:48:35.9318078Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.7/1.7 MB 11.7 MB/s eta 0:00:00 2025-05-07T19:48:35.9318814Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from torch) (3.1.6) 2025-05-07T19:48:35.9319527Z Collecting fsspec (from torch) 2025-05-07T19:48:35.9320047Z Downloading https://download.pytorch.org/whl/nightly/fsspec-2024.10.0-py3-none-any.whl (179 kB) 2025-05-07T19:48:35.9320666Z Collecting nvidia-cuda-nvrtc-cu12==12.6.77 (from torch) 2025-05-07T19:48:35.9321441Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_nvrtc_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (23.7 MB) 2025-05-07T19:48:35.9322464Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 23.7/23.7 MB 67.5 MB/s eta 0:00:00 2025-05-07T19:48:35.9322902Z Collecting nvidia-cuda-runtime-cu12==12.6.77 (from torch) 2025-05-07T19:48:35.9323676Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_runtime_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (897 kB) 2025-05-07T19:48:35.9324550Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 897.7/897.7 kB 5.9 MB/s eta 0:00:00 2025-05-07T19:48:35.9324973Z Collecting nvidia-cuda-cupti-cu12==12.6.80 (from torch) 2025-05-07T19:48:35.9325733Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_cupti_cu12-12.6.80-py3-none-manylinux2014_x86_64.whl (8.9 MB) 2025-05-07T19:48:35.9326574Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 8.9/8.9 MB 50.0 MB/s eta 0:00:00 2025-05-07T19:48:35.9326967Z Collecting nvidia-cudnn-cu12==9.5.1.17 (from torch) 2025-05-07T19:48:35.9327712Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cudnn_cu12-9.5.1.17-py3-none-manylinux_2_28_x86_64.whl (571.0 MB) 2025-05-07T19:48:35.9328538Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 571.0/571.0 MB 32.6 MB/s eta 0:00:00 2025-05-07T19:48:35.9328950Z Collecting nvidia-cublas-cu12==12.6.4.1 (from torch) 2025-05-07T19:48:35.9329790Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cublas_cu12-12.6.4.1-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (393.1 MB) 2025-05-07T19:48:35.9332988Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 393.1/393.1 MB 58.1 MB/s eta 0:00:00 2025-05-07T19:48:35.9333391Z Collecting nvidia-cufft-cu12==11.3.0.4 (from torch) 2025-05-07T19:48:35.9334443Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufft_cu12-11.3.0.4-py3-none-manylinux2014_x86_64.whl (200.2 MB) 2025-05-07T19:48:35.9335293Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 200.2/200.2 MB 75.0 MB/s eta 0:00:00 2025-05-07T19:48:35.9335709Z Collecting nvidia-curand-cu12==10.3.7.77 (from torch) 2025-05-07T19:48:35.9336441Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_curand_cu12-10.3.7.77-py3-none-manylinux2014_x86_64.whl (56.3 MB) 2025-05-07T19:48:35.9337284Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 56.3/56.3 MB 50.9 MB/s eta 0:00:00 2025-05-07T19:48:35.9337688Z Collecting nvidia-cusolver-cu12==11.7.1.2 (from torch) 2025-05-07T19:48:35.9338457Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusolver_cu12-11.7.1.2-py3-none-manylinux2014_x86_64.whl (158.2 MB) 2025-05-07T19:48:35.9339399Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 158.2/158.2 MB 66.7 MB/s eta 0:00:00 2025-05-07T19:48:35.9339823Z Collecting nvidia-cusparse-cu12==12.5.4.2 (from torch) 2025-05-07T19:48:35.9340598Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparse_cu12-12.5.4.2-py3-none-manylinux2014_x86_64.whl (216.6 MB) 2025-05-07T19:48:35.9341446Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 216.6/216.6 MB 62.0 MB/s eta 0:00:00 2025-05-07T19:48:35.9341868Z Collecting nvidia-cusparselt-cu12==0.6.3 (from torch) 2025-05-07T19:48:35.9342623Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparselt_cu12-0.6.3-py3-none-manylinux2014_x86_64.whl (156.8 MB) 2025-05-07T19:48:35.9343473Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 156.8/156.8 MB 61.7 MB/s eta 0:00:00 2025-05-07T19:48:35.9343868Z Collecting nvidia-nccl-cu12==2.26.2 (from torch) 2025-05-07T19:48:35.9344705Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (2.0 kB) 2025-05-07T19:48:35.9345549Z Collecting nvidia-nvtx-cu12==12.6.77 (from torch) 2025-05-07T19:48:35.9346255Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvtx_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (89 kB) 2025-05-07T19:48:35.9346988Z Collecting nvidia-nvjitlink-cu12==12.6.85 (from torch) 2025-05-07T19:48:35.9347840Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvjitlink_cu12-12.6.85-py3-none-manylinux2010_x86_64.manylinux_2_12_x86_64.whl (19.7 MB) 2025-05-07T19:48:35.9348767Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 19.7/19.7 MB 52.5 MB/s eta 0:00:00 2025-05-07T19:48:35.9349178Z Collecting nvidia-cufile-cu12==1.11.1.6 (from torch) 2025-05-07T19:48:35.9350028Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (1.5 kB) 2025-05-07T19:48:35.9351002Z Collecting pytorch-triton==3.3.0+git96316ce5 (from torch) 2025-05-07T19:48:35.9352210Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.6 kB) 2025-05-07T19:48:35.9353598Z Requirement already satisfied: setuptools>=40.8.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from pytorch-triton==3.3.0+git96316ce5->torch) (78.1.1) 2025-05-07T19:48:35.9354535Z Collecting mpmath<1.4,>=1.1.0 (from sympy>=1.13.3->torch) 2025-05-07T19:48:35.9355123Z Downloading https://download.pytorch.org/whl/nightly/mpmath-1.3.0-py3-none-any.whl (536 kB) 2025-05-07T19:48:35.9355814Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 8.9 MB/s eta 0:00:00 2025-05-07T19:48:35.9356620Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from jinja2->torch) (3.0.2) 2025-05-07T19:48:35.9357790Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp310-cp310-manylinux_2_28_x86_64.whl (825.5 MB) 2025-05-07T19:48:35.9358751Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 825.5/825.5 MB 23.7 MB/s eta 0:00:00 2025-05-07T19:48:35.9359574Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (1.1 MB) 2025-05-07T19:48:35.9360498Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.1/1.1 MB 7.8 MB/s eta 0:00:00 2025-05-07T19:48:35.9361319Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (201.3 MB) 2025-05-07T19:48:35.9362321Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 201.3/201.3 MB 105.0 MB/s eta 0:00:00 2025-05-07T19:48:35.9363162Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (153.4 MB) 2025-05-07T19:48:35.9364180Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 153.4/153.4 MB 132.5 MB/s eta 0:00:00 2025-05-07T19:48:35.9365875Z Installing collected packages: nvidia-cusparselt-cu12, mpmath, sympy, pytorch-triton, nvidia-nvtx-cu12, nvidia-nvjitlink-cu12, nvidia-nccl-cu12, nvidia-curand-cu12, nvidia-cufile-cu12, nvidia-cuda-runtime-cu12, nvidia-cuda-nvrtc-cu12, nvidia-cuda-cupti-cu12, nvidia-cublas-cu12, networkx, fsspec, filelock, nvidia-cusparse-cu12, nvidia-cufft-cu12, nvidia-cudnn-cu12, nvidia-cusolver-cu12, torch 2025-05-07T19:48:35.9367451Z 2025-05-07T19:48:35.9369556Z Successfully installed filelock-3.16.1 fsspec-2024.10.0 mpmath-1.3.0 networkx-3.4.2 nvidia-cublas-cu12-12.6.4.1 nvidia-cuda-cupti-cu12-12.6.80 nvidia-cuda-nvrtc-cu12-12.6.77 nvidia-cuda-runtime-cu12-12.6.77 nvidia-cudnn-cu12-9.5.1.17 nvidia-cufft-cu12-11.3.0.4 nvidia-cufile-cu12-1.11.1.6 nvidia-curand-cu12-10.3.7.77 nvidia-cusolver-cu12-11.7.1.2 nvidia-cusparse-cu12-12.5.4.2 nvidia-cusparselt-cu12-0.6.3 nvidia-nccl-cu12-2.26.2 nvidia-nvjitlink-cu12-12.6.85 nvidia-nvtx-cu12-12.6.77 pytorch-triton-3.3.0+git96316ce5 sympy-1.13.3 torch-2.8.0.dev20250507+cu126 2025-05-07T19:48:35.9371882Z 2025-05-07T19:48:37.8974702Z torch 2.8.0.dev20250507+cu126 2025-05-07T19:48:37.8975272Z [CHECK] The installed package [torch, nightly/LATEST] is the correct variant (cu126) 2025-05-07T19:48:41.0103201Z [CHECK] Python (sub-)package 'torch.distributed' found ... 2025-05-07T19:48:44.1057579Z [CHECK] NOTE: The installed version is: 2.8.0.dev20250507+cu126 2025-05-07T19:48:44.1058171Z [CHECK] NOTE: Checking _GLIBCXX_USE_CXX11_ABI ... 2025-05-07T19:48:47.0661659Z True 2025-05-07T19:48:47.0662286Z True 2025-05-07T19:48:47.0662637Z 2025-05-07T19:48:47.1228547Z [INSTALL] Successfully installed PyTorch through PyTorch PIP 2025-05-07T19:48:47.1316104Z ##[group]Run if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:47.1316777Z if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:47.1317657Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:47.1317999Z env: 2025-05-07T19:48:47.1318274Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:47.1318587Z BUILD_ENV: build_binary 2025-05-07T19:48:47.1318875Z BUILD_TARGET: default 2025-05-07T19:48:47.1319126Z BUILD_VARIANT: cuda 2025-05-07T19:48:47.1319389Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:47.1319647Z ##[endgroup] 2025-05-07T19:48:47.6037139Z /github/home/miniconda/bin/conda 2025-05-07T19:48:47.6037513Z ################################################################################ 2025-05-07T19:48:47.6055735Z # Collect PyTorch Environment Information (for Reporting Issues) 2025-05-07T19:48:47.6056148Z # 2025-05-07T19:48:47.6056533Z # [2025-05-07T19:48:47.604Z] + collect_pytorch_env_info build_binary 2025-05-07T19:48:47.6056948Z ################################################################################ 2025-05-07T19:48:47.6057213Z 2025-05-07T19:48:47.6069444Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:47.6946449Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:47.6953965Z [INFO] Downloading the PyTorch environment info collection script ... 2025-05-07T19:48:47.6954653Z + wget -q https://raw.githubusercontent.com/pytorch/pytorch/main/torch/utils/collect_env.py 2025-05-07T19:48:47.6955089Z 2025-05-07T19:48:47.7863741Z 2025-05-07T19:48:47.7864127Z [INFO] Collecting PyTorch environment info (will be needed for reporting issues to PyTorch) ... 2025-05-07T19:48:47.7888575Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary python collect_env.py 2025-05-07T19:48:53.2135582Z Collecting environment information... 2025-05-07T19:48:53.2136032Z PyTorch version: 2.8.0.dev20250507+cu126 2025-05-07T19:48:53.2136398Z Is debug build: False 2025-05-07T19:48:53.2136702Z CUDA used to build PyTorch: 12.6 2025-05-07T19:48:53.2137012Z ROCM used to build PyTorch: N/A 2025-05-07T19:48:53.2137258Z 2025-05-07T19:48:53.2137372Z OS: Amazon Linux 2023.7.20250428 (x86_64) 2025-05-07T19:48:53.2137713Z GCC version: Could not collect 2025-05-07T19:48:53.2138322Z Clang version: 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:48:53.2138956Z CMake version: version 4.0.2 2025-05-07T19:48:53.2139244Z Libc version: glibc-2.34 2025-05-07T19:48:53.2139438Z 2025-05-07T19:48:53.2139935Z Python version: 3.10.17 | packaged by conda-forge | (main, Apr 10 2025, 22:19:12) [GCC 13.3.0] (64-bit runtime) 2025-05-07T19:48:53.2140610Z Python platform: Linux-6.1.130-139.222.amzn2023.x86_64-x86_64-with-glibc2.34 2025-05-07T19:48:53.2141082Z Is CUDA available: False 2025-05-07T19:48:53.2141374Z CUDA runtime version: 12.6.85 2025-05-07T19:48:53.2141719Z CUDA_MODULE_LOADING set to: N/A 2025-05-07T19:48:53.2142068Z GPU models and configuration: Could not collect 2025-05-07T19:48:53.2142428Z Nvidia driver version: Could not collect 2025-05-07T19:48:53.2142769Z cuDNN version: Could not collect 2025-05-07T19:48:53.2143080Z HIP runtime version: N/A 2025-05-07T19:48:53.2143340Z MIOpen runtime version: N/A 2025-05-07T19:48:53.2143628Z Is XNNPACK available: True 2025-05-07T19:48:53.2143795Z 2025-05-07T19:48:53.2143879Z CPU: 2025-05-07T19:48:53.2144117Z Architecture: x86_64 2025-05-07T19:48:53.2144466Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:48:53.2144893Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:48:53.2145301Z Byte Order: Little Endian 2025-05-07T19:48:53.2145652Z CPU(s): 96 2025-05-07T19:48:53.2145984Z On-line CPU(s) list: 0-95 2025-05-07T19:48:53.2146429Z Vendor ID: GenuineIntel 2025-05-07T19:48:53.2147252Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:48:53.2147650Z CPU family: 6 2025-05-07T19:48:53.2148110Z Model: 85 2025-05-07T19:48:53.2148401Z Thread(s) per core: 2 2025-05-07T19:48:53.2148715Z Core(s) per socket: 24 2025-05-07T19:48:53.2149007Z Socket(s): 2 2025-05-07T19:48:53.2149307Z Stepping: 7 2025-05-07T19:48:53.2149649Z BogoMIPS: 5999.99 2025-05-07T19:48:53.2152080Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:48:53.2154299Z Hypervisor vendor: KVM 2025-05-07T19:48:53.2154613Z Virtualization type: full 2025-05-07T19:48:53.2154963Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:48:53.2155331Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:48:53.2155707Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:48:53.2156069Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:48:53.2156414Z NUMA node(s): 2 2025-05-07T19:48:53.2156738Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:48:53.2157067Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:48:53.2157541Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:48:53.2158078Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:48:53.2158571Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:48:53.2159153Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:53.2159731Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:48:53.2160339Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:53.2160918Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:48:53.2161301Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:48:53.2161664Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:48:53.2162051Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:48:53.2162583Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:48:53.2163397Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:48:53.2164035Z Vulnerability Srbds: Not affected 2025-05-07T19:48:53.2164393Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:48:53.2164631Z 2025-05-07T19:48:53.2164754Z Versions of relevant libraries: 2025-05-07T19:48:53.2165005Z [pip3] numpy==2.2.5 2025-05-07T19:48:53.2165256Z [pip3] nvidia-cublas-cu12==12.6.4.1 2025-05-07T19:48:53.2165548Z [pip3] nvidia-cuda-cupti-cu12==12.6.80 2025-05-07T19:48:53.2165865Z [pip3] nvidia-cuda-nvrtc-cu12==12.6.77 2025-05-07T19:48:53.2166185Z [pip3] nvidia-cuda-runtime-cu12==12.6.77 2025-05-07T19:48:53.2166483Z [pip3] nvidia-cudnn-cu12==9.5.1.17 2025-05-07T19:48:53.2166773Z [pip3] nvidia-cufft-cu12==11.3.0.4 2025-05-07T19:48:53.2167048Z [pip3] nvidia-curand-cu12==10.3.7.77 2025-05-07T19:48:53.2167354Z [pip3] nvidia-cusolver-cu12==11.7.1.2 2025-05-07T19:48:53.2167791Z [pip3] nvidia-cusparse-cu12==12.5.4.2 2025-05-07T19:48:53.2168108Z [pip3] nvidia-cusparselt-cu12==0.6.3 2025-05-07T19:48:53.2168394Z [pip3] nvidia-nccl-cu12==2.26.2 2025-05-07T19:48:53.2168765Z [pip3] nvidia-nvjitlink-cu12==12.6.85 2025-05-07T19:48:53.2169057Z [pip3] nvidia-nvtx-cu12==12.6.77 2025-05-07T19:48:53.2169359Z [pip3] pytorch-triton==3.3.0+git96316ce5 2025-05-07T19:48:53.2169674Z [pip3] torch==2.8.0.dev20250507+cu126 2025-05-07T19:48:53.2170030Z [conda] cuda-cudart 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:53.2170527Z [conda] cuda-cudart-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:53.2171033Z [conda] cuda-cudart-dev_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:53.2171557Z [conda] cuda-cudart-static 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:53.2172084Z [conda] cuda-cudart-static_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:53.2172632Z [conda] cuda-cudart_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:53.2173117Z [conda] cuda-cupti 12.6.80 hbd13f7d_0 conda-forge 2025-05-07T19:48:53.2173579Z [conda] cuda-cupti-dev 12.6.80 h5888daf_0 conda-forge 2025-05-07T19:48:53.2174068Z [conda] cuda-libraries 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:53.2174554Z [conda] cuda-libraries-dev 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:53.2175041Z [conda] cuda-nvrtc 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:53.2175676Z [conda] cuda-nvrtc-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:53.2176170Z [conda] cuda-nvtx 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:53.2176655Z [conda] cuda-opencl 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:53.2177144Z [conda] cuda-opencl-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:53.2177653Z [conda] cuda-runtime 12.6.3 ha804496_0 conda-forge 2025-05-07T19:48:53.2178125Z [conda] libcublas 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:53.2178620Z [conda] libcublas-dev 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:53.2179109Z [conda] libcufft 11.3.0.4 hbd13f7d_0 conda-forge 2025-05-07T19:48:53.2179694Z [conda] libcufft-dev 11.3.0.4 h5888daf_0 conda-forge 2025-05-07T19:48:53.2180379Z [conda] libcurand 10.3.7.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:53.2180876Z [conda] libcurand-dev 10.3.7.77 h5888daf_0 conda-forge 2025-05-07T19:48:53.2181397Z [conda] libcusolver 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:53.2181912Z [conda] libcusolver-dev 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:53.2182440Z [conda] libcusparse 12.5.4.2 hbd13f7d_0 conda-forge 2025-05-07T19:48:53.2182970Z [conda] libcusparse-dev 12.5.4.2 h5888daf_0 conda-forge 2025-05-07T19:48:53.2183488Z [conda] libnvjitlink 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:53.2184019Z [conda] libnvjitlink-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:53.2184508Z [conda] numpy 2.2.5 py310hefbff90_0 conda-forge 2025-05-07T19:48:53.2185014Z [conda] nvidia-cublas-cu12 12.6.4.1 pypi_0 pypi 2025-05-07T19:48:53.2185554Z [conda] nvidia-cuda-cupti-cu12 12.6.80 pypi_0 pypi 2025-05-07T19:48:53.2186197Z [conda] nvidia-cuda-nvrtc-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:53.2186704Z [conda] nvidia-cuda-runtime-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:53.2187284Z [conda] nvidia-cudnn-cu12 9.5.1.17 pypi_0 pypi 2025-05-07T19:48:53.2187767Z [conda] nvidia-cufft-cu12 11.3.0.4 pypi_0 pypi 2025-05-07T19:48:53.2188302Z [conda] nvidia-curand-cu12 10.3.7.77 pypi_0 pypi 2025-05-07T19:48:53.2188794Z [conda] nvidia-cusolver-cu12 11.7.1.2 pypi_0 pypi 2025-05-07T19:48:53.2189291Z [conda] nvidia-cusparse-cu12 12.5.4.2 pypi_0 pypi 2025-05-07T19:48:53.2189775Z [conda] nvidia-cusparselt-cu12 0.6.3 pypi_0 pypi 2025-05-07T19:48:53.2190267Z [conda] nvidia-nccl-cu12 2.26.2 pypi_0 pypi 2025-05-07T19:48:53.2190734Z [conda] nvidia-nvjitlink-cu12 12.6.85 pypi_0 pypi 2025-05-07T19:48:53.2191221Z [conda] nvidia-nvtx-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:53.2191687Z [conda] pytorch-triton 3.3.0+git96316ce5 pypi_0 pypi 2025-05-07T19:48:53.2192158Z [conda] torch 2.8.0.dev20250507+cu126 pypi_0 pypi 2025-05-07T19:48:53.2192429Z 2025-05-07T19:48:53.3142698Z ##[group]Run . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:53.3143394Z . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:53.3144001Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:53.3144344Z env: 2025-05-07T19:48:53.3144584Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:53.3144895Z BUILD_ENV: build_binary 2025-05-07T19:48:53.3145152Z BUILD_TARGET: default 2025-05-07T19:48:53.3145382Z BUILD_VARIANT: cuda 2025-05-07T19:48:53.3145624Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:53.3145869Z ##[endgroup] 2025-05-07T19:48:53.8326614Z ################################################################################ 2025-05-07T19:48:53.8327015Z # Install cuDNN 2025-05-07T19:48:53.8327291Z # 2025-05-07T19:48:53.8341515Z # [2025-05-07T19:48:53.833Z] + install_cudnn build_binary /__w/FBGEMM/FBGEMM/build_only/cudnn 12.6.3 2025-05-07T19:48:53.8342184Z ################################################################################ 2025-05-07T19:48:53.8342445Z 2025-05-07T19:48:53.8365871Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:53.9262199Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:53.9263432Z [INSTALL] cuda_concat_version is determined to be: 126 2025-05-07T19:48:53.9264579Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:53.9265237Z 2025-05-07T19:48:53.9281035Z 2025-05-07T19:48:53.9281857Z + mkdir -p /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:53.9282634Z 2025-05-07T19:48:53.9301343Z 2025-05-07T19:48:53.9324844Z [INSTALL] Downloading cuDNN to /tmp/tmp.I5537bXVbn ... 2025-05-07T19:48:53.9345730Z [EXEC] [ATTEMPT 0/3] + wget -q https://developer.download.nvidia.com/compute/cudnn/redist/cudnn/linux-x86_64/cudnn-linux-x86_64-9.5.1.17_cuda12-archive.tar.xz -O cudnn.tar.xz 2025-05-07T19:49:01.1301234Z [INSTALL] Unpacking cuDNN ... 2025-05-07T19:49:01.1301733Z + tar -xvf cudnn.tar.xz 2025-05-07T19:49:01.1301920Z 2025-05-07T19:49:01.1332715Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/ 2025-05-07T19:49:01.1333280Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/ 2025-05-07T19:49:01.1334240Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static_v9.a 2025-05-07T19:49:05.7988299Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static_v9.a 2025-05-07T19:49:05.8622404Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static_v9.a 2025-05-07T19:49:13.3965134Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static_v9.a 2025-05-07T19:49:13.6397603Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static_v9.a 2025-05-07T19:49:13.6772213Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static_v9.a 2025-05-07T19:49:14.2126530Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static_v9.a 2025-05-07T19:49:16.2987645Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static.a 2025-05-07T19:49:16.2988245Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static.a 2025-05-07T19:49:16.2989204Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static.a 2025-05-07T19:49:16.2989875Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static.a 2025-05-07T19:49:16.2990523Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static.a 2025-05-07T19:49:16.2991079Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static.a 2025-05-07T19:49:16.2991644Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static.a 2025-05-07T19:49:16.2992171Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so 2025-05-07T19:49:16.2992634Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9 2025-05-07T19:49:16.2993146Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9.5.1 2025-05-07T19:49:16.2999535Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so 2025-05-07T19:49:16.3003415Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9 2025-05-07T19:49:16.3003937Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9.5.1 2025-05-07T19:49:20.8141260Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so 2025-05-07T19:49:20.8141828Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9.5.1 2025-05-07T19:49:20.8757549Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9 2025-05-07T19:49:20.8758212Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9.5.1 2025-05-07T19:49:28.1239477Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9 2025-05-07T19:49:28.1240187Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so 2025-05-07T19:49:28.1240803Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so 2025-05-07T19:49:28.1241476Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9.5.1 2025-05-07T19:49:28.3202530Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9 2025-05-07T19:49:28.3203177Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9 2025-05-07T19:49:28.3203685Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so 2025-05-07T19:49:28.3204187Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9.5.1 2025-05-07T19:49:28.3562384Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9.5.1 2025-05-07T19:49:28.8986730Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9 2025-05-07T19:49:28.8987293Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so 2025-05-07T19:49:28.8987809Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9 2025-05-07T19:49:28.8988299Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so 2025-05-07T19:49:28.8988830Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9.5.1 2025-05-07T19:49:31.0203332Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/ 2025-05-07T19:49:31.0203903Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_v9.h 2025-05-07T19:49:31.0204400Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv_v9.h 2025-05-07T19:49:31.0204896Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend_v9.h 2025-05-07T19:49:31.0205448Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn_v9.h 2025-05-07T19:49:31.0206003Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph_v9.h 2025-05-07T19:49:31.0206488Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops_v9.h 2025-05-07T19:49:31.0209333Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version_v9.h 2025-05-07T19:49:31.0209908Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn.h 2025-05-07T19:49:31.0210416Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv.h 2025-05-07T19:49:31.0210954Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend.h 2025-05-07T19:49:31.0211497Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn.h 2025-05-07T19:49:31.0212256Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph.h 2025-05-07T19:49:31.0212785Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops.h 2025-05-07T19:49:31.0213274Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version.h 2025-05-07T19:49:31.0213703Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/LICENSE 2025-05-07T19:49:31.0229117Z 2025-05-07T19:49:31.0229372Z [INSTALL] Moving cuDNN files to /__w/FBGEMM/FBGEMM/build_only/cudnn ... 2025-05-07T19:49:31.0229880Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:31.0230122Z 2025-05-07T19:49:31.0251203Z 2025-05-07T19:49:31.0251587Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:31.0251845Z 2025-05-07T19:49:31.0272070Z 2025-05-07T19:49:31.0273180Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:31.0274344Z 2025-05-07T19:49:31.0307718Z 2025-05-07T19:49:31.0309291Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:31.0310483Z 2025-05-07T19:49:32.1511569Z 2025-05-07T19:49:32.1512449Z /__w/FBGEMM/FBGEMM 2025-05-07T19:49:32.1513202Z + rm -rf /tmp/tmp.I5537bXVbn 2025-05-07T19:49:32.1513720Z 2025-05-07T19:49:32.1982609Z 2025-05-07T19:49:32.1987677Z [INSTALL] Set environment variables CUDNN_INCLUDE_DIR and CUDNN_LIBRARY ... 2025-05-07T19:49:32.1988902Z + conda env config vars set -n build_binary CUDNN_INCLUDE_DIR=/__w/FBGEMM/FBGEMM/build_only/cudnn/include CUDNN_LIBRARY=/__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:32.1989594Z 2025-05-07T19:49:32.6046745Z 2025-05-07T19:49:32.6047124Z [INSTALL] Successfully installed cuDNN (for CUDA 12.6.3) 2025-05-07T19:49:32.6113792Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:32.6114412Z . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:32.6115148Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:32.6115491Z env: 2025-05-07T19:49:32.6115731Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:32.6116051Z BUILD_ENV: build_binary 2025-05-07T19:49:32.6116295Z BUILD_TARGET: default 2025-05-07T19:49:32.6116547Z BUILD_VARIANT: cuda 2025-05-07T19:49:32.6116783Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:32.6117048Z ##[endgroup] 2025-05-07T19:49:33.0605239Z ################################################################################ 2025-05-07T19:49:33.0621359Z # Prepare FBGEMM-GPU Build 2025-05-07T19:49:33.0621676Z # 2025-05-07T19:49:33.0622172Z # [2025-05-07T19:49:33.061Z] + prepare_fbgemm_gpu_build build_binary 2025-05-07T19:49:33.0622681Z ################################################################################ 2025-05-07T19:49:33.0622911Z 2025-05-07T19:49:33.0646098Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:33.1463892Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:33.1481582Z [BUILD] Running git submodules update ... 2025-05-07T19:49:33.1502267Z [EXEC] [ATTEMPT 0/3] + git submodule sync 2025-05-07T19:49:33.1802671Z Synchronizing submodule url for '../external/asmjit' 2025-05-07T19:49:33.1804130Z Synchronizing submodule url for '../external/composable_kernel' 2025-05-07T19:49:33.1804890Z Synchronizing submodule url for '../external/cpuinfo' 2025-05-07T19:49:33.1805317Z Synchronizing submodule url for '../external/cutlass' 2025-05-07T19:49:33.1805738Z Synchronizing submodule url for '../external/googletest' 2025-05-07T19:49:33.1806217Z Synchronizing submodule url for '../external/hipify_torch' 2025-05-07T19:49:33.1806634Z Synchronizing submodule url for '../external/json' 2025-05-07T19:49:33.1834986Z [EXEC] [ATTEMPT 0/3] + git submodule update --init --recursive 2025-05-07T19:49:33.2266324Z [BUILD] Installing other build dependencies ... 2025-05-07T19:49:33.2287532Z [EXEC] [ATTEMPT 0/3] + conda run --no-capture-output -n build_binary python -m pip install -r requirements.txt 2025-05-07T19:49:35.0968999Z Collecting backports.tarfile (from -r requirements.txt (line 13)) 2025-05-07T19:49:35.1148656Z Downloading backports.tarfile-1.2.0-py3-none-any.whl.metadata (2.0 kB) 2025-05-07T19:49:35.1246113Z Requirement already satisfied: build in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 14)) (1.2.2.post1) 2025-05-07T19:49:35.2567322Z Collecting cmake (from -r requirements.txt (line 15)) 2025-05-07T19:49:35.2612025Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.3 kB) 2025-05-07T19:49:35.2685266Z Requirement already satisfied: click in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 16)) (8.1.8) 2025-05-07T19:49:35.2686612Z Requirement already satisfied: hypothesis in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 17)) (6.131.14) 2025-05-07T19:49:35.2688462Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 18)) (3.1.6) 2025-05-07T19:49:35.2691973Z Requirement already satisfied: mpmath==1.3.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 19)) (1.3.0) 2025-05-07T19:49:35.3003644Z Collecting ninja (from -r requirements.txt (line 20)) 2025-05-07T19:49:35.3051871Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl.metadata (5.0 kB) 2025-05-07T19:49:35.3122957Z Requirement already satisfied: numpy>=2.0.2 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 21)) (2.2.5) 2025-05-07T19:49:35.3268205Z Collecting pyre-extensions (from -r requirements.txt (line 22)) 2025-05-07T19:49:35.3318210Z Downloading pyre_extensions-0.0.32-py3-none-any.whl.metadata (4.0 kB) 2025-05-07T19:49:35.3383252Z Requirement already satisfied: pyyaml in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 23)) (6.0.2) 2025-05-07T19:49:35.3384826Z Requirement already satisfied: scikit-build in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 24)) (0.18.1) 2025-05-07T19:49:35.3388301Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 25)) (78.1.1) 2025-05-07T19:49:35.3607555Z Collecting setuptools_git_versioning (from -r requirements.txt (line 26)) 2025-05-07T19:49:35.3662607Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl.metadata (6.1 kB) 2025-05-07T19:49:35.3846708Z Collecting tabulate (from -r requirements.txt (line 27)) 2025-05-07T19:49:35.3878158Z Downloading tabulate-0.9.0-py3-none-any.whl.metadata (34 kB) 2025-05-07T19:49:35.4117936Z Collecting patchelf (from -r requirements.txt (line 28)) 2025-05-07T19:49:35.4154088Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl.metadata (3.5 kB) 2025-05-07T19:49:35.4299280Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from build->-r requirements.txt (line 14)) (25.0) 2025-05-07T19:49:35.4302706Z Requirement already satisfied: pyproject_hooks in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from build->-r requirements.txt (line 14)) (1.2.0) 2025-05-07T19:49:35.4305055Z Requirement already satisfied: tomli>=1.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from build->-r requirements.txt (line 14)) (2.2.1) 2025-05-07T19:49:35.4424392Z Requirement already satisfied: attrs>=22.2.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from hypothesis->-r requirements.txt (line 17)) (25.3.0) 2025-05-07T19:49:35.4426592Z Requirement already satisfied: exceptiongroup>=1.0.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from hypothesis->-r requirements.txt (line 17)) (1.2.2) 2025-05-07T19:49:35.4433625Z Requirement already satisfied: sortedcontainers<3.0.0,>=2.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from hypothesis->-r requirements.txt (line 17)) (2.4.0) 2025-05-07T19:49:35.4453924Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from jinja2->-r requirements.txt (line 18)) (3.0.2) 2025-05-07T19:49:35.4586736Z Collecting typing-inspect (from pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:35.4621017Z Downloading typing_inspect-0.9.0-py3-none-any.whl.metadata (1.5 kB) 2025-05-07T19:49:35.4687215Z Requirement already satisfied: typing-extensions in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from pyre-extensions->-r requirements.txt (line 22)) (4.13.2) 2025-05-07T19:49:35.4731468Z Requirement already satisfied: distro in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from scikit-build->-r requirements.txt (line 24)) (1.9.0) 2025-05-07T19:49:35.4738648Z Requirement already satisfied: wheel>=0.32.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from scikit-build->-r requirements.txt (line 24)) (0.45.1) 2025-05-07T19:49:35.5046283Z Collecting mypy-extensions>=0.3.0 (from typing-inspect->pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:35.5093208Z Downloading mypy_extensions-1.1.0-py3-none-any.whl.metadata (1.1 kB) 2025-05-07T19:49:35.5188319Z Downloading backports.tarfile-1.2.0-py3-none-any.whl (30 kB) 2025-05-07T19:49:35.5318686Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (27.9 MB) 2025-05-07T19:49:35.6531670Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 27.9/27.9 MB 233.8 MB/s eta 0:00:00 2025-05-07T19:49:35.6572589Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (422 kB) 2025-05-07T19:49:35.6664978Z Downloading pyre_extensions-0.0.32-py3-none-any.whl (12 kB) 2025-05-07T19:49:35.6844784Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl (10 kB) 2025-05-07T19:49:35.6948339Z Downloading tabulate-0.9.0-py3-none-any.whl (35 kB) 2025-05-07T19:49:35.7122502Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl (466 kB) 2025-05-07T19:49:35.7331940Z Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB) 2025-05-07T19:49:35.7398790Z Downloading mypy_extensions-1.1.0-py3-none-any.whl (5.0 kB) 2025-05-07T19:49:35.9181628Z Installing collected packages: tabulate, setuptools_git_versioning, patchelf, ninja, mypy-extensions, cmake, backports.tarfile, typing-inspect, pyre-extensions 2025-05-07T19:49:36.8524502Z 2025-05-07T19:49:36.8584578Z Successfully installed backports.tarfile-1.2.0 cmake-4.0.0 mypy-extensions-1.1.0 ninja-1.11.1.4 patchelf-0.17.2.2 pyre-extensions-0.0.32 setuptools_git_versioning-2.1.0 tabulate-0.9.0 typing-inspect-0.9.0 2025-05-07T19:49:36.8587139Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:36.9871195Z ################################################################################ 2025-05-07T19:49:36.9872296Z # Install PyTorch (PyTorch PIP) 2025-05-07T19:49:36.9873071Z # 2025-05-07T19:49:36.9890874Z # [2025-05-07T19:49:36.988Z] + install_triton_pip build_binary 2025-05-07T19:49:36.9892116Z ################################################################################ 2025-05-07T19:49:36.9892820Z 2025-05-07T19:49:36.9893499Z [BUILD] Installing pytorch-triton nightly/3.2.0+git4b3bb1f8 from PIP ... 2025-05-07T19:49:36.9894816Z ################################################################################ 2025-05-07T19:49:36.9895899Z # Install Package From PyTorch PIP: pytorch-triton 2025-05-07T19:49:36.9896610Z # 2025-05-07T19:49:36.9906515Z # [2025-05-07T19:49:36.990Z] + install_from_pytorch_pip build_binary pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:36.9908106Z ################################################################################ 2025-05-07T19:49:36.9908777Z 2025-05-07T19:49:36.9924301Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:37.0745699Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:37.0746546Z ################################################################################ 2025-05-07T19:49:37.0747084Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:49:37.0747519Z # 2025-05-07T19:49:37.0763391Z # [2025-05-07T19:49:37.075Z] + __prepare_pip_arguments pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:37.0763991Z ################################################################################ 2025-05-07T19:49:37.0764223Z 2025-05-07T19:49:37.0807940Z [INSTALL] Extracted package (channel, version): (nightly, 3.2.0+git4b3bb1f8) 2025-05-07T19:49:37.0818777Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:49:37.0820652Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:37.0825772Z [INSTALL] Extracted the full PIP package: --pre pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:37.0833204Z [INSTALL] Attempting to install [pytorch-triton, 3.2.0+git4b3bb1f8] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/ ... 2025-05-07T19:49:37.0855908Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre pytorch-triton==3.2.0+git4b3bb1f8 --index-url https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:42.4265280Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. 2025-05-07T19:49:42.4266264Z Looking in indexes: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:42.4266746Z Collecting pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:42.4267619Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.3 kB) 2025-05-07T19:49:42.4268981Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (166.5 MB) 2025-05-07T19:49:42.4270129Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 166.5/166.5 MB 170.3 MB/s eta 0:00:00 2025-05-07T19:49:42.4270648Z Installing collected packages: pytorch-triton 2025-05-07T19:49:42.4271135Z Attempting uninstall: pytorch-triton 2025-05-07T19:49:42.4271524Z Found existing installation: pytorch-triton 3.3.0+git96316ce5 2025-05-07T19:49:42.4271960Z Uninstalling pytorch-triton-3.3.0+git96316ce5: 2025-05-07T19:49:42.4272371Z Successfully uninstalled pytorch-triton-3.3.0+git96316ce5 2025-05-07T19:49:42.4272825Z Successfully installed pytorch-triton-3.2.0+git4b3bb1f8 2025-05-07T19:49:42.4273099Z 2025-05-07T19:49:42.4273779Z torch 2.8.0.dev20250507+cu126 requires pytorch-triton==3.3.0+git96316ce5; platform_system == "Linux" and platform_machine == "x86_64", but you have pytorch-triton 3.2.0+git4b3bb1f8 which is incompatible. 2025-05-07T19:49:42.4275739Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:42.4277074Z 2025-05-07T19:49:44.2873389Z [CHECK] Python (sub-)package 'triton' found ... 2025-05-07T19:49:44.2874903Z [CHECK] Printing out the pytorch-triton version ... 2025-05-07T19:49:46.0788793Z ################################################################################ 2025-05-07T19:49:46.0790048Z [CHECK] The installed VERSION of pytorch-triton is: 3.2.0 2025-05-07T19:49:46.0791627Z ################################################################################ 2025-05-07T19:49:46.0792314Z 2025-05-07T19:49:47.8168594Z [CHECK] Python (sub-)package 'numpy' found ... 2025-05-07T19:49:49.6052695Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:49:49.6053853Z [BUILD] Successfully ran git submodules update 2025-05-07T19:49:49.6135306Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:49.6136098Z . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:49.6136732Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:49.6137105Z env: 2025-05-07T19:49:49.6137353Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:49.6137697Z BUILD_ENV: build_binary 2025-05-07T19:49:49.6137953Z BUILD_TARGET: default 2025-05-07T19:49:49.6138219Z BUILD_VARIANT: cuda 2025-05-07T19:49:49.6138517Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:49.6138769Z ##[endgroup] 2025-05-07T19:49:50.0425301Z [BUILD] BUILD_TARGET_VARIANT: default/cuda 2025-05-07T19:49:50.0426359Z [BUILD] Extracted build target: default 2025-05-07T19:49:50.0427291Z [BUILD] Extracted build variant: cuda 2025-05-07T19:49:51.6204146Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:49:51.6204679Z 2025-05-07T19:49:51.6772606Z [CHECK] Binary cc found in PATH 2025-05-07T19:49:53.2432014Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:49:53.2432805Z 2025-05-07T19:49:53.2991531Z [CHECK] Binary gcc found in PATH 2025-05-07T19:49:54.8693041Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:49:54.8693833Z 2025-05-07T19:49:54.9261886Z [CHECK] Binary c++ found in PATH 2025-05-07T19:49:56.4966155Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:49:56.4966662Z 2025-05-07T19:49:56.5547821Z [CHECK] Binary g++ found in PATH 2025-05-07T19:49:58.1836853Z [BUILD] Extracted and set Python tag: py310 2025-05-07T19:49:58.1837871Z [BUILD] Extracted and set Python platform name: manylinux_2_28_x86_64 2025-05-07T19:49:58.2061684Z core = 24 2025-05-07T19:49:58.2279832Z sockets = 2 2025-05-07T19:49:58.2280701Z [BUILD] Set multicore run option for setup.py: -j 48 2025-05-07T19:49:58.2281283Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:49:58.2281565Z [BUILD] Running pre-build cleanups ... 2025-05-07T19:49:58.2281996Z + rm -rf dist 2025-05-07T19:49:58.2282118Z 2025-05-07T19:49:58.2296882Z 2025-05-07T19:49:58.2297803Z + conda run --no-capture-output -n build_binary python setup.py clean 2025-05-07T19:49:58.2298305Z 2025-05-07T19:50:01.1222983Z INFO:root:running clean 2025-05-07T19:50:01.1223328Z [SETUP.PY] ARGV: ['setup.py', 'clean'] 2025-05-07T19:50:01.1224392Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:50:01.1225464Z [SETUP.PY] Other arguments: ['clean'] 2025-05-07T19:50:01.1225972Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:01.1226826Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:01.1227372Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:01.1227925Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:01.1228304Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:50:01.1229477Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:50:01.4164823Z 2025-05-07T19:50:01.4165469Z [BUILD] Printing git status ... 2025-05-07T19:50:01.4166289Z + git status 2025-05-07T19:50:01.4167139Z 2025-05-07T19:50:02.0975015Z HEAD detached at pull/4066/merge 2025-05-07T19:50:02.0975899Z Untracked files: 2025-05-07T19:50:02.0976797Z (use "git add ..." to include in what will be committed) 2025-05-07T19:50:02.0977823Z ../build_only/ 2025-05-07T19:50:02.0978438Z ../collect_env.py 2025-05-07T19:50:02.0979088Z fbgemm_gpu/docs/version.py 2025-05-07T19:50:02.0979826Z 2025-05-07T19:50:02.0981046Z nothing added to commit but untracked files present (use "git add" to track) 2025-05-07T19:50:02.0981415Z 2025-05-07T19:50:02.0981526Z + git diff 2025-05-07T19:50:02.0981646Z 2025-05-07T19:50:02.1240844Z 2025-05-07T19:50:02.1241517Z ################################################################################ 2025-05-07T19:50:02.1242590Z # Configure FBGEMM-GPU Build 2025-05-07T19:50:02.1243326Z # 2025-05-07T19:50:02.1259692Z # [2025-05-07T19:50:02.125Z] + __configure_fbgemm_gpu_build 2025-05-07T19:50:02.1260895Z ################################################################################ 2025-05-07T19:50:02.1261149Z 2025-05-07T19:50:02.1262562Z [BUILD] Setting the build target: default ... 2025-05-07T19:50:02.1263053Z [BUILD] Configuring build as CUDA variant (this is the default behavior) ... 2025-05-07T19:50:03.6973911Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:50:03.6974735Z 2025-05-07T19:50:03.7530412Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:50:05.3249199Z /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:50:05.3806502Z 2025-05-07T19:50:05.3807484Z [CHECK] Environment variable CUDNN_INCLUDE_DIR is defined in the Conda environment 2025-05-07T19:50:06.9542606Z /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:50:06.9543308Z 2025-05-07T19:50:07.0109110Z [CHECK] Environment variable CUDNN_LIBRARY is defined in the Conda environment 2025-05-07T19:50:08.5839700Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:08.5840124Z 2025-05-07T19:50:08.6397427Z [CHECK] Environment variable NVML_LIB_PATH is defined in the Conda environment 2025-05-07T19:50:10.2949724Z [BUILD] Using the default architectures for CUDA nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:50:10.2950312Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:50:10.2950637Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:50:10.2950969Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:50:10.2951344Z Build cuda_12.6.r12.6/compiler.35059454_0 ... 2025-05-07T19:50:10.2951743Z [BUILD] Setting the following CUDA targets: 7.0;8.0;9.0;9.0a 2025-05-07T19:50:10.2952121Z [BUILD] Looking up NVML filepath ... 2025-05-07T19:50:11.9383587Z [BUILD] Looking up NCCL filepath ... 2025-05-07T19:50:15.2651067Z [BUILD] Setting NVCC verbose mode ... 2025-05-07T19:50:15.2652121Z + conda env config vars set -n build_binary NVCC_VERBOSE=1 2025-05-07T19:50:15.2652537Z 2025-05-07T19:50:15.6761739Z 2025-05-07T19:50:15.6762397Z [BUILD] Setting CUDA build args ... 2025-05-07T19:50:17.3620806Z [BUILD] Looking up CUDA version ... 2025-05-07T19:50:20.6292189Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:20.6292591Z 2025-05-07T19:50:22.2831211Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:22.2833883Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:50:22.2835271Z 2025-05-07T19:50:22.2835606Z [BUILD] Setting NVCC flags ... 2025-05-07T19:50:22.2837399Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-std=c++20 -Xcompiler -std=c++20 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler" 2025-05-07T19:50:22.2838175Z 2025-05-07T19:50:22.6900726Z 2025-05-07T19:50:22.6902096Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:50:22.6902925Z 2025-05-07T19:50:24.2585712Z -std=c++20 -Xcompiler -std=c++20 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler 2025-05-07T19:50:24.2586810Z 2025-05-07T19:50:24.3145116Z 2025-05-07T19:50:24.3145392Z [BUILD] Setting CUDA build args ... 2025-05-07T19:50:24.3146420Z + conda run -n build_binary c++ --version 2025-05-07T19:50:24.3146723Z 2025-05-07T19:50:25.9110375Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:25.9112579Z Target: x86_64-conda-linux-gnu 2025-05-07T19:50:25.9112878Z Thread model: posix 2025-05-07T19:50:25.9113180Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:50:25.9133338Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:50:25.9133895Z 2025-05-07T19:50:25.9669701Z 2025-05-07T19:50:25.9670473Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:25.9671304Z 2025-05-07T19:50:27.6223845Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:27.6224802Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:50:27.6225276Z 2025-05-07T19:50:27.6225478Z [BUILD] Clang is available; configuring for Clang-based build ... 2025-05-07T19:50:29.2582937Z .github/scripts/fbgemm_gpu_build.bash: line 370: [: : integer expression expected 2025-05-07T19:50:29.2584515Z [BUILD] Enabling debug features in the build ... 2025-05-07T19:50:29.2587874Z [BUILD] FBGEMM_GPU build arguments have been set: --verbose --build-target=default --build-variant=cuda --nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 -DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 --cxxprefix=/github/home/miniconda/envs/build_binary --debug 2025-05-07T19:50:29.2590435Z ################################################################################ 2025-05-07T19:50:29.2590772Z # Build FBGEMM-GPU Package (Wheel) 2025-05-07T19:50:29.2591064Z # 2025-05-07T19:50:29.2610726Z # [2025-05-07T19:50:29.260Z] + build_fbgemm_gpu_package build_binary nightly default/cuda 2025-05-07T19:50:29.2611684Z ################################################################################ 2025-05-07T19:50:29.2611984Z 2025-05-07T19:50:29.2612186Z [BUILD] Building FBGEMM wheel (TARGET=default, VARIANT=cuda) ... 2025-05-07T19:50:29.2617271Z + conda run --no-capture-output -n build_binary python -m build --wheel --no-isolation --config-setting=--build-option=--verbose --config-setting=--build-option=--build-target=default --config-setting=--build-option=--build-variant=cuda --config-setting=--build-option=--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --config-setting=--build-option=--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 --config-setting=--build-option=-DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' --config-setting=--build-option=-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCMAKE_CXX_STANDARD=20 --config-setting=--build-option=--cxxprefix=/github/home/miniconda/envs/build_binary --config-setting=--build-option=--debug --config-setting=--build-option=--package_channel=nightly --config-setting=--build-option=--python-tag=py310 --config-setting=--build-option=--plat-name=manylinux_2_28_x86_64 2025-05-07T19:50:29.2622706Z 2025-05-07T19:50:30.8977054Z * Getting build dependencies for wheel... 2025-05-07T19:50:32.2481002Z INFO:root:running egg_info 2025-05-07T19:50:32.2503946Z INFO:root:creating fbgemm_gpu_nightly.egg-info 2025-05-07T19:50:32.2505097Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T19:50:32.2508035Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T19:50:32.2509919Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T19:50:32.2510842Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T19:50:32.2512090Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:32.2573032Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:32.2584977Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:32.2587901Z [SETUP.PY] ARGV: ['setup.py', 'egg_info'] 2025-05-07T19:50:32.2589195Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:50:32.2590299Z [SETUP.PY] Other arguments: ['egg_info'] 2025-05-07T19:50:32.2590793Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:32.2591376Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:32.2591972Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:32.2592558Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:32.2593091Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:50:32.2594349Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:50:32.5821069Z * Building wheel... 2025-05-07T19:50:33.9051055Z [SETUP.PY] ARGV: ['setup.py', 'bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-gh9ovt89', '--verbose', '--build-target=default', '--build-variant=cuda', '--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', '--cxxprefix=/github/home/miniconda/envs/build_binary', '--debug', '--package_channel=nightly', '--python-tag=py310', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:33.9055586Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=True, debug=True, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path='/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', nccl_lib_path='/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2', use_fb_only=False, cxxprefix='/github/home/miniconda/envs/build_binary') 2025-05-07T19:50:33.9058615Z [SETUP.PY] Other arguments: ['bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-gh9ovt89', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', '--python-tag=py310', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:33.9060721Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:33.9061308Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:33.9061915Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:33.9062493Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:33.9063265Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:50:33.9069511Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DCMAKE_VERBOSE_MAKEFILE=ON', '-DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', '-DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include', '-DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc', '-DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++', "-DCMAKE_C_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", "-DCMAKE_CXX_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20'] 2025-05-07T19:50:33.9075243Z 2025-05-07T19:50:33.9075248Z 2025-05-07T19:50:33.9075411Z -------------------------------------------------------------------------------- 2025-05-07T19:50:33.9075794Z -- Trying 'Ninja' generator 2025-05-07T19:50:33.9076048Z -------------------------------- 2025-05-07T19:50:33.9076316Z --------------------------- 2025-05-07T19:50:33.9076546Z ---------------------- 2025-05-07T19:50:33.9076775Z ----------------- 2025-05-07T19:50:33.9076979Z ------------ 2025-05-07T19:50:33.9077190Z ------- 2025-05-07T19:50:33.9077373Z -- 2025-05-07T19:50:33.9462085Z CMake Deprecation Warning at CMakeLists.txt:1 (cmake_minimum_required): 2025-05-07T19:50:33.9463792Z Compatibility with CMake < 3.10 will be removed from a future version of 2025-05-07T19:50:33.9464987Z CMake. 2025-05-07T19:50:33.9465306Z 2025-05-07T19:50:33.9466005Z Update the VERSION argument value. Or, use the ... syntax 2025-05-07T19:50:33.9467438Z to tell CMake that the project requires at least but has been updated 2025-05-07T19:50:33.9467908Z to work with policies introduced by or earlier. 2025-05-07T19:50:33.9468146Z 2025-05-07T19:50:33.9468150Z 2025-05-07T19:50:33.9468349Z Not searching for unused variables given on the command line. 2025-05-07T19:50:34.0335970Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:50:34.0443571Z -- Detecting C compiler ABI info 2025-05-07T19:50:34.1737581Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:34.1861786Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:50:34.1863332Z -- Detecting C compile features 2025-05-07T19:50:34.1864259Z -- Detecting C compile features - done 2025-05-07T19:50:34.3383524Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:50:34.3465961Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:34.5013119Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:34.5140434Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:50:34.5142003Z -- Detecting CXX compile features 2025-05-07T19:50:34.5147550Z -- Detecting CXX compile features - done 2025-05-07T19:50:34.5164302Z -- Configuring done (0.6s) 2025-05-07T19:50:34.5215398Z -- Generating done (0.0s) 2025-05-07T19:50:34.5227521Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_cmake_test_compile/build 2025-05-07T19:50:34.5269468Z -- 2025-05-07T19:50:34.5270091Z ------- 2025-05-07T19:50:34.5271166Z ------------ 2025-05-07T19:50:34.5271732Z ----------------- 2025-05-07T19:50:34.5272350Z ---------------------- 2025-05-07T19:50:34.5273001Z --------------------------- 2025-05-07T19:50:34.5273715Z -------------------------------- 2025-05-07T19:50:34.5274505Z -- Trying 'Ninja' generator - success 2025-05-07T19:50:34.5275806Z -------------------------------------------------------------------------------- 2025-05-07T19:50:34.5276634Z 2025-05-07T19:50:34.5281623Z Configuring Project 2025-05-07T19:50:34.5281948Z Working directory: 2025-05-07T19:50:34.5282975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build 2025-05-07T19:50:34.5283455Z Command: 2025-05-07T19:50:34.5297267Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/cmake/data/bin/cmake /__w/FBGEMM/FBGEMM/fbgemm_gpu -G Ninja -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja --no-warn-unused-cli -DCMAKE_INSTALL_PREFIX:PATH=/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install -DPYTHON_VERSION_STRING:STRING=3.10.17 -DSKBUILD:INTERNAL=TRUE -DCMAKE_MODULE_PATH:PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/skbuild/resources/cmake -DPYTHON_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPYTHON_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.10 -DPYTHON_LIBRARY:PATH=/github/home/miniconda/envs/build_binary/lib/libpython3.10.so -DPython_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython_FIND_REGISTRY:STRING=NEVER -DPython_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.10 -DPython_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/numpy/_core/include -DPython3_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython3_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython3_FIND_REGISTRY:STRING=NEVER -DPython3_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.10 -DPython3_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/numpy/_core/include -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja -DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch -D_GLIBCXX_USE_CXX11_ABI=1 -DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE -DFBGEMM_BUILD_TARGET=default -DFBGEMM_BUILD_VARIANT=cuda -DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 -DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc -DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++ '-DCMAKE_C_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DCMAKE_CXX_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 -DCMAKE_BUILD_TYPE:STRING=Release 2025-05-07T19:50:34.5311122Z 2025-05-07T19:50:34.5756419Z 2025-05-07T19:50:34.5756900Z Not searching for unused variables given on the command line. 2025-05-07T19:50:34.5757252Z 2025-05-07T19:50:34.5757421Z ================================================================================ 2025-05-07T19:50:34.5757798Z Default C compiler flags 2025-05-07T19:50:34.5758214Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:34.5758845Z 2025-05-07T19:50:34.5759719Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:50:34.5760788Z ================================================================================ 2025-05-07T19:50:34.5761018Z 2025-05-07T19:50:34.5761023Z 2025-05-07T19:50:34.5761027Z 2025-05-07T19:50:34.5761155Z ================================================================================ 2025-05-07T19:50:34.5761483Z Default C++ compiler flags 2025-05-07T19:50:34.5761869Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:34.5762170Z 2025-05-07T19:50:34.5763006Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:50:34.5764064Z ================================================================================ 2025-05-07T19:50:34.5764293Z 2025-05-07T19:50:34.5764297Z 2025-05-07T19:50:34.5764317Z 2025-05-07T19:50:34.5764427Z ================================================================================ 2025-05-07T19:50:34.5764735Z AVX2_FLAGS: 2025-05-07T19:50:34.5764871Z 2025-05-07T19:50:34.5764951Z -mavx2 2025-05-07T19:50:34.5765142Z -mf16c 2025-05-07T19:50:34.5765344Z -mfma 2025-05-07T19:50:34.5765547Z -fopenmp 2025-05-07T19:50:34.5765769Z ================================================================================ 2025-05-07T19:50:34.5766003Z 2025-05-07T19:50:34.5766006Z 2025-05-07T19:50:34.5766010Z 2025-05-07T19:50:34.5766135Z ================================================================================ 2025-05-07T19:50:34.5766440Z AVX512_FLAGS: 2025-05-07T19:50:34.5766582Z 2025-05-07T19:50:34.5766662Z -mavx2 2025-05-07T19:50:34.5766845Z -mf16c 2025-05-07T19:50:34.5767047Z -mfma 2025-05-07T19:50:34.5767232Z -mavx512f 2025-05-07T19:50:34.5767451Z -mavx512bw 2025-05-07T19:50:34.5767661Z -mavx512dq 2025-05-07T19:50:34.5767854Z -mavx512vl 2025-05-07T19:50:34.5768064Z -fopenmp 2025-05-07T19:50:34.5768283Z ================================================================================ 2025-05-07T19:50:34.5768511Z 2025-05-07T19:50:34.5768530Z 2025-05-07T19:50:34.5768534Z 2025-05-07T19:50:34.5768645Z ================================================================================ 2025-05-07T19:50:34.5768981Z The project is built using scikit-build 2025-05-07T19:50:34.5769318Z ================================================================================ 2025-05-07T19:50:34.5769550Z 2025-05-07T19:50:34.5769554Z 2025-05-07T19:50:34.5769557Z 2025-05-07T19:50:34.5769682Z ================================================================================ 2025-05-07T19:50:34.5770095Z Build Settings 2025-05-07T19:50:34.5770240Z 2025-05-07T19:50:34.5770340Z FBGEMM_BUILD_TARGET : default 2025-05-07T19:50:34.5770618Z FBGEMM_BUILD_VARIANT : cuda 2025-05-07T19:50:34.5770809Z 2025-05-07T19:50:34.5770899Z NVCC_VERBOSE : 2025-05-07T19:50:34.5771157Z CUDNN_INCLUDE_DIR : 2025-05-07T19:50:34.5771395Z CUDNN_LIBRARY : 2025-05-07T19:50:34.5771824Z NVML_LIB_PATH : /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.5772286Z TORCH_CUDA_ARCH_LIST : 7.0 2025-05-07T19:50:34.5772546Z 8.0 2025-05-07T19:50:34.5772723Z 9.0 2025-05-07T19:50:34.5772914Z 9.0a 2025-05-07T19:50:34.5773015Z 2025-05-07T19:50:34.5773105Z HIP_ROOT_DIR : 2025-05-07T19:50:34.5773360Z HIPCC_VERBOSE : 2025-05-07T19:50:34.5773758Z AMDGPU_TARGETS : 2025-05-07T19:50:34.5774020Z PYTORCH_ROCM_ARCH : 2025-05-07T19:50:34.5774298Z ================================================================================ 2025-05-07T19:50:34.5774520Z 2025-05-07T19:50:34.7267314Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:50:34.7965845Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:50:35.8454556Z -- The CUDA compiler identification is NVIDIA 12.6.85 with host compiler Clang 16.0.6 2025-05-07T19:50:35.8564943Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:36.0020743Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:36.0146876Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:50:36.0148454Z -- Detecting CXX compile features 2025-05-07T19:50:36.0155527Z -- Detecting CXX compile features - done 2025-05-07T19:50:36.0231007Z -- Detecting C compiler ABI info 2025-05-07T19:50:36.1455041Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:36.1578275Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:50:36.1578924Z -- Detecting C compile features 2025-05-07T19:50:36.1583523Z -- Detecting C compile features - done 2025-05-07T19:50:36.1630501Z -- Detecting CUDA compiler ABI info 2025-05-07T19:50:37.1642782Z -- Detecting CUDA compiler ABI info - done 2025-05-07T19:50:37.2171401Z -- Check for working CUDA compiler: /github/home/miniconda/envs/build_binary/bin/nvcc - skipped 2025-05-07T19:50:37.2201375Z -- Detecting CUDA compile features 2025-05-07T19:50:37.2202710Z -- Detecting CUDA compile features - done 2025-05-07T19:50:37.2224588Z -- Performing Test C_HAS_AVX_1 2025-05-07T19:50:37.5086214Z -- Performing Test C_HAS_AVX_1 - Failed 2025-05-07T19:50:37.5087291Z -- Performing Test C_HAS_AVX_2 2025-05-07T19:50:37.8351181Z -- Performing Test C_HAS_AVX_2 - Success 2025-05-07T19:50:37.8351962Z -- Performing Test C_HAS_AVX2_1 2025-05-07T19:50:38.1186958Z -- Performing Test C_HAS_AVX2_1 - Failed 2025-05-07T19:50:38.1188010Z -- Performing Test C_HAS_AVX2_2 2025-05-07T19:50:38.4427261Z -- Performing Test C_HAS_AVX2_2 - Success 2025-05-07T19:50:38.4428315Z -- Performing Test C_HAS_AVX512_1 2025-05-07T19:50:38.7288439Z -- Performing Test C_HAS_AVX512_1 - Failed 2025-05-07T19:50:38.7289472Z -- Performing Test C_HAS_AVX512_2 2025-05-07T19:50:39.0558365Z -- Performing Test C_HAS_AVX512_2 - Success 2025-05-07T19:50:39.0559424Z -- Performing Test CXX_HAS_AVX_1 2025-05-07T19:50:39.3409225Z -- Performing Test CXX_HAS_AVX_1 - Failed 2025-05-07T19:50:39.3410304Z -- Performing Test CXX_HAS_AVX_2 2025-05-07T19:50:39.6661675Z -- Performing Test CXX_HAS_AVX_2 - Success 2025-05-07T19:50:39.6662851Z -- Performing Test CXX_HAS_AVX2_1 2025-05-07T19:50:39.9483079Z -- Performing Test CXX_HAS_AVX2_1 - Failed 2025-05-07T19:50:39.9484153Z -- Performing Test CXX_HAS_AVX2_2 2025-05-07T19:50:40.2747263Z -- Performing Test CXX_HAS_AVX2_2 - Success 2025-05-07T19:50:40.2748313Z -- Performing Test CXX_HAS_AVX512_1 2025-05-07T19:50:40.5576271Z -- Performing Test CXX_HAS_AVX512_1 - Failed 2025-05-07T19:50:40.5577356Z -- Performing Test CXX_HAS_AVX512_2 2025-05-07T19:50:40.8828135Z -- Performing Test CXX_HAS_AVX512_2 - Success 2025-05-07T19:50:40.9001918Z -- Found CUDA: /github/home/miniconda/envs/build_binary/targets/x86_64-linux (found version "12.6") 2025-05-07T19:50:40.9034984Z -- Found CUDAToolkit: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include (found version "12.6.85") 2025-05-07T19:50:40.9101296Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD 2025-05-07T19:50:41.0322774Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success 2025-05-07T19:50:41.0330704Z -- Found Threads: TRUE 2025-05-07T19:50:41.0342511Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Caffe2/FindCUDAToolkit.cmake:957 (message): 2025-05-07T19:50:41.0343518Z Could not find librt library, needed by CUDA::cudart_static 2025-05-07T19:50:41.0343931Z Call Stack (most recent call first): 2025-05-07T19:50:41.0345029Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:59 (find_package) 2025-05-07T19:50:41.0346156Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:50:41.0347643Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:50:41.0348460Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:41.0348921Z CMakeLists.txt:112 (include) 2025-05-07T19:50:41.0349100Z 2025-05-07T19:50:41.0349104Z 2025-05-07T19:50:41.1619753Z -- PyTorch: CUDA detected: 12.6 2025-05-07T19:50:41.1621315Z -- PyTorch: CUDA nvcc is: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/bin/nvcc 2025-05-07T19:50:41.1622499Z -- PyTorch: CUDA toolkit directory: /github/home/miniconda/envs/build_binary/targets/x86_64-linux 2025-05-07T19:50:41.3318448Z -- PyTorch: Header version is: 12.6 2025-05-07T19:50:41.4178419Z -- Found Python: /github/home/miniconda/envs/build_binary/bin/python (found version "3.10.17") found components: Interpreter 2025-05-07T19:50:41.4201622Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:140 (message): 2025-05-07T19:50:41.4202553Z -- USE_CUDNN is set to 0. Compiling without cuDNN support 2025-05-07T19:50:41.4202993Z Failed to compute shorthash for libnvrtc.so 2025-05-07T19:50:41.4203375Z Call Stack (most recent call first): 2025-05-07T19:50:41.4204091Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:50:41.4205246Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:50:41.4206122Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:41.4206630Z CMakeLists.txt:112 (include) 2025-05-07T19:50:41.4206831Z 2025-05-07T19:50:41.4206836Z 2025-05-07T19:50:41.4207081Z -- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support 2025-05-07T19:50:41.4207706Z -- USE_CUDSS is set to 0. Compiling without cuDSS support 2025-05-07T19:50:41.4208183Z -- USE_CUFILE is set to 0. Compiling without cuFile support 2025-05-07T19:50:41.4209090Z -- Added CUDA NVCC flags for: -gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_80,code=sm_80;-gencode;arch=compute_90,code=sm_90;-gencode;arch=compute_90a,code=sm_90a 2025-05-07T19:50:41.4561567Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:22 (message): 2025-05-07T19:50:41.4562467Z static library kineto_LIBRARY-NOTFOUND not found. 2025-05-07T19:50:41.4562851Z Call Stack (most recent call first): 2025-05-07T19:50:41.4563675Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:125 (append_torchlib_if_found) 2025-05-07T19:50:41.4564645Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:41.4565136Z CMakeLists.txt:112 (include) 2025-05-07T19:50:41.4565331Z 2025-05-07T19:50:41.4565336Z 2025-05-07T19:50:41.4565801Z -- Found Torch: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so 2025-05-07T19:50:41.4566318Z 2025-05-07T19:50:41.4566322Z 2025-05-07T19:50:41.4566451Z ================================================================================ 2025-05-07T19:50:41.4566823Z PyTorch Flags: 2025-05-07T19:50:41.4567091Z 2025-05-07T19:50:41.4567310Z TORCH_INCLUDE_DIRS: 2025-05-07T19:50:41.4567777Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:41.4568589Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:41.4569451Z 2025-05-07T19:50:41.4569670Z TORCH_LIBRARIES: 2025-05-07T19:50:41.4569945Z torch 2025-05-07T19:50:41.4570160Z torch_library 2025-05-07T19:50:41.4570648Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:41.4571374Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:41.4572193Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:41.4572772Z 2025-05-07T19:50:41.4572986Z TORCH_CUDA_OPTIONS: 2025-05-07T19:50:41.4573278Z --expt-relaxed-constexpr 2025-05-07T19:50:41.4573573Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:41.4573897Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:41.4574207Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:41.4574541Z ================================================================================ 2025-05-07T19:50:41.4574779Z 2025-05-07T19:50:41.4574800Z 2025-05-07T19:50:41.4574827Z 2025-05-07T19:50:41.4574958Z ================================================================================ 2025-05-07T19:50:41.4575294Z NCCL Flags 2025-05-07T19:50:41.4575442Z 2025-05-07T19:50:41.4575831Z NCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:41.4576772Z NCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:41.4577418Z ================================================================================ 2025-05-07T19:50:41.4577682Z 2025-05-07T19:50:41.4577686Z 2025-05-07T19:50:41.4577690Z 2025-05-07T19:50:41.4577806Z ================================================================================ 2025-05-07T19:50:41.4578131Z CUDA Driver Path 2025-05-07T19:50:41.4578302Z 2025-05-07T19:50:41.4578668Z CUDA_DRIVER_LIBRARIES=/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:41.4579279Z ================================================================================ 2025-05-07T19:50:41.4579633Z 2025-05-07T19:50:41.4579930Z -- Found NVML_LIB_PATH: /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:41.4603323Z 2025-05-07T19:50:41.4603441Z 2025-05-07T19:50:41.4603705Z ================================================================================ 2025-05-07T19:50:41.4605341Z GPU CPP Library Target: asmjit (SHARED) 2025-05-07T19:50:41.4605730Z 2025-05-07T19:50:41.4606020Z CPU_SRCS: 2025-05-07T19:50:41.4606141Z 2025-05-07T19:50:41.4606251Z 2025-05-07T19:50:41.4606452Z GPU_SRCS: 2025-05-07T19:50:41.4606602Z 2025-05-07T19:50:41.4606697Z 2025-05-07T19:50:41.4606893Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:41.4607040Z 2025-05-07T19:50:41.4607132Z 2025-05-07T19:50:41.4607318Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:41.4607455Z 2025-05-07T19:50:41.4607546Z 2025-05-07T19:50:41.4607723Z OTHER_SRCS: 2025-05-07T19:50:41.4608115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:41.4608753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:41.4609362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:41.4610084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:41.4610685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:41.4611265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:41.4611822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:41.4612399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:41.4612963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:41.4613537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:41.4614123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:41.4614939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:41.4615697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:41.4616274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:41.4618507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:41.4619208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:41.4620023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:41.4620716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:41.4621294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:41.4621886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:41.4622645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:41.4623247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:41.4623873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:41.4624488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:41.4625106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:41.4625669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:41.4626273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:41.4626885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:41.4627435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:41.4628101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:41.4628775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:41.4629343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:41.4629874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:41.4630406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:41.4630942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:41.4631460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:41.4631990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:41.4632508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:41.4633039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:41.4633559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:41.4634087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:41.4634611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:41.4635128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:41.4635660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:41.4636177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:41.4636737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:41.4637279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:41.4637829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:41.4638379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:41.4639138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:41.4639692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:41.4640273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:41.4640941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:41.4641516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:41.4642068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:41.4642599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:41.4643152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:41.4643689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:41.4644243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:41.4644631Z 2025-05-07T19:50:41.4644821Z CC_FLAGS: 2025-05-07T19:50:41.4644928Z 2025-05-07T19:50:41.4645002Z 2025-05-07T19:50:41.4645187Z NVCC_FLAGS: 2025-05-07T19:50:41.4645299Z 2025-05-07T19:50:41.4645387Z 2025-05-07T19:50:41.4645557Z HIPCC_FLAGS: 2025-05-07T19:50:41.4645676Z 2025-05-07T19:50:41.4645767Z 2025-05-07T19:50:41.4645940Z INCLUDE_DIRS: 2025-05-07T19:50:41.4646172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:41.4646637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:41.4646929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:41.4647220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:41.4647723Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:41.4648503Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:41.4649133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:41.4649548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:41.4649962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:41.4650427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:41.4650922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:41.4651375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:41.4651923Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:41.4652402Z 2025-05-07T19:50:41.4652603Z Selected Source Files: 2025-05-07T19:50:41.4652749Z 2025-05-07T19:50:41.4652822Z 2025-05-07T19:50:41.4653016Z HIPified Source Files: 2025-05-07T19:50:41.4653159Z 2025-05-07T19:50:41.4653232Z 2025-05-07T19:50:41.4653429Z Library Dependencies: 2025-05-07T19:50:41.4653645Z torch 2025-05-07T19:50:41.4653939Z torch_library 2025-05-07T19:50:41.4654338Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:41.4654971Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:41.4655619Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:41.4656355Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:41.4657047Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:41.4657597Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:41.4657972Z 2025-05-07T19:50:41.4658143Z Output Library: 2025-05-07T19:50:41.4658349Z asmjit 2025-05-07T19:50:41.4658531Z 2025-05-07T19:50:41.4658707Z Destination Directory: 2025-05-07T19:50:41.4658934Z fbgemm_gpu 2025-05-07T19:50:41.4659172Z ================================================================================ 2025-05-07T19:50:41.4659559Z 2025-05-07T19:50:41.4659563Z 2025-05-07T19:50:41.4659567Z 2025-05-07T19:50:41.4659673Z ================================================================================ 2025-05-07T19:50:41.4660174Z GPU CPP Library Target: fbgemm (SHARED) 2025-05-07T19:50:41.4660457Z 2025-05-07T19:50:41.4660652Z CPU_SRCS: 2025-05-07T19:50:41.4660831Z 2025-05-07T19:50:41.4660909Z 2025-05-07T19:50:41.4661177Z GPU_SRCS: 2025-05-07T19:50:41.4661289Z 2025-05-07T19:50:41.4661363Z 2025-05-07T19:50:41.4661558Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:41.4661695Z 2025-05-07T19:50:41.4661784Z 2025-05-07T19:50:41.4661967Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:41.4662103Z 2025-05-07T19:50:41.4662194Z 2025-05-07T19:50:41.4662371Z OTHER_SRCS: 2025-05-07T19:50:41.4662649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDM.cc 2025-05-07T19:50:41.4663089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:41.4663560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMNBit.cc 2025-05-07T19:50:41.4663965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtils.cc 2025-05-07T19:50:41.4664379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RefImplementations.cc 2025-05-07T19:50:41.4664863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:41.4665314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/SparseAdagrad.cc 2025-05-07T19:50:41.4665692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/Utils.cc 2025-05-07T19:50:41.4666173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:41.4666575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:41.4666955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:41.4667350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:41.4667739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:41.4668078Z 2025-05-07T19:50:41.4668255Z CC_FLAGS: 2025-05-07T19:50:41.4668359Z 2025-05-07T19:50:41.4668428Z 2025-05-07T19:50:41.4668606Z NVCC_FLAGS: 2025-05-07T19:50:41.4668715Z 2025-05-07T19:50:41.4668785Z 2025-05-07T19:50:41.4668960Z HIPCC_FLAGS: 2025-05-07T19:50:41.4669074Z 2025-05-07T19:50:41.4669143Z 2025-05-07T19:50:41.4669319Z INCLUDE_DIRS: 2025-05-07T19:50:41.4669528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:41.4669825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:41.4670079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:41.4670376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:41.4670841Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:41.4671564Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:41.4672173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:41.4672550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:41.4672952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:41.4673377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:41.4673870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:41.4674299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:41.4674804Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:41.4675274Z 2025-05-07T19:50:41.4675456Z Selected Source Files: 2025-05-07T19:50:41.4675608Z 2025-05-07T19:50:41.4675681Z 2025-05-07T19:50:41.4675860Z HIPified Source Files: 2025-05-07T19:50:41.4676013Z 2025-05-07T19:50:41.4676086Z 2025-05-07T19:50:41.4676264Z Library Dependencies: 2025-05-07T19:50:41.4676488Z torch 2025-05-07T19:50:41.4676678Z torch_library 2025-05-07T19:50:41.4677077Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:41.4677713Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:41.4678352Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:41.4679176Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:41.4679855Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:41.4680292Z asmjit 2025-05-07T19:50:41.4680669Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:41.4681035Z 2025-05-07T19:50:41.4681228Z Output Library: 2025-05-07T19:50:41.4681425Z fbgemm 2025-05-07T19:50:41.4681612Z 2025-05-07T19:50:41.4681793Z Destination Directory: 2025-05-07T19:50:41.4682029Z fbgemm_gpu 2025-05-07T19:50:41.4682237Z ================================================================================ 2025-05-07T19:50:41.4682466Z 2025-05-07T19:50:41.4682470Z 2025-05-07T19:50:41.4682473Z 2025-05-07T19:50:41.4682576Z ================================================================================ 2025-05-07T19:50:41.4682896Z Running code generation script ... 2025-05-07T19:50:41.4683590Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py --opensource 2025-05-07T19:50:41.4684310Z ================================================================================ 2025-05-07T19:50:41.4684521Z 2025-05-07T19:50:42.0931258Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:42.0933897Z [GENERAATE BACKWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py', '--opensource'] 2025-05-07T19:50:42.0936057Z Written: gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:42.0937224Z Written: gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:42.0937697Z Written: gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:42.0938173Z Written: gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:42.0938644Z Written: gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:42.0939117Z Written: gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:42.0939701Z Written: gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:42.0940380Z Written: gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:42.0940938Z Written: gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:42.0941453Z Written: gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:42.0941962Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:42.0942496Z Written: gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:42.0943025Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:42.0943597Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:42.0944153Z Written: gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:42.0944691Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:42.0945235Z Written: gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:42.0945775Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:42.0946365Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:42.0946911Z Written: gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:42.0947435Z Written: gen_embedding_optimizer_dense_split_device_kernel.cuh 2025-05-07T19:50:42.0947882Z Written: gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:42.0948266Z Written: gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:42.0948715Z Written: gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:42.0949217Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:42.0949746Z Written: gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:42.0950486Z Written: gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:42.0951021Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:42.0951567Z Written: gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:42.0952080Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:42.0952892Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:42.0953419Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:42.0953934Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:42.0954454Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:42.0955005Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:42.0955503Z Written: gen_embedding_optimizer_adagrad_split_device_kernel.cuh 2025-05-07T19:50:42.0955920Z Written: gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:42.0956309Z Written: gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:42.0956732Z Written: gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.0957133Z Written: lookup_adagrad.py 2025-05-07T19:50:42.0957428Z Written: gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:42.0957820Z Written: gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:42.0958240Z Written: gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.0958706Z Written: gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:42.0959151Z Written: gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:42.0959594Z Written: gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:42.0960069Z Written: gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:42.0960513Z Written: gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:42.0960969Z Written: gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:42.0961403Z Written: gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:42.0961873Z Written: gen_embedding_backward_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:42.0962364Z Written: gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:42.0962820Z Written: gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:42.0963300Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:42.0963775Z Written: gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:42.0964281Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:42.0964801Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:42.0965312Z Written: gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:42.0965816Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:42.0966296Z Written: gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:42.0966801Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:42.0967319Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:42.0967838Z Written: gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:42.0968294Z Written: gen_embedding_optimizer_adam_split_device_kernel.cuh 2025-05-07T19:50:42.0968699Z Written: gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:42.0969060Z Written: gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:42.0969470Z Written: gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.0969893Z Written: lookup_adam.py 2025-05-07T19:50:42.0970183Z Written: gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:42.0970623Z Written: gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.0971153Z Written: gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:42.0971650Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:42.0972155Z Written: gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:42.0972608Z Written: gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:42.0973181Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:42.0973660Z Written: gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:42.0974153Z Written: gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:42.0974656Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:42.0975207Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:42.0975722Z Written: gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:42.0976233Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:42.0976786Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:42.0977263Z Written: gen_embedding_optimizer_lamb_split_device_kernel.cuh 2025-05-07T19:50:42.0977700Z Written: gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:42.0978067Z Written: gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:42.0978533Z Written: gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.0978945Z Written: lookup_lamb.py 2025-05-07T19:50:42.0979240Z Written: gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:42.0979785Z Written: gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.0980464Z Written: gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:42.0981024Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:42.0981566Z Written: gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:42.0982099Z Written: gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:42.0982644Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:42.0983213Z Written: gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:42.0983771Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:42.0984339Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:42.0984922Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:42.0985463Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:42.0986039Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:42.0986632Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:42.0987158Z Written: gen_embedding_optimizer_lars_sgd_split_device_kernel.cuh 2025-05-07T19:50:42.0987617Z Written: gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:42.0988022Z Written: gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:42.0988499Z Written: gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.0988912Z Written: lookup_lars_sgd.py 2025-05-07T19:50:42.0989276Z Written: gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:42.0989753Z Written: gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.0990337Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:42.0990981Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:42.0991616Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:42.0992237Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:42.0992935Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:42.0993523Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:42.0994207Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:42.0994816Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:42.0995453Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:42.0996108Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:42.0996739Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:42.0997361Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:42.2026071Z Written: gen_embedding_optimizer_partial_rowwise_adam_split_device_kernel.cuh 2025-05-07T19:50:42.2027775Z Written: gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:42.2028519Z Written: gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:42.2029103Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.2029575Z Written: lookup_partial_rowwise_adam.py 2025-05-07T19:50:42.2030096Z Written: gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:42.2030611Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.2031183Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:42.2031741Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:42.2032322Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:42.2032882Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:42.2033446Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:42.2034044Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:42.2034618Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:42.2035237Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:42.2035867Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:42.2036454Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:42.2037080Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:42.2037699Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:42.2038287Z Written: gen_embedding_optimizer_partial_rowwise_lamb_split_device_kernel.cuh 2025-05-07T19:50:42.2038789Z Written: gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:42.2039261Z Written: gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:42.2039798Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.2040239Z Written: lookup_partial_rowwise_lamb.py 2025-05-07T19:50:42.2040633Z Written: gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:42.2041142Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.2041700Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:42.2042220Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:42.2042740Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:42.2043243Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:42.2043761Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:42.2044321Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:42.2044859Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:42.2045662Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:42.2046181Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:42.2046700Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:42.2047352Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:42.2047880Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:42.2048405Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_meta.cpp 2025-05-07T19:50:42.2048900Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:42.2049438Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_meta.cpp 2025-05-07T19:50:42.2050003Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:42.2050545Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:42.2051098Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:42.2051623Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_meta.cpp 2025-05-07T19:50:42.2052148Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:42.2052683Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:42.2053257Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:42.2053816Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:42.2054341Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:42.2054915Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:42.2055503Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:42.2056105Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:42.2056678Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:42.2057256Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:42.2057820Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:42.2058386Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:42.2058967Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:42.2059640Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:42.2060404Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:42.2061032Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:42.2061672Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:42.2062334Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:42.2062963Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:42.2063597Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:42.2064202Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:42.2064844Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:42.2065494Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:42.2066129Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:42.2066784Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:42.2067420Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:42.2068170Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:42.2068832Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:42.2069573Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:42.2070202Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:42.2070779Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:42.2071385Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:42.2071976Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:42.2072648Z Written: gen_embedding_optimizer_rowwise_adagrad_ssd_device_kernel.cuh 2025-05-07T19:50:42.2073171Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:42.2073633Z Written: gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:42.2074061Z Written: gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:42.2074525Z Written: gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.2074964Z Written: lookup_rowwise_adagrad_ssd.py 2025-05-07T19:50:42.2075317Z Written: gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:42.2075756Z Written: gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:42.2076253Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.2076672Z Written: lookup_rowwise_adagrad.py 2025-05-07T19:50:42.2077038Z Written: gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:42.2077473Z Written: gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:42.2077966Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.2078509Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:42.2079038Z Written: gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:42.2079525Z Written: gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:42.2080049Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.2080598Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:42.2081118Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.2081736Z Written: gen_embedding_optimizer_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:42.2082322Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:42.2082879Z Written: gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:42.2083497Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.2084111Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:42.2084727Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.2085402Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:42.2086066Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:42.2086683Z Written: gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:42.2087340Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.3284700Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:42.3286964Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.3289034Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:42.3290532Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:42.3291168Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:42.3291937Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:42.3292582Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:42.3293191Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:42.3293826Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:42.3294449Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:42.3295112Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:42.3295742Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:42.3296400Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:42.3297064Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:42.3297729Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:42.3298427Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:42.3299082Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:42.3300054Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:42.3300842Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:42.3301568Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:42.3302331Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:42.3303048Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:42.3303751Z Written: gen_embedding_optimizer_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:42.3304371Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:42.3304921Z Written: gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:42.3305560Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.3306207Z Written: lookup_rowwise_adagrad_with_counter.py 2025-05-07T19:50:42.3306659Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:42.3307224Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.3307871Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:42.3308486Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:42.3309061Z Written: gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:42.3309700Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.3310326Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:42.3310974Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.3311601Z Written: gen_embedding_optimizer_rowwise_weighted_adagrad_split_device_kernel.cuh 2025-05-07T19:50:42.3312157Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:42.3312653Z Written: gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:42.3313338Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.3313910Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:42.3314457Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.3315062Z Written: gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:42.3315495Z Written: gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:42.3315967Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:42.3316432Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:42.3316894Z Written: gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:42.3317354Z Written: gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:42.3317795Z Written: gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:42.3318265Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:42.3318741Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:42.3319216Z Written: gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:42.3319685Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:42.3320177Z Written: gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:42.3320683Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:42.3321196Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:42.3321707Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:42.3322573Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:42.3323165Z Written: gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:42.3323693Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:42.3324270Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:42.3324816Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:42.3325297Z Written: gen_embedding_optimizer_sgd_split_device_kernel.cuh 2025-05-07T19:50:42.3325729Z Written: gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:42.3326101Z Written: gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:42.3326550Z Written: gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.3326940Z Written: lookup_sgd.py 2025-05-07T19:50:42.3327246Z Written: gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:42.3327643Z Written: gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:42.3328065Z Written: gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.3328805Z Written: gen_embedding_optimizer_approx_sgd_split_device_kernel.cuh 2025-05-07T19:50:42.3329238Z Written: gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:42.3329658Z Written: gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:42.3330105Z Written: gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.3330575Z Written: gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:42.3331032Z Written: gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.3331489Z Written: gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:42.3331953Z Written: gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:42.3332406Z Written: gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:42.3332854Z Written: gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:42.3333303Z Written: gen_embedding_backward_none_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:42.3333777Z Written: gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:42.3334228Z Written: gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:42.3334858Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:42.3335383Z Written: gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:42.3335858Z Written: gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:42.3336375Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:42.3336963Z Written: gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:42.3337441Z Written: gen_embedding_optimizer_none_split_device_kernel.cuh 2025-05-07T19:50:42.3337834Z Written: gen_embedding_backward_split_none.cpp 2025-05-07T19:50:42.3338202Z Written: gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:42.3338631Z Written: gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.3339002Z Written: lookup_none.py 2025-05-07T19:50:42.3339299Z Written: gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:42.3339785Z Written: gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.3340481Z Written: gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:42.3341034Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:42.3341625Z Written: gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:42.3342169Z Written: gen_embedding_backward_ssd_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:42.3342677Z Written: gen_embedding_backward_split_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:42.3343189Z Written: gen_embedding_backward_ssd_weighted_device_kernel.cuh 2025-05-07T19:50:42.3343666Z Written: gen_embedding_backward_split_weighted_device_kernel.cuh 2025-05-07T19:50:42.3344191Z Written: gen_embedding_backward_ssd_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:42.3344731Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:42.3345280Z Written: gen_embedding_backward_ssd_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:42.3345824Z Written: gen_embedding_backward_split_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:42.3346430Z Written: gen_embedding_backward_ssd_unweighted_device_kernel.cuh 2025-05-07T19:50:42.3346899Z Written: gen_embedding_backward_split_unweighted_device_kernel.cuh 2025-05-07T19:50:42.3347348Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:42.3347795Z Written: gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:42.3348242Z Written: gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:42.3348727Z Written: gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:42.3349209Z Written: gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:42.3349590Z Written: pt2_arg_utils.h 2025-05-07T19:50:42.3349839Z Written: __init__.py 2025-05-07T19:50:42.3350066Z Written: lookup_args_ssd.py 2025-05-07T19:50:42.3350327Z Written: lookup_args.py 2025-05-07T19:50:42.3397073Z 2025-05-07T19:50:42.3397198Z 2025-05-07T19:50:42.3397460Z ================================================================================ 2025-05-07T19:50:42.3397866Z Running code generation script ... 2025-05-07T19:50:42.3398660Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py --opensource 2025-05-07T19:50:42.3399455Z ================================================================================ 2025-05-07T19:50:42.3399703Z 2025-05-07T19:50:42.4480849Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:42.4483410Z [GENERATE OPTIMIZERS]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py', '--opensource'] 2025-05-07T19:50:42.4485560Z Written: gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:42.4486977Z Written: gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:42.4487959Z Written: gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:42.4488738Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:42.4489196Z Written: split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T19:50:42.4489572Z Written: optimizer_args.py 2025-05-07T19:50:42.4566088Z 2025-05-07T19:50:42.4566229Z 2025-05-07T19:50:42.4567158Z ================================================================================ 2025-05-07T19:50:42.4568268Z Running code generation script ... 2025-05-07T19:50:42.4569766Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py --opensource 2025-05-07T19:50:42.4570669Z ================================================================================ 2025-05-07T19:50:42.4570888Z 2025-05-07T19:50:42.5802897Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:42.5805562Z [GENERATE FORWARD QUANTIZED]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py', '--opensource'] 2025-05-07T19:50:42.5808154Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:42.5809689Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:42.5810354Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:42.5811002Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:42.5811660Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:42.5812304Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:42.5813002Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:42.5813732Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:42.5814432Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:42.5815146Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:42.5815833Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:42.5816570Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:42.5817271Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:42.5817916Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:42.5818577Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:42.5819218Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:42.5820198Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:42.5820914Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:42.5821576Z Written: gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:42.5822442Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:42.5823112Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:42.5823705Z Written: gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:42.5824216Z Written: gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:42.5884377Z 2025-05-07T19:50:42.5884537Z 2025-05-07T19:50:42.5885034Z ================================================================================ 2025-05-07T19:50:42.5886111Z Running code generation script ... 2025-05-07T19:50:42.5888333Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py --opensource 2025-05-07T19:50:42.5890937Z ================================================================================ 2025-05-07T19:50:42.5891172Z 2025-05-07T19:50:42.9788538Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:42.9870073Z [GENERATE FORWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py', '--opensource'] 2025-05-07T19:50:42.9870833Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:42.9871313Z Written: gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:42.9871817Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:42.9872432Z Written: gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:42.9872888Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:42.9873363Z Written: gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:42.9873819Z Written: gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:42.9874272Z Written: gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:42.9874730Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:42.9875227Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:42.9875703Z Written: gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:42.9876149Z Written: gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:42.9876640Z Written: gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:42.9877123Z Written: gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:42.9877627Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:42.9878131Z Written: gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:42.9878627Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:42.9879105Z Written: gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:42.9879583Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:42.9880076Z Written: gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:42.9880536Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:42.9881016Z Written: gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:42.9881473Z Written: gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:42.9881930Z Written: gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:42.9882401Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:42.9882879Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:42.9883357Z Written: gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:42.9883812Z Written: gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:42.9884271Z Written: gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:42.9884683Z Written: gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:42.9885118Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:42.9885754Z Written: gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:42.9886198Z Written: gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:42.9886636Z Written: gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:42.9887064Z Written: gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:42.9887602Z Written: gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:42.9887992Z Written: gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:42.9888417Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:42.9888876Z Written: gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:42.9889452Z Written: gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:42.9890069Z Written: gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:42.9890500Z Written: gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:42.9890928Z Written: gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:42.9891433Z Written: gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:42.9891901Z Written: gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:42.9892359Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:42.9892841Z Written: gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:42.9893298Z Written: gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:42.9893736Z Written: gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:42.9894238Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:42.9894758Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:42.9895277Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:42.9895782Z Written: gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:42.9896266Z Written: gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.9896707Z Written: gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:42.9897120Z Written: gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:42.9897414Z 2025-05-07T19:50:42.9897418Z 2025-05-07T19:50:42.9897530Z ================================================================================ 2025-05-07T19:50:42.9897867Z Running code generation script ... 2025-05-07T19:50:42.9898619Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py --opensource 2025-05-07T19:50:42.9899385Z ================================================================================ 2025-05-07T19:50:42.9899709Z 2025-05-07T19:50:43.2792824Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:43.2794468Z [INDEX SELECT GENERATOR]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py', '--opensource'] 2025-05-07T19:50:43.2795474Z Written: gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:43.2795905Z Written: gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:43.2796351Z Written: gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:43.2796795Z Written: gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:43.2797269Z Written: gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:43.2797710Z Written: gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:43.2798228Z Written: gen_embedding_backward_split_batch_index_select_device_kernel.cuh 2025-05-07T19:50:43.2798759Z Written: gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:43.2799222Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:43.2889159Z -- Adding merge_pooled_embeddings sources 2025-05-07T19:50:43.2901153Z 2025-05-07T19:50:43.2901762Z 2025-05-07T19:50:43.2902240Z ================================================================================ 2025-05-07T19:50:43.2903481Z GPU CPP Library Target: fbgemm_gpu_tbe_cache (SHARED) 2025-05-07T19:50:43.2904516Z 2025-05-07T19:50:43.2905199Z CPU_SRCS: 2025-05-07T19:50:43.2905623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:43.2906327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:43.2907025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:43.2907752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:43.2908395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:43.2909328Z 2025-05-07T19:50:43.2909557Z GPU_SRCS: 2025-05-07T19:50:43.2909901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:43.2910483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:43.2911260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:43.2911910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:43.2912522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:43.2913080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:43.2913704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:43.2914268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:43.2915042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:43.2915732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:43.2916209Z 2025-05-07T19:50:43.2916450Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.2916600Z 2025-05-07T19:50:43.2916689Z 2025-05-07T19:50:43.2916932Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.2917082Z 2025-05-07T19:50:43.2917176Z 2025-05-07T19:50:43.2917410Z OTHER_SRCS: 2025-05-07T19:50:43.2917541Z 2025-05-07T19:50:43.2917628Z 2025-05-07T19:50:43.2917872Z CC_FLAGS: 2025-05-07T19:50:43.2917991Z 2025-05-07T19:50:43.2918087Z 2025-05-07T19:50:43.2918316Z NVCC_FLAGS: 2025-05-07T19:50:43.2918585Z --expt-relaxed-constexpr 2025-05-07T19:50:43.2918871Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:43.2919200Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:43.2919501Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:43.2919788Z 2025-05-07T19:50:43.2919991Z HIPCC_FLAGS: 2025-05-07T19:50:43.2920156Z 2025-05-07T19:50:43.2920246Z 2025-05-07T19:50:43.2920444Z INCLUDE_DIRS: 2025-05-07T19:50:43.2920722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.2921045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.2921361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.2921707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.2922616Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.2923463Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.2924136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.2924608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.2925059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.2925579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.2926149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.2926633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.2927239Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.2927766Z 2025-05-07T19:50:43.2928018Z Selected Source Files: 2025-05-07T19:50:43.2928583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:43.2929224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:43.2929847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:43.2930442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:43.2931051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:43.2931648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:43.2932241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:43.2933007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:43.2933654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:43.2934266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:43.2934939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:43.2935561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:43.2936119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:43.2936702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:43.2937321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:43.2937811Z 2025-05-07T19:50:43.2938055Z HIPified Source Files: 2025-05-07T19:50:43.2938214Z 2025-05-07T19:50:43.2938300Z 2025-05-07T19:50:43.2938541Z Library Dependencies: 2025-05-07T19:50:43.2938781Z torch 2025-05-07T19:50:43.2939023Z torch_library 2025-05-07T19:50:43.2939531Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.2940430Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.2941204Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.2942058Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.2942840Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.2943468Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.2943914Z 2025-05-07T19:50:43.2944125Z Output Library: 2025-05-07T19:50:43.2944394Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:43.2944629Z 2025-05-07T19:50:43.2944879Z Destination Directory: 2025-05-07T19:50:43.2945133Z fbgemm_gpu 2025-05-07T19:50:43.2945405Z ================================================================================ 2025-05-07T19:50:43.2945643Z 2025-05-07T19:50:43.3414580Z 2025-05-07T19:50:43.3414925Z 2025-05-07T19:50:43.3415365Z ================================================================================ 2025-05-07T19:50:43.3415858Z GPU CPP Library Target: fbgemm_gpu_tbe_inference (SHARED) 2025-05-07T19:50:43.3416307Z 2025-05-07T19:50:43.3416543Z CPU_SRCS: 2025-05-07T19:50:43.3416865Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:43.3417367Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:43.3417832Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:43.3418224Z 2025-05-07T19:50:43.3418435Z GPU_SRCS: 2025-05-07T19:50:43.3418766Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:43.3419281Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:43.3420006Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:43.3420688Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:43.3421312Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:43.3422170Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:43.3422805Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:43.3423466Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:43.3424148Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:43.3424840Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:43.3425551Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:43.3427883Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:43.3428542Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:43.3429193Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:43.3429896Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:43.3430530Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:43.3431154Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:43.3431746Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:43.3432366Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:43.3432958Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:43.3433549Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:43.3434121Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:43.3434707Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:43.3435129Z 2025-05-07T19:50:43.3435323Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.3435468Z 2025-05-07T19:50:43.3435574Z 2025-05-07T19:50:43.3435765Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.3435903Z 2025-05-07T19:50:43.3436012Z 2025-05-07T19:50:43.3436212Z OTHER_SRCS: 2025-05-07T19:50:43.3436365Z 2025-05-07T19:50:43.3436451Z 2025-05-07T19:50:43.3436639Z CC_FLAGS: 2025-05-07T19:50:43.3436779Z 2025-05-07T19:50:43.3436856Z 2025-05-07T19:50:43.3437046Z NVCC_FLAGS: 2025-05-07T19:50:43.3437298Z --expt-relaxed-constexpr 2025-05-07T19:50:43.3437599Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:43.3437873Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:43.3438187Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:43.3438439Z 2025-05-07T19:50:43.3438650Z HIPCC_FLAGS: 2025-05-07T19:50:43.3438774Z 2025-05-07T19:50:43.3438857Z 2025-05-07T19:50:43.3439071Z INCLUDE_DIRS: 2025-05-07T19:50:43.3439307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.3439637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.3439912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.3440227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.3440722Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.3441460Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.3442103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.3442495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.3442929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.3443381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.3443899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.3444367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.3444893Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.3445395Z 2025-05-07T19:50:43.3445599Z Selected Source Files: 2025-05-07T19:50:43.3445943Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:43.3446376Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:43.3446820Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:43.3447233Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:43.3447704Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:43.3448255Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:43.3448835Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:43.3449543Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:43.3450123Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:43.3450731Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:43.3451406Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:43.3452005Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:43.3452659Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:43.3453285Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:43.3453931Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:43.3454563Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:43.3455221Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:43.3455860Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:43.3456446Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:43.3457058Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:43.3457636Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:43.3458241Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:43.3459052Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:43.3459783Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:43.3460623Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:43.3461240Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:43.3461708Z 2025-05-07T19:50:43.3461920Z HIPified Source Files: 2025-05-07T19:50:43.3462108Z 2025-05-07T19:50:43.3462194Z 2025-05-07T19:50:43.3462404Z Library Dependencies: 2025-05-07T19:50:43.3462670Z torch 2025-05-07T19:50:43.3462899Z torch_library 2025-05-07T19:50:43.3463355Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.3464077Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.3464785Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.3465620Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.3466470Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.3466958Z asmjit 2025-05-07T19:50:43.3467188Z fbgemm 2025-05-07T19:50:43.3467404Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:43.3467687Z fbgemm_gpu_config 2025-05-07T19:50:43.3468045Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.3468483Z 2025-05-07T19:50:43.3468685Z Output Library: 2025-05-07T19:50:43.3468949Z fbgemm_gpu_tbe_inference 2025-05-07T19:50:43.3469195Z 2025-05-07T19:50:43.3469434Z Destination Directory: 2025-05-07T19:50:43.3469681Z fbgemm_gpu 2025-05-07T19:50:43.3469941Z ================================================================================ 2025-05-07T19:50:43.3470293Z 2025-05-07T19:50:43.5825432Z 2025-05-07T19:50:43.5825687Z 2025-05-07T19:50:43.5826017Z ================================================================================ 2025-05-07T19:50:43.5826585Z GPU CPP Library Target: fbgemm_gpu_config (SHARED) 2025-05-07T19:50:43.5826940Z 2025-05-07T19:50:43.5827177Z CPU_SRCS: 2025-05-07T19:50:43.5827414Z src/config/feature_gates.cpp 2025-05-07T19:50:43.5827996Z 2025-05-07T19:50:43.5828209Z GPU_SRCS: 2025-05-07T19:50:43.5828364Z 2025-05-07T19:50:43.5828454Z 2025-05-07T19:50:43.5828675Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.5828853Z 2025-05-07T19:50:43.5828938Z 2025-05-07T19:50:43.5829153Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.5829323Z 2025-05-07T19:50:43.5829410Z 2025-05-07T19:50:43.5829652Z OTHER_SRCS: 2025-05-07T19:50:43.5829779Z 2025-05-07T19:50:43.5829958Z 2025-05-07T19:50:43.5830194Z CC_FLAGS: 2025-05-07T19:50:43.5830356Z 2025-05-07T19:50:43.5830448Z 2025-05-07T19:50:43.5830691Z NVCC_FLAGS: 2025-05-07T19:50:43.5830816Z 2025-05-07T19:50:43.5830907Z 2025-05-07T19:50:43.5831139Z HIPCC_FLAGS: 2025-05-07T19:50:43.5831275Z 2025-05-07T19:50:43.5831390Z 2025-05-07T19:50:43.5831595Z INCLUDE_DIRS: 2025-05-07T19:50:43.5831875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.5832213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.5832672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.5833069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.5833851Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.5834742Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.5835480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.5835953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.5836473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.5837002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.5837594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.5838103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.5838682Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.5839216Z 2025-05-07T19:50:43.5839450Z Selected Source Files: 2025-05-07T19:50:43.5839723Z src/config/feature_gates.cpp 2025-05-07T19:50:43.5840013Z 2025-05-07T19:50:43.5840229Z HIPified Source Files: 2025-05-07T19:50:43.5840390Z 2025-05-07T19:50:43.5840503Z 2025-05-07T19:50:43.5840719Z Library Dependencies: 2025-05-07T19:50:43.5840997Z torch 2025-05-07T19:50:43.5841204Z torch_library 2025-05-07T19:50:43.5841686Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.5842536Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.5843591Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.5844544Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.5845385Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.5846041Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.5846508Z 2025-05-07T19:50:43.5846746Z Output Library: 2025-05-07T19:50:43.5847041Z fbgemm_gpu_config 2025-05-07T19:50:43.5847296Z 2025-05-07T19:50:43.5847513Z Destination Directory: 2025-05-07T19:50:43.5847796Z fbgemm_gpu 2025-05-07T19:50:43.5848075Z ================================================================================ 2025-05-07T19:50:43.5848321Z 2025-05-07T19:50:43.5848382Z 2025-05-07T19:50:43.5848416Z 2025-05-07T19:50:43.5848535Z ================================================================================ 2025-05-07T19:50:43.5848928Z GPU CPP Library Target: fbgemm_gpu_tbe_utils (SHARED) 2025-05-07T19:50:43.5849308Z 2025-05-07T19:50:43.5849514Z CPU_SRCS: 2025-05-07T19:50:43.5849848Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:43.5850351Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:43.5850731Z 2025-05-07T19:50:43.5850960Z GPU_SRCS: 2025-05-07T19:50:43.5851245Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:43.5853522Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:43.5853949Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:43.5854446Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:43.5854887Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:43.5855345Z 2025-05-07T19:50:43.5855554Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.5855847Z 2025-05-07T19:50:43.5855946Z 2025-05-07T19:50:43.5856186Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.5856337Z 2025-05-07T19:50:43.5856426Z 2025-05-07T19:50:43.5856704Z OTHER_SRCS: 2025-05-07T19:50:43.5856828Z 2025-05-07T19:50:43.5856918Z 2025-05-07T19:50:43.5857135Z CC_FLAGS: 2025-05-07T19:50:43.5857254Z 2025-05-07T19:50:43.5857340Z 2025-05-07T19:50:43.5857550Z NVCC_FLAGS: 2025-05-07T19:50:43.5857674Z 2025-05-07T19:50:43.5857763Z 2025-05-07T19:50:43.5857984Z HIPCC_FLAGS: 2025-05-07T19:50:43.5858111Z 2025-05-07T19:50:43.5858221Z 2025-05-07T19:50:43.5858417Z INCLUDE_DIRS: 2025-05-07T19:50:43.5858690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.5859015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.5859334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.5859732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.5860447Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.5861261Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.5861953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.5862448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.5862919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.5863488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.5864084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.5864596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.5865235Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.5865844Z 2025-05-07T19:50:43.5866092Z Selected Source Files: 2025-05-07T19:50:43.5866594Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:43.5867065Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:43.5867513Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:43.5867953Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:43.5868353Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:43.5868758Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:43.5869171Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:43.5869552Z 2025-05-07T19:50:43.5869783Z HIPified Source Files: 2025-05-07T19:50:43.5869949Z 2025-05-07T19:50:43.5870037Z 2025-05-07T19:50:43.5870281Z Library Dependencies: 2025-05-07T19:50:43.5870525Z torch 2025-05-07T19:50:43.5870771Z torch_library 2025-05-07T19:50:43.5871213Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.5872014Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.5872722Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.5873519Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.5874223Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.5874816Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.5875206Z 2025-05-07T19:50:43.5875417Z Output Library: 2025-05-07T19:50:43.5875636Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:43.5875890Z 2025-05-07T19:50:43.5876102Z Destination Directory: 2025-05-07T19:50:43.5876375Z fbgemm_gpu 2025-05-07T19:50:43.5876613Z ================================================================================ 2025-05-07T19:50:43.5876990Z 2025-05-07T19:50:43.5876994Z 2025-05-07T19:50:43.5876997Z 2025-05-07T19:50:43.5877106Z ================================================================================ 2025-05-07T19:50:43.5877545Z GPU CPP Library Target: fbgemm_gpu_sparse_async_cumsum (SHARED) 2025-05-07T19:50:43.5877913Z 2025-05-07T19:50:43.5878201Z CPU_SRCS: 2025-05-07T19:50:43.5878434Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:43.5878758Z 2025-05-07T19:50:43.5878954Z GPU_SRCS: 2025-05-07T19:50:43.5879219Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:43.5879531Z 2025-05-07T19:50:43.5879735Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.5879876Z 2025-05-07T19:50:43.5879993Z 2025-05-07T19:50:43.5880192Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.5880339Z 2025-05-07T19:50:43.5880453Z 2025-05-07T19:50:43.5880637Z OTHER_SRCS: 2025-05-07T19:50:43.5880777Z 2025-05-07T19:50:43.5880856Z 2025-05-07T19:50:43.5881038Z CC_FLAGS: 2025-05-07T19:50:43.5881181Z 2025-05-07T19:50:43.5881262Z 2025-05-07T19:50:43.5881445Z NVCC_FLAGS: 2025-05-07T19:50:43.5881587Z 2025-05-07T19:50:43.5881668Z 2025-05-07T19:50:43.5881900Z HIPCC_FLAGS: 2025-05-07T19:50:43.5882024Z 2025-05-07T19:50:43.5882107Z 2025-05-07T19:50:43.5882318Z INCLUDE_DIRS: 2025-05-07T19:50:43.5882552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.5882929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.5883238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.5883575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.5884114Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.5884942Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.5885651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.5886121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.5886567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.5887127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.5887657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.5888155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.5888776Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.5889327Z 2025-05-07T19:50:43.5889544Z Selected Source Files: 2025-05-07T19:50:43.5889833Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:43.5890207Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:43.5890513Z 2025-05-07T19:50:43.5890715Z HIPified Source Files: 2025-05-07T19:50:43.5890939Z 2025-05-07T19:50:43.5891025Z 2025-05-07T19:50:43.5891229Z Library Dependencies: 2025-05-07T19:50:43.5891483Z torch 2025-05-07T19:50:43.5891679Z torch_library 2025-05-07T19:50:43.5892125Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.5892793Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.5893457Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.5894224Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.5894923Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.5895404Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:43.5895758Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.5896172Z 2025-05-07T19:50:43.5896371Z Output Library: 2025-05-07T19:50:43.5896632Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:43.5896910Z 2025-05-07T19:50:43.5897111Z Destination Directory: 2025-05-07T19:50:43.5897375Z fbgemm_gpu 2025-05-07T19:50:43.5897603Z ================================================================================ 2025-05-07T19:50:43.5897975Z 2025-05-07T19:50:43.5898033Z 2025-05-07T19:50:43.5898036Z 2025-05-07T19:50:43.5898150Z ================================================================================ 2025-05-07T19:50:43.5898552Z GPU CPP Library Target: fbgemm_gpu_tbe_common (SHARED) 2025-05-07T19:50:43.5898915Z 2025-05-07T19:50:43.5899108Z CPU_SRCS: 2025-05-07T19:50:43.5899573Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:43.5900173Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:43.5900634Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:43.5900949Z 2025-05-07T19:50:43.5901183Z GPU_SRCS: 2025-05-07T19:50:43.5901426Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:43.5901813Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:50:43.5902174Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:43.5902539Z 2025-05-07T19:50:43.5902773Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.5902923Z 2025-05-07T19:50:43.5903009Z 2025-05-07T19:50:43.5903250Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.5903403Z 2025-05-07T19:50:43.5903495Z 2025-05-07T19:50:43.5903734Z OTHER_SRCS: 2025-05-07T19:50:43.5903873Z 2025-05-07T19:50:43.5903970Z 2025-05-07T19:50:43.5904202Z CC_FLAGS: 2025-05-07T19:50:43.5904329Z 2025-05-07T19:50:43.5904416Z 2025-05-07T19:50:43.5904643Z NVCC_FLAGS: 2025-05-07T19:50:43.5904882Z --expt-relaxed-constexpr 2025-05-07T19:50:43.5905207Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:43.5905513Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:43.5905891Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:43.5906189Z 2025-05-07T19:50:43.5906398Z HIPCC_FLAGS: 2025-05-07T19:50:43.5906596Z 2025-05-07T19:50:43.5906713Z 2025-05-07T19:50:43.5906918Z INCLUDE_DIRS: 2025-05-07T19:50:43.5907182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.5907577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.5907882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.5908258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.5908787Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.5909709Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.5910552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.5910982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.5911501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.5911986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.5912570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.5913148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.5913717Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.5914341Z 2025-05-07T19:50:43.5914628Z Selected Source Files: 2025-05-07T19:50:43.5915123Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:43.5915593Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:43.5916018Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:43.5916409Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:43.5916779Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:43.5917152Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:50:43.5917484Z 2025-05-07T19:50:43.5917697Z HIPified Source Files: 2025-05-07T19:50:43.5917859Z 2025-05-07T19:50:43.5917973Z 2025-05-07T19:50:43.5918180Z Library Dependencies: 2025-05-07T19:50:43.5918429Z torch 2025-05-07T19:50:43.5918622Z torch_library 2025-05-07T19:50:43.5919074Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.5919753Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.5920458Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.5921388Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.5922310Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.5922876Z fbgemm 2025-05-07T19:50:43.5923073Z fbgemm_gpu_config 2025-05-07T19:50:43.5923585Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.5923989Z 2025-05-07T19:50:43.5924203Z Output Library: 2025-05-07T19:50:43.5924433Z fbgemm_gpu_tbe_common 2025-05-07T19:50:43.5924679Z 2025-05-07T19:50:43.5924878Z Destination Directory: 2025-05-07T19:50:43.5925133Z fbgemm_gpu 2025-05-07T19:50:43.5925380Z ================================================================================ 2025-05-07T19:50:43.5925619Z 2025-05-07T19:50:43.5925624Z 2025-05-07T19:50:43.5925628Z 2025-05-07T19:50:43.5925742Z ================================================================================ 2025-05-07T19:50:43.5926153Z GPU CPP Library Target: fbgemm_gpu_tbe_optimizers (SHARED) 2025-05-07T19:50:43.5926507Z 2025-05-07T19:50:43.5926707Z CPU_SRCS: 2025-05-07T19:50:43.5926824Z 2025-05-07T19:50:43.5926902Z 2025-05-07T19:50:43.5927106Z GPU_SRCS: 2025-05-07T19:50:43.5927376Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:43.5927773Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:43.5928206Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:43.5928545Z 2025-05-07T19:50:43.5928758Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.5928901Z 2025-05-07T19:50:43.5928981Z 2025-05-07T19:50:43.5929188Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.5929326Z 2025-05-07T19:50:43.5929406Z 2025-05-07T19:50:43.5929605Z OTHER_SRCS: 2025-05-07T19:50:43.5929724Z 2025-05-07T19:50:43.5929820Z 2025-05-07T19:50:43.5930001Z CC_FLAGS: 2025-05-07T19:50:43.5930116Z 2025-05-07T19:50:43.5930213Z 2025-05-07T19:50:43.5930398Z NVCC_FLAGS: 2025-05-07T19:50:43.5930638Z --expt-relaxed-constexpr 2025-05-07T19:50:43.5930915Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:43.5931214Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:43.5931510Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:43.5931780Z 2025-05-07T19:50:43.5931971Z HIPCC_FLAGS: 2025-05-07T19:50:43.5932118Z 2025-05-07T19:50:43.5932201Z 2025-05-07T19:50:43.5932391Z INCLUDE_DIRS: 2025-05-07T19:50:43.5932642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.5932972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.5933251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.5933582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.5934070Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.5935075Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.5935668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.5936063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.5936480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.5936914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.5937408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.5937830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.5938365Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.5938829Z 2025-05-07T19:50:43.5939030Z Selected Source Files: 2025-05-07T19:50:43.5939307Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:43.5939999Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:43.5940433Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:43.5940863Z 2025-05-07T19:50:43.5941077Z HIPified Source Files: 2025-05-07T19:50:43.5941235Z 2025-05-07T19:50:43.5941317Z 2025-05-07T19:50:43.5941533Z Library Dependencies: 2025-05-07T19:50:43.5941965Z torch 2025-05-07T19:50:43.5942178Z torch_library 2025-05-07T19:50:43.5942616Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.5943314Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.5944115Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.5944911Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.5945725Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.5946356Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.5946900Z 2025-05-07T19:50:43.5947095Z Output Library: 2025-05-07T19:50:43.5947371Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:43.5947713Z 2025-05-07T19:50:43.5947949Z Destination Directory: 2025-05-07T19:50:43.5948208Z fbgemm_gpu 2025-05-07T19:50:43.5948512Z ================================================================================ 2025-05-07T19:50:43.5948747Z 2025-05-07T19:50:43.5948751Z 2025-05-07T19:50:43.5948773Z 2025-05-07T19:50:43.5948894Z ================================================================================ 2025-05-07T19:50:43.5949380Z GPU CPP Library Target: fbgemm_gpu_tbe_training_forward (SHARED) 2025-05-07T19:50:43.5949790Z 2025-05-07T19:50:43.5950043Z CPU_SRCS: 2025-05-07T19:50:43.5950325Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.5950710Z 2025-05-07T19:50:43.5950926Z GPU_SRCS: 2025-05-07T19:50:43.5951191Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:43.5951779Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:43.5952165Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:43.5952898Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:43.5953401Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:43.5953849Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:43.5954302Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:43.5954738Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:43.5955120Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:43.5955597Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:43.5956028Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:43.5956584Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:43.5957014Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:43.5957533Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:43.5958028Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:43.5958465Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:43.5958978Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:43.5959407Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:43.5959881Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:43.5960315Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:43.5960759Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:43.5961263Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:43.5961708Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:43.5962214Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:43.5962610Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:43.5963058Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:43.5963512Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:43.5963972Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:43.5964395Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:43.5964840Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:43.5965563Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:43.5965982Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:43.5966442Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:43.5966855Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:43.5967367Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:43.5967820Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:43.5968280Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:43.5968828Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:43.5969251Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:43.5969741Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:43.5970185Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:43.5970685Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:43.5971142Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:43.5971613Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:43.5972103Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:43.5972557Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:43.5973070Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:43.5973782Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:43.5974281Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:43.5974817Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:43.5975323Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.5975704Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.5976048Z 2025-05-07T19:50:43.5976256Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.5976427Z 2025-05-07T19:50:43.5976510Z 2025-05-07T19:50:43.5976735Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.5976884Z 2025-05-07T19:50:43.5976968Z 2025-05-07T19:50:43.5977185Z OTHER_SRCS: 2025-05-07T19:50:43.5977313Z 2025-05-07T19:50:43.5977397Z 2025-05-07T19:50:43.5977612Z CC_FLAGS: 2025-05-07T19:50:43.5977737Z 2025-05-07T19:50:43.5977822Z 2025-05-07T19:50:43.5978045Z NVCC_FLAGS: 2025-05-07T19:50:43.5978275Z --expt-relaxed-constexpr 2025-05-07T19:50:43.5978597Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:43.5978891Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:43.5979228Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:43.5979606Z 2025-05-07T19:50:43.5979813Z HIPCC_FLAGS: 2025-05-07T19:50:43.5979952Z 2025-05-07T19:50:43.5980067Z 2025-05-07T19:50:43.5980275Z INCLUDE_DIRS: 2025-05-07T19:50:43.5980635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.5980960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.5981284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.5981603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.5982137Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.5982965Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.5983626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.5984073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.5984511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.5985031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.5985565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.5986076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.5986685Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.5987203Z 2025-05-07T19:50:43.5987449Z Selected Source Files: 2025-05-07T19:50:43.5987757Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.5988351Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:43.5988788Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:43.5989258Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:43.5989696Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:43.5990216Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:43.5990742Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:43.5991239Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:43.5991718Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:43.5992235Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:43.5992794Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:43.5993226Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.5993707Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.5994095Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:43.5994580Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:43.5994997Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:43.5995447Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:43.5995956Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:43.5996401Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:43.5996930Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:43.5997312Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:43.5997776Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:43.5998178Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:43.5998697Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:43.5999192Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:43.5999612Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:43.6000045Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:43.6000451Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:43.6000902Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:43.6001330Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:43.6001761Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:43.6002204Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:43.6002664Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:43.6003243Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:43.6003651Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:43.6004078Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:43.6004482Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:43.6004913Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:43.6005303Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:43.6005741Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:43.6006189Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:43.6006589Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:43.6007081Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:43.6007545Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:43.6008075Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:43.6008566Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:43.6009018Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:43.6009556Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:43.6010007Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:43.6010510Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:43.6010973Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:43.6011615Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:43.6012134Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:43.6012529Z 2025-05-07T19:50:43.6012738Z HIPified Source Files: 2025-05-07T19:50:43.6012922Z 2025-05-07T19:50:43.6013008Z 2025-05-07T19:50:43.6013297Z Library Dependencies: 2025-05-07T19:50:43.6013539Z torch 2025-05-07T19:50:43.6013765Z torch_library 2025-05-07T19:50:43.6014204Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.6014897Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.6015586Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.6016392Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.6017146Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.6017623Z fbgemm_gpu_tbe_common 2025-05-07T19:50:43.6018010Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.6018408Z 2025-05-07T19:50:43.6018627Z Output Library: 2025-05-07T19:50:43.6018870Z fbgemm_gpu_tbe_training_forward 2025-05-07T19:50:43.6019156Z 2025-05-07T19:50:43.6019364Z Destination Directory: 2025-05-07T19:50:43.6019707Z fbgemm_gpu 2025-05-07T19:50:43.6020126Z ================================================================================ 2025-05-07T19:50:43.6020391Z 2025-05-07T19:50:43.6020395Z 2025-05-07T19:50:43.6020401Z 2025-05-07T19:50:43.6020517Z ================================================================================ 2025-05-07T19:50:43.6020992Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_pt2 (SHARED) 2025-05-07T19:50:43.6021397Z 2025-05-07T19:50:43.6021626Z CPU_SRCS: 2025-05-07T19:50:43.6021884Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:43.6022479Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:43.6022866Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:43.6023239Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:43.6023624Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:43.6023977Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:43.6024412Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:43.6024912Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:43.6025352Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:43.6025843Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:43.6026315Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:43.6026807Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:43.6027435Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:43.6028120Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:43.6028739Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:43.6029343Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:43.6029909Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:43.6030373Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6030924Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6031489Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6031947Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6032435Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6032903Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6033406Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6034103Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6034733Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6035238Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6035777Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6036265Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6036944Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6037577Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6038229Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6038808Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6039251Z 2025-05-07T19:50:43.6039477Z GPU_SRCS: 2025-05-07T19:50:43.6039813Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6040331Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6040801Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6041270Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6041915Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6042440Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6042954Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6043597Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6044155Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6044731Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6045298Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6045801Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6046426Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6047122Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6047789Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6048433Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6048965Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6049377Z 2025-05-07T19:50:43.6049581Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.6049750Z 2025-05-07T19:50:43.6049833Z 2025-05-07T19:50:43.6050034Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.6050202Z 2025-05-07T19:50:43.6050288Z 2025-05-07T19:50:43.6050505Z OTHER_SRCS: 2025-05-07T19:50:43.6050626Z 2025-05-07T19:50:43.6050711Z 2025-05-07T19:50:43.6050931Z CC_FLAGS: 2025-05-07T19:50:43.6051050Z 2025-05-07T19:50:43.6051136Z 2025-05-07T19:50:43.6051410Z NVCC_FLAGS: 2025-05-07T19:50:43.6051649Z --expt-relaxed-constexpr 2025-05-07T19:50:43.6051955Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:43.6052312Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:43.6052635Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:43.6052966Z 2025-05-07T19:50:43.6053231Z HIPCC_FLAGS: 2025-05-07T19:50:43.6053359Z 2025-05-07T19:50:43.6053466Z 2025-05-07T19:50:43.6053665Z INCLUDE_DIRS: 2025-05-07T19:50:43.6054000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6054326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.6054702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.6055018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6055586Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.6056461Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.6057095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.6057625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.6058032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.6058511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.6059003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.6059633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.6060375Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.6060910Z 2025-05-07T19:50:43.6061153Z Selected Source Files: 2025-05-07T19:50:43.6061445Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:43.6061856Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:43.6062239Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:43.6062608Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:43.6062951Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:43.6063336Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:43.6063739Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:43.6064207Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:43.6064629Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:43.6065053Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:43.6065532Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:43.6065961Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:43.6066500Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:43.6067091Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:43.6067703Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:43.6068230Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:43.6068704Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:43.6069350Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6096453Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6096989Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6097409Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6097886Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6098315Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6098827Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6099366Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6100181Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6100800Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6101351Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6101903Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6102518Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6103228Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6103907Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6104554Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:43.6105105Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6105594Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6106093Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6106517Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6106970Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6107411Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6108095Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6108679Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6109170Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6109772Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6110320Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6110871Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6111487Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6112305Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6112969Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6113542Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6114080Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:43.6114448Z 2025-05-07T19:50:43.6114682Z HIPified Source Files: 2025-05-07T19:50:43.6114835Z 2025-05-07T19:50:43.6114919Z 2025-05-07T19:50:43.6115145Z Library Dependencies: 2025-05-07T19:50:43.6115405Z torch 2025-05-07T19:50:43.6115607Z torch_library 2025-05-07T19:50:43.6116061Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.6116704Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.6117392Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.6118142Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.6118864Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.6119336Z fbgemm 2025-05-07T19:50:43.6119545Z fbgemm_gpu_config 2025-05-07T19:50:43.6119802Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:43.6120018Z fbgemm_gpu_tbe_common 2025-05-07T19:50:43.6120271Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:43.6120518Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:43.6120926Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.6121313Z 2025-05-07T19:50:43.6121540Z Output Library: 2025-05-07T19:50:43.6121779Z fbgemm_gpu_tbe_training_backward_pt2 2025-05-07T19:50:43.6122434Z 2025-05-07T19:50:43.6122653Z Destination Directory: 2025-05-07T19:50:43.6122933Z fbgemm_gpu 2025-05-07T19:50:43.6123204Z ================================================================================ 2025-05-07T19:50:43.6123448Z 2025-05-07T19:50:43.6123453Z 2025-05-07T19:50:43.6123456Z 2025-05-07T19:50:43.6123575Z ================================================================================ 2025-05-07T19:50:43.6124028Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward (SHARED) 2025-05-07T19:50:43.6124427Z 2025-05-07T19:50:43.6124618Z CPU_SRCS: 2025-05-07T19:50:43.6124948Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:43.6125399Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:43.6125772Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:43.6126147Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:43.6126534Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:43.6126868Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:43.6127222Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:43.6127565Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:43.6127978Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:43.6128417Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:43.6128814Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:43.6129245Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:43.6129847Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:43.6130276Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:43.6130777Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:43.6131369Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:43.6132017Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:43.6132526Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:43.6132964Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:43.6133338Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:43.6133730Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:43.6134022Z 2025-05-07T19:50:43.6134227Z GPU_SRCS: 2025-05-07T19:50:43.6134488Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:43.6135010Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:43.6135436Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:43.6135858Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:43.6136288Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:43.6136730Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:43.6137215Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:43.6137681Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6138189Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6138711Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6139211Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:43.6139764Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6140459Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6140961Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:43.6141399Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6141876Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6142343Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6142863Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6143402Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6143880Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:43.6144340Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6144821Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6145304Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:43.6145799Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6146341Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6146889Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6147439Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6148046Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6148579Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:43.6149114Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6149650Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6150131Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:43.6150551Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6150971Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6151428Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6151967Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6152567Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6152974Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:43.6153377Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6153843Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6154244Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:43.6154638Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6155033Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6155462Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6155892Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6156360Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6156776Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:43.6157192Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6157645Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6158037Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:43.6158443Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6158845Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6159258Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6159690Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6160164Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6160605Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:43.6160995Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6161426Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6161836Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:43.6162263Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6162692Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6163142Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6163602Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6164103Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6164568Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:43.6164983Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6165454Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6165912Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:43.6166432Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6166957Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6167496Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6168224Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6168796Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6169348Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:43.6169852Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6170392Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6170923Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:43.6171412Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6171944Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6172530Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6173086Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6173644Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6174269Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:43.6174782Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6175312Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6175773Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:43.6176156Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6176570Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6176974Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6177424Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6177891Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6178567Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:43.6178970Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6179385Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6180139Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:43.6180801Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6181434Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6182071Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6182717Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6183400Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6184032Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:43.6184649Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6185279Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6185753Z 2025-05-07T19:50:43.6185959Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.6186106Z 2025-05-07T19:50:43.6186210Z 2025-05-07T19:50:43.6186405Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.6186772Z gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:43.6187265Z gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:43.6187754Z gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:43.6188124Z 2025-05-07T19:50:43.6188330Z OTHER_SRCS: 2025-05-07T19:50:43.6188450Z 2025-05-07T19:50:43.6188557Z 2025-05-07T19:50:43.6188734Z CC_FLAGS: 2025-05-07T19:50:43.6188853Z 2025-05-07T19:50:43.6188957Z 2025-05-07T19:50:43.6189136Z NVCC_FLAGS: 2025-05-07T19:50:43.6189378Z --expt-relaxed-constexpr 2025-05-07T19:50:43.6189646Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:43.6189948Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:43.6190234Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:43.6190502Z 2025-05-07T19:50:43.6190687Z HIPCC_FLAGS: 2025-05-07T19:50:43.6190828Z 2025-05-07T19:50:43.6190905Z 2025-05-07T19:50:43.6191107Z INCLUDE_DIRS: 2025-05-07T19:50:43.6191332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6191655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.6191935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.6192246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6192735Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.6193540Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.6194274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.6194688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.6195135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.6195592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.6196178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.6196628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.6196954Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.6197034Z 2025-05-07T19:50:43.6197128Z Selected Source Files: 2025-05-07T19:50:43.6197338Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:43.6197471Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:43.6197599Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:43.6197746Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:43.6197883Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:43.6198000Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:43.6198112Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:43.6198259Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:43.6198537Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:43.6198685Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:43.6198794Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:43.6198996Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:43.6199117Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:43.6199282Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:43.6199504Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:43.6199727Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:43.6199929Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:43.6200115Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:43.6200231Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:43.6200365Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:43.6200480Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:43.6200627Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:43.6200785Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:43.6200944Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:43.6201108Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:43.6201267Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:43.6201447Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:43.6201652Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:43.6201843Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6202052Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6202263Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6202450Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:43.6202636Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6202829Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6202987Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:43.6203146Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6203312Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6203499Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6203691Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6203945Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6204092Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:43.6204270Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6204495Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6204657Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:43.6204862Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6205049Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6205240Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6205473Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6205695Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6205875Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:43.6206097Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6206295Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6206422Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:43.6206570Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6206733Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6206883Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6207056Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6207249Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6207383Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:43.6207537Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6207706Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6207840Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:43.6207986Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6208131Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6208297Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6208477Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6208661Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6208819Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:43.6208975Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6209131Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6209269Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:43.6209413Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6209557Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6209716Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6209903Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6210088Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6210227Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:43.6210403Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6210562Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6210696Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:43.6210877Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6211045Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6211214Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6211405Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6211658Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6211802Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:43.6211969Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6212163Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6212402Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:43.6212616Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6212847Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6213057Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6213289Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6213523Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6213728Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:43.6213936Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6214147Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6214341Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:43.6214549Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6214758Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6214987Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6215223Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6215457Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6215666Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:43.6215880Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6216095Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6216226Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:43.6216396Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6216553Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6216704Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6216898Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6217080Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6217219Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:43.6217388Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6217545Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6217760Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:43.6217994Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6218252Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6218497Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6218755Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6219036Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6219251Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:43.6219570Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6219996Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6220080Z 2025-05-07T19:50:43.6220271Z HIPified Source Files: 2025-05-07T19:50:43.6220276Z 2025-05-07T19:50:43.6220367Z 2025-05-07T19:50:43.6220465Z Library Dependencies: 2025-05-07T19:50:43.6220581Z torch 2025-05-07T19:50:43.6220666Z torch_library 2025-05-07T19:50:43.6220996Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.6221308Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.6221645Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.6222192Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.6222466Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.6222543Z fbgemm 2025-05-07T19:50:43.6222645Z fbgemm_gpu_config 2025-05-07T19:50:43.6222737Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:43.6222827Z fbgemm_gpu_tbe_common 2025-05-07T19:50:43.6222926Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:43.6223055Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:43.6223269Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.6223350Z 2025-05-07T19:50:43.6223456Z Output Library: 2025-05-07T19:50:43.6223558Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:43.6223633Z 2025-05-07T19:50:43.6223729Z Destination Directory: 2025-05-07T19:50:43.6223839Z fbgemm_gpu 2025-05-07T19:50:43.6223956Z ================================================================================ 2025-05-07T19:50:43.6223961Z 2025-05-07T19:50:43.6223965Z 2025-05-07T19:50:43.6223969Z 2025-05-07T19:50:43.6224079Z ================================================================================ 2025-05-07T19:50:43.6224306Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_gwd (SHARED) 2025-05-07T19:50:43.6224389Z 2025-05-07T19:50:43.6224472Z CPU_SRCS: 2025-05-07T19:50:43.6224476Z 2025-05-07T19:50:43.6224579Z 2025-05-07T19:50:43.6224667Z GPU_SRCS: 2025-05-07T19:50:43.6224873Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:43.6225093Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:43.6225328Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:43.6225536Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:43.6225762Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:43.6226003Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:43.6226208Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:43.6226438Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:43.6226682Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:43.6226900Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:43.6227145Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:43.6227398Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:43.6227471Z 2025-05-07T19:50:43.6227564Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.6227569Z 2025-05-07T19:50:43.6227653Z 2025-05-07T19:50:43.6227760Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.6227768Z 2025-05-07T19:50:43.6227838Z 2025-05-07T19:50:43.6227920Z OTHER_SRCS: 2025-05-07T19:50:43.6227924Z 2025-05-07T19:50:43.6228005Z 2025-05-07T19:50:43.6228082Z CC_FLAGS: 2025-05-07T19:50:43.6228086Z 2025-05-07T19:50:43.6228161Z 2025-05-07T19:50:43.6228245Z NVCC_FLAGS: 2025-05-07T19:50:43.6228359Z --expt-relaxed-constexpr 2025-05-07T19:50:43.6228458Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:43.6228563Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:43.6228679Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:43.6228753Z 2025-05-07T19:50:43.6228832Z HIPCC_FLAGS: 2025-05-07T19:50:43.6228950Z 2025-05-07T19:50:43.6229028Z 2025-05-07T19:50:43.6229138Z INCLUDE_DIRS: 2025-05-07T19:50:43.6229246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6229348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.6229479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.6229585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6229943Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.6230360Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.6230506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.6230666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.6230825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.6231047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.6231251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.6231401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.6231723Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.6231801Z 2025-05-07T19:50:43.6231897Z Selected Source Files: 2025-05-07T19:50:43.6232114Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:43.6232332Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:43.6232553Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:43.6232754Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:43.6232990Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:43.6233213Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:43.6233422Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:43.6233666Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:43.6234009Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:43.6234207Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:43.6234452Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:43.6234678Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:43.6234750Z 2025-05-07T19:50:43.6234840Z HIPified Source Files: 2025-05-07T19:50:43.6234857Z 2025-05-07T19:50:43.6234923Z 2025-05-07T19:50:43.6235013Z Library Dependencies: 2025-05-07T19:50:43.6235086Z torch 2025-05-07T19:50:43.6235178Z torch_library 2025-05-07T19:50:43.6235463Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.6235700Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.6236020Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.6236345Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.6236589Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.6236683Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:43.6236887Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.6236961Z 2025-05-07T19:50:43.6237045Z Output Library: 2025-05-07T19:50:43.6237158Z fbgemm_gpu_tbe_training_backward_gwd 2025-05-07T19:50:43.6237230Z 2025-05-07T19:50:43.6237315Z Destination Directory: 2025-05-07T19:50:43.6237391Z fbgemm_gpu 2025-05-07T19:50:43.6237507Z ================================================================================ 2025-05-07T19:50:43.6237511Z 2025-05-07T19:50:43.6237514Z 2025-05-07T19:50:43.6237518Z 2025-05-07T19:50:43.6237622Z ================================================================================ 2025-05-07T19:50:43.6237948Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_vbe (SHARED) 2025-05-07T19:50:43.6238031Z 2025-05-07T19:50:43.6238108Z CPU_SRCS: 2025-05-07T19:50:43.6238112Z 2025-05-07T19:50:43.6238184Z 2025-05-07T19:50:43.6238286Z GPU_SRCS: 2025-05-07T19:50:43.6238534Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:43.6238715Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:43.6238937Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:43.6239116Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:43.6239346Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:43.6239583Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:43.6239743Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:43.6239898Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:43.6240049Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:43.6240225Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:43.6240370Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:43.6240532Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:43.6240734Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:43.6240945Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6241154Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6241325Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:43.6241539Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6241738Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6241928Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:43.6242147Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6242355Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6242529Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:43.6242740Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6242944Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6243173Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:43.6243432Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6243676Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6243910Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:43.6244161Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6244426Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6244566Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:43.6244729Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6244906Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6245043Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:43.6245213Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6245398Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6245539Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:43.6245702Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6245891Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6246091Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:43.6246265Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6246449Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6246614Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:43.6246820Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6246983Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6247155Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:43.6247331Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6247506Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6247599Z 2025-05-07T19:50:43.6247690Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.6247694Z 2025-05-07T19:50:43.6247766Z 2025-05-07T19:50:43.6247853Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.6247860Z 2025-05-07T19:50:43.6247953Z 2025-05-07T19:50:43.6248032Z OTHER_SRCS: 2025-05-07T19:50:43.6248036Z 2025-05-07T19:50:43.6248103Z 2025-05-07T19:50:43.6248188Z CC_FLAGS: 2025-05-07T19:50:43.6248192Z 2025-05-07T19:50:43.6248266Z 2025-05-07T19:50:43.6248339Z NVCC_FLAGS: 2025-05-07T19:50:43.6248432Z --expt-relaxed-constexpr 2025-05-07T19:50:43.6248552Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:43.6248655Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:43.6248758Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:43.6248832Z 2025-05-07T19:50:43.6248918Z HIPCC_FLAGS: 2025-05-07T19:50:43.6248923Z 2025-05-07T19:50:43.6249011Z 2025-05-07T19:50:43.6249088Z INCLUDE_DIRS: 2025-05-07T19:50:43.6249189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6249294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.6249392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.6249493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6249754Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.6250135Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.6250263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.6250426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.6250569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.6250761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.6250966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.6251097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.6251378Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.6251464Z 2025-05-07T19:50:43.6251549Z Selected Source Files: 2025-05-07T19:50:43.6251733Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:43.6251917Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:43.6252134Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:43.6252319Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:43.6252548Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:43.6252790Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:43.6252934Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:43.6253076Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:43.6253230Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:43.6253388Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:43.6253532Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:43.6253696Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:43.6253944Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:43.6254153Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6254362Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6254595Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:43.6254793Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6254993Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6255197Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:43.6255406Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6255615Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6255803Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:43.6256003Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6256204Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6256424Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:43.6256683Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6256925Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6257153Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:43.6257411Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6257661Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6257804Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:43.6257987Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6258153Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6258291Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:43.6258461Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6258623Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6258763Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:43.6258926Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6259106Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6259249Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:43.6259505Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6259712Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6260021Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:43.6260197Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6260396Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6260590Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:43.6260763Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:43.6260952Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:43.6261046Z 2025-05-07T19:50:43.6261133Z HIPified Source Files: 2025-05-07T19:50:43.6261137Z 2025-05-07T19:50:43.6261211Z 2025-05-07T19:50:43.6261309Z Library Dependencies: 2025-05-07T19:50:43.6261390Z torch 2025-05-07T19:50:43.6261475Z torch_library 2025-05-07T19:50:43.6261779Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.6262049Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.6262381Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.6262786Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.6263065Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.6263173Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:43.6263429Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.6263520Z 2025-05-07T19:50:43.6263609Z Output Library: 2025-05-07T19:50:43.6263719Z fbgemm_gpu_tbe_training_backward_vbe 2025-05-07T19:50:43.6263794Z 2025-05-07T19:50:43.6263901Z Destination Directory: 2025-05-07T19:50:43.6263987Z fbgemm_gpu 2025-05-07T19:50:43.6264102Z ================================================================================ 2025-05-07T19:50:43.6264107Z 2025-05-07T19:50:43.6264111Z 2025-05-07T19:50:43.6264114Z 2025-05-07T19:50:43.6264226Z ================================================================================ 2025-05-07T19:50:43.6264445Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_dense (SHARED) 2025-05-07T19:50:43.6264524Z 2025-05-07T19:50:43.6264619Z CPU_SRCS: 2025-05-07T19:50:43.6264623Z 2025-05-07T19:50:43.6264699Z 2025-05-07T19:50:43.6264779Z GPU_SRCS: 2025-05-07T19:50:43.6264932Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:43.6265080Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:43.6265245Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6265412Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6265599Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6265770Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:43.6265971Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6266181Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6266326Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:43.6266487Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:43.6266678Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6266848Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6266957Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:43.6267033Z 2025-05-07T19:50:43.6267142Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.6267147Z 2025-05-07T19:50:43.6267222Z 2025-05-07T19:50:43.6267306Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.6267311Z 2025-05-07T19:50:43.6267404Z 2025-05-07T19:50:43.6267479Z OTHER_SRCS: 2025-05-07T19:50:43.6267484Z 2025-05-07T19:50:43.6267553Z 2025-05-07T19:50:43.6267630Z CC_FLAGS: 2025-05-07T19:50:43.6267634Z 2025-05-07T19:50:43.6267721Z 2025-05-07T19:50:43.6267807Z NVCC_FLAGS: 2025-05-07T19:50:43.6267910Z --expt-relaxed-constexpr 2025-05-07T19:50:43.6268024Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:43.6268132Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:43.6268235Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:43.6268307Z 2025-05-07T19:50:43.6268403Z HIPCC_FLAGS: 2025-05-07T19:50:43.6268407Z 2025-05-07T19:50:43.6268484Z 2025-05-07T19:50:43.6268562Z INCLUDE_DIRS: 2025-05-07T19:50:43.6268690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6268791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.6268902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.6269194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6269490Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.6269876Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.6270025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.6270197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.6270352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.6270608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.6270819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.6270961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.6271273Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.6271371Z 2025-05-07T19:50:43.6271504Z Selected Source Files: 2025-05-07T19:50:43.6271653Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:43.6271830Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:43.6271995Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:43.6272107Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:43.6272354Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:43.6272511Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:43.6272665Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:43.6272820Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:43.6272997Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:43.6273192Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:43.6273323Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:43.6273480Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:43.6273651Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:43.6273723Z 2025-05-07T19:50:43.6273802Z HIPified Source Files: 2025-05-07T19:50:43.6273806Z 2025-05-07T19:50:43.6273887Z 2025-05-07T19:50:43.6273974Z Library Dependencies: 2025-05-07T19:50:43.6274043Z torch 2025-05-07T19:50:43.6274113Z torch_library 2025-05-07T19:50:43.6274407Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.6274637Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.6274939Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.6275274Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.6275525Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.6275623Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:43.6275841Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.6275913Z 2025-05-07T19:50:43.6275993Z Output Library: 2025-05-07T19:50:43.6276096Z fbgemm_gpu_tbe_training_backward_dense 2025-05-07T19:50:43.6276190Z 2025-05-07T19:50:43.6276281Z Destination Directory: 2025-05-07T19:50:43.6276353Z fbgemm_gpu 2025-05-07T19:50:43.6276471Z ================================================================================ 2025-05-07T19:50:43.6276475Z 2025-05-07T19:50:43.6276479Z 2025-05-07T19:50:43.6276482Z 2025-05-07T19:50:43.6276588Z ================================================================================ 2025-05-07T19:50:43.6276797Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_split_host (SHARED) 2025-05-07T19:50:43.6276884Z 2025-05-07T19:50:43.6276963Z CPU_SRCS: 2025-05-07T19:50:43.6276966Z 2025-05-07T19:50:43.6277038Z 2025-05-07T19:50:43.6277111Z GPU_SRCS: 2025-05-07T19:50:43.6277242Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:43.6277373Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:43.6277478Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:43.6277590Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:43.6277696Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:43.6277804Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:43.6277946Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:43.6278098Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:43.6278196Z gen_embedding_backward_split_none.cpp 2025-05-07T19:50:43.6278418Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:43.6278541Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:43.6278684Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:43.6278874Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:43.6279143Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:43.6279329Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:43.6279488Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:43.6279606Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:43.6279769Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:43.6279928Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:43.6280105Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:43.6280314Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:43.6280457Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:43.6280607Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:43.6280769Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:43.6280915Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:43.6281056Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:43.6281205Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:43.6281389Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:43.6281553Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:43.6281748Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:43.6281973Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:43.6282170Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:43.6282375Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:43.6282546Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:43.6282697Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:43.6282921Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:43.6283158Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:43.6283267Z 2025-05-07T19:50:43.6283362Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.6283366Z 2025-05-07T19:50:43.6283448Z 2025-05-07T19:50:43.6283565Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.6283569Z 2025-05-07T19:50:43.6283652Z 2025-05-07T19:50:43.6283740Z OTHER_SRCS: 2025-05-07T19:50:43.6283744Z 2025-05-07T19:50:43.6283856Z 2025-05-07T19:50:43.6283944Z CC_FLAGS: 2025-05-07T19:50:43.6283948Z 2025-05-07T19:50:43.6284031Z 2025-05-07T19:50:43.6284120Z NVCC_FLAGS: 2025-05-07T19:50:43.6284265Z --expt-relaxed-constexpr 2025-05-07T19:50:43.6284378Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:43.6284489Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:43.6284626Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:43.6284712Z 2025-05-07T19:50:43.6284803Z HIPCC_FLAGS: 2025-05-07T19:50:43.6284807Z 2025-05-07T19:50:43.6284890Z 2025-05-07T19:50:43.6285007Z INCLUDE_DIRS: 2025-05-07T19:50:43.6285126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6285236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.6285371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.6285485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6285761Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.6286135Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.6286306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.6286655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.6287596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.6287836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.6288048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.6288204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.6288599Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.6288691Z 2025-05-07T19:50:43.6288801Z Selected Source Files: 2025-05-07T19:50:43.6288933Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:43.6289107Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:43.6289227Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:43.6289350Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:43.6289499Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:43.6289627Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:43.6289791Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:43.6289982Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:43.6290105Z gen_embedding_backward_split_none.cpp 2025-05-07T19:50:43.6290293Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:43.6290426Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:43.6290614Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:43.6290832Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:43.6291062Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:43.6291290Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:43.6291462Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:43.6291602Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:43.6291782Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:43.6291949Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:43.6292147Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:43.6292345Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:43.6292513Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:43.6292665Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:43.6292815Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:43.6292997Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:43.6293144Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:43.6293301Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:43.6293481Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:43.6293650Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:43.6293864Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:43.6294082Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:43.6294315Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:43.6294530Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:43.6294683Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:43.6294867Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:43.6295108Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:43.6295357Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:43.6295467Z 2025-05-07T19:50:43.6295571Z HIPified Source Files: 2025-05-07T19:50:43.6295576Z 2025-05-07T19:50:43.6295669Z 2025-05-07T19:50:43.6295772Z Library Dependencies: 2025-05-07T19:50:43.6295884Z torch 2025-05-07T19:50:43.6295978Z torch_library 2025-05-07T19:50:43.6296300Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.6296632Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.6296973Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.6297335Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.6297687Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.6297785Z fbgemm_gpu_config 2025-05-07T19:50:43.6297882Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:43.6298107Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.6298224Z 2025-05-07T19:50:43.6298320Z Output Library: 2025-05-07T19:50:43.6298451Z fbgemm_gpu_tbe_training_backward_split_host 2025-05-07T19:50:43.6298559Z 2025-05-07T19:50:43.6298660Z Destination Directory: 2025-05-07T19:50:43.6298752Z fbgemm_gpu 2025-05-07T19:50:43.6298872Z ================================================================================ 2025-05-07T19:50:43.6298880Z 2025-05-07T19:50:43.6298884Z 2025-05-07T19:50:43.6298912Z 2025-05-07T19:50:43.6299030Z ================================================================================ 2025-05-07T19:50:43.6299211Z GPU CPP Library Target: fbgemm_gpu_tbe_index_select (SHARED) 2025-05-07T19:50:43.6299295Z 2025-05-07T19:50:43.6299466Z CPU_SRCS: 2025-05-07T19:50:43.6299709Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:43.6299912Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:43.6300028Z 2025-05-07T19:50:43.6300119Z GPU_SRCS: 2025-05-07T19:50:43.6300321Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:43.6300472Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:43.6300634Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:43.6300783Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:43.6300936Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:43.6301115Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:43.6301261Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:43.6301404Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:43.6301486Z 2025-05-07T19:50:43.6301605Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.6301609Z 2025-05-07T19:50:43.6301696Z 2025-05-07T19:50:43.6301791Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.6301795Z 2025-05-07T19:50:43.6301900Z 2025-05-07T19:50:43.6301989Z OTHER_SRCS: 2025-05-07T19:50:43.6301994Z 2025-05-07T19:50:43.6302077Z 2025-05-07T19:50:43.6302189Z CC_FLAGS: 2025-05-07T19:50:43.6302193Z 2025-05-07T19:50:43.6302276Z 2025-05-07T19:50:43.6302367Z NVCC_FLAGS: 2025-05-07T19:50:43.6302479Z --expt-relaxed-constexpr 2025-05-07T19:50:43.6302608Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:43.6302721Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:43.6302833Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:43.6302945Z 2025-05-07T19:50:43.6303039Z HIPCC_FLAGS: 2025-05-07T19:50:43.6303044Z 2025-05-07T19:50:43.6303131Z 2025-05-07T19:50:43.6303220Z INCLUDE_DIRS: 2025-05-07T19:50:43.6303359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6303467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.6303579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.6303719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6304012Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.6304419Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.6304592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.6304761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.6304923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.6305136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.6305425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.6305577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.6305890Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.6306005Z 2025-05-07T19:50:43.6306104Z Selected Source Files: 2025-05-07T19:50:43.6306365Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:43.6306563Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:43.6306788Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:43.6306931Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:43.6307065Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:43.6307228Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:43.6307378Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:43.6307520Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:43.6307685Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:43.6307825Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:43.6307910Z 2025-05-07T19:50:43.6308011Z HIPified Source Files: 2025-05-07T19:50:43.6308015Z 2025-05-07T19:50:43.6308124Z 2025-05-07T19:50:43.6308229Z Library Dependencies: 2025-05-07T19:50:43.6308317Z torch 2025-05-07T19:50:43.6308433Z torch_library 2025-05-07T19:50:43.6308749Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.6309010Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.6309376Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.6309733Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.6310012Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.6310129Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:43.6310250Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:43.6310469Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.6310556Z 2025-05-07T19:50:43.6310679Z Output Library: 2025-05-07T19:50:43.6310789Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:43.6310877Z 2025-05-07T19:50:43.6310988Z Destination Directory: 2025-05-07T19:50:43.6311110Z fbgemm_gpu 2025-05-07T19:50:43.6311231Z ================================================================================ 2025-05-07T19:50:43.6311236Z 2025-05-07T19:50:43.6311240Z 2025-05-07T19:50:43.6311244Z 2025-05-07T19:50:43.6311367Z ================================================================================ 2025-05-07T19:50:43.6311600Z GPU CPP Library Target: fbgemm_gpu_embedding_inplace_ops (SHARED) 2025-05-07T19:50:43.6311686Z 2025-05-07T19:50:43.6311776Z CPU_SRCS: 2025-05-07T19:50:43.6312090Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:43.6312177Z 2025-05-07T19:50:43.6312262Z GPU_SRCS: 2025-05-07T19:50:43.6312432Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:43.6312604Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:43.6312685Z 2025-05-07T19:50:43.6312777Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.6312781Z 2025-05-07T19:50:43.6312887Z 2025-05-07T19:50:43.6312979Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.6312982Z 2025-05-07T19:50:43.6313063Z 2025-05-07T19:50:43.6313172Z OTHER_SRCS: 2025-05-07T19:50:43.6313176Z 2025-05-07T19:50:43.6313255Z 2025-05-07T19:50:43.6313341Z CC_FLAGS: 2025-05-07T19:50:43.6313345Z 2025-05-07T19:50:43.6313423Z 2025-05-07T19:50:43.6313509Z NVCC_FLAGS: 2025-05-07T19:50:43.6313597Z --expt-relaxed-constexpr 2025-05-07T19:50:43.6313687Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:43.6313790Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:43.6313875Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:43.6313991Z 2025-05-07T19:50:43.6314070Z HIPCC_FLAGS: 2025-05-07T19:50:43.6314074Z 2025-05-07T19:50:43.6314159Z 2025-05-07T19:50:43.6314233Z INCLUDE_DIRS: 2025-05-07T19:50:43.6314329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6314432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.6314526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.6314678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6314940Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.6315307Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.6315438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.6315583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.6315732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.6315918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.6316103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.6316239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.6316521Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.6316590Z 2025-05-07T19:50:43.6316672Z Selected Source Files: 2025-05-07T19:50:43.6316841Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:43.6316999Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:43.6317140Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:43.6317215Z 2025-05-07T19:50:43.6317294Z HIPified Source Files: 2025-05-07T19:50:43.6317298Z 2025-05-07T19:50:43.6317362Z 2025-05-07T19:50:43.6317455Z Library Dependencies: 2025-05-07T19:50:43.6317524Z torch 2025-05-07T19:50:43.6317598Z torch_library 2025-05-07T19:50:43.6317875Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.6318115Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.6318413Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.6318735Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.6318996Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.6319186Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.6319250Z 2025-05-07T19:50:43.6319328Z Output Library: 2025-05-07T19:50:43.6319431Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:43.6319498Z 2025-05-07T19:50:43.6319579Z Destination Directory: 2025-05-07T19:50:43.6319661Z fbgemm_gpu 2025-05-07T19:50:43.6319764Z ================================================================================ 2025-05-07T19:50:43.6319768Z 2025-05-07T19:50:43.6319771Z 2025-05-07T19:50:43.6319778Z 2025-05-07T19:50:43.6319874Z ================================================================================ 2025-05-07T19:50:43.6319996Z GPU CPP Library Target: fbgemm_gpu_py (SHARED) 2025-05-07T19:50:43.6320064Z 2025-05-07T19:50:43.6320135Z CPU_SRCS: 2025-05-07T19:50:43.6320226Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:43.6320324Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:43.6320509Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:43.6320702Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:43.6320893Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:43.6321092Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:43.6321286Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:43.6321516Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:43.6321655Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:43.6321823Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:43.6322119Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:50:43.6322254Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:43.6322387Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:43.6322589Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:43.6322898Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:43.6323019Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:43.6323123Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:43.6323236Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:43.6323337Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:43.6323427Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:43.6323525Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:43.6323645Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:43.6323747Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:43.6323855Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:43.6324109Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:43.6324261Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:43.6324476Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:43.6324715Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:43.6324826Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:43.6324926Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:43.6325023Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:43.6325147Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:43.6325340Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:43.6325427Z src/topology_utils.cpp 2025-05-07T19:50:43.6325509Z 2025-05-07T19:50:43.6325584Z GPU_SRCS: 2025-05-07T19:50:43.6325692Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:43.6325797Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:43.6326031Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:43.6326130Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:43.6326234Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:43.6326439Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:43.6326628Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:43.6326758Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:43.6326894Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:43.6327175Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:43.6327354Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:43.6327531Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:43.6327698Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:43.6327846Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:43.6327984Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:43.6328130Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:50:43.6328261Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:43.6328377Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:43.6328538Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:43.6328695Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:43.6328823Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:43.6328973Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:43.6329124Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:43.6329221Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:43.6329439Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:43.6329639Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:43.6329902Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:43.6330004Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:50:43.6330115Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:43.6330267Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:43.6330400Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:43.6330553Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:43.6330676Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:43.6330806Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:43.6330908Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:43.6331031Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:43.6331182Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:43.6331299Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:43.6331431Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:43.6331586Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:43.6331731Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:43.6331842Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:50:43.6331955Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:43.6332059Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:43.6332176Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:43.6332310Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:43.6332455Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:43.6332562Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:43.6332668Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:43.6332788Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:43.6332907Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:43.6333009Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:43.6333127Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:43.6333247Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:43.6333346Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:43.6333422Z 2025-05-07T19:50:43.6333532Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:43.6333537Z 2025-05-07T19:50:43.6333610Z 2025-05-07T19:50:43.6333699Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:43.6333703Z 2025-05-07T19:50:43.6333780Z 2025-05-07T19:50:43.6333878Z OTHER_SRCS: 2025-05-07T19:50:43.6333883Z 2025-05-07T19:50:43.6333958Z 2025-05-07T19:50:43.6334040Z CC_FLAGS: 2025-05-07T19:50:43.6334047Z 2025-05-07T19:50:43.6334145Z 2025-05-07T19:50:43.6334222Z NVCC_FLAGS: 2025-05-07T19:50:43.6334322Z --expt-relaxed-constexpr 2025-05-07T19:50:43.6334423Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:43.6334550Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:43.6334640Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:43.6334714Z 2025-05-07T19:50:43.6334818Z HIPCC_FLAGS: 2025-05-07T19:50:43.6334822Z 2025-05-07T19:50:43.6334892Z 2025-05-07T19:50:43.6334974Z INCLUDE_DIRS: 2025-05-07T19:50:43.6335081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6335313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:43.6335411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:43.6335507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:43.6335794Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:43.6336161Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:43.6336299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:43.6336475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:43.6336624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:43.6336812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:43.6336995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:43.6337146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:43.6337433Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:43.6337553Z 2025-05-07T19:50:43.6337652Z Selected Source Files: 2025-05-07T19:50:43.6337750Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:43.6337847Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:43.6338042Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:43.6338276Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:43.6338470Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:43.6338675Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:43.6338882Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:43.6339100Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:43.6339243Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:43.6339376Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:43.6339580Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:50:43.6339696Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:43.6340030Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:43.6340136Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:43.6340244Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:43.6340379Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:43.6340498Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:43.6340598Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:43.6340698Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:43.6340899Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:43.6341004Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:43.6341110Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:43.6341223Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:43.6341348Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:43.6341593Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:43.6341756Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:43.6341992Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:43.6342230Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:43.6342339Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:43.6342468Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:43.6342569Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:43.6342690Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:43.6342895Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:43.6343008Z src/topology_utils.cpp 2025-05-07T19:50:43.6343122Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:43.6343236Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:43.6343477Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:43.6343576Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:43.6343683Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:43.6343878Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:43.6344085Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:43.6344216Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:43.6344346Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:43.6344629Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:43.6344810Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:43.6344985Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:43.6345146Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:43.6345299Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:43.6345436Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:43.6345564Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:50:43.6345797Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:43.6345912Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:43.6346067Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:43.6346235Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:43.6346371Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:43.6346563Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:43.6346721Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:43.6346819Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:43.6347047Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:43.6347238Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:43.6347443Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:43.6347554Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:50:43.6347659Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:43.6347810Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:43.6347940Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:43.6348049Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:43.6348157Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:43.6348283Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:43.6348393Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:43.6348519Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:43.6348672Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:43.6348787Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:43.6348924Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:43.6349084Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:43.6349222Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:43.6349331Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:50:43.6349431Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:43.6349561Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:43.6349664Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:43.6349794Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:43.6349942Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:43.6350039Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:43.6350144Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:43.6350244Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:43.6350388Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:43.6350480Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:43.6350600Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:43.6350721Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:43.6350824Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:43.6350901Z 2025-05-07T19:50:43.6350988Z HIPified Source Files: 2025-05-07T19:50:43.6351009Z 2025-05-07T19:50:43.6351092Z 2025-05-07T19:50:43.6351190Z Library Dependencies: 2025-05-07T19:50:43.6351270Z torch 2025-05-07T19:50:43.6351363Z torch_library 2025-05-07T19:50:43.6351681Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:43.6351940Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:43.6352395Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:43.6352723Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:43.6352975Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:43.6353046Z fbgemm 2025-05-07T19:50:43.6353157Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:43.6353256Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:43.6353353Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:43.6353439Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:43.6353532Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:43.6353622Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:43.6353870Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:43.6353954Z 2025-05-07T19:50:43.6354040Z Output Library: 2025-05-07T19:50:43.6354121Z fbgemm_gpu_py 2025-05-07T19:50:43.6354202Z 2025-05-07T19:50:43.6354292Z Destination Directory: 2025-05-07T19:50:43.6354376Z fbgemm_gpu 2025-05-07T19:50:43.6354524Z ================================================================================ 2025-05-07T19:50:43.6354541Z 2025-05-07T19:50:43.6354639Z -- Configuring done (9.1s) 2025-05-07T19:50:43.7600557Z -- Generating done (0.1s) 2025-05-07T19:50:43.7621158Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build 2025-05-07T19:50:43.7780281Z Change Dir: '/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build' 2025-05-07T19:50:43.7780746Z 2025-05-07T19:50:43.7781221Z Run Build Command(s): /github/home/miniconda/envs/build_binary/bin/ninja -v -j 48 install 2025-05-07T19:50:43.9011556Z [1/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:43.9023472Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.9222255Z [2/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:43.9233629Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.9269505Z [3/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:43.9281375Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.9338269Z [4/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:43.9350155Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.9361354Z [5/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:43.9372784Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.9482022Z [6/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:43.9493765Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.9505208Z [7/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:43.9516411Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.9554324Z [8/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:43.9566617Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.9579158Z [9/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:43.9591357Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.9739100Z [10/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:43.9751057Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.9873600Z [11/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:43.9885377Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.0029898Z [12/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:44.0041616Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.0189311Z [13/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:44.0201030Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.0270483Z [14/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:44.0283064Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.0306955Z [15/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:44.0318813Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.0330387Z [16/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:44.0341662Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.0410140Z [17/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:44.0422259Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.0457846Z [18/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:44.0469512Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.0705519Z [19/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:44.0711802Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.0844068Z [20/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:44.0855790Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.0920900Z [21/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:44.0932977Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.1247897Z [22/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:44.1260030Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.1271339Z [23/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:44.1282678Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.1313110Z [24/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:44.1325444Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.1375812Z [25/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:44.1387096Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.1397995Z [26/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:44.1409365Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.1548088Z [27/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:44.1559587Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.1713935Z [28/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:44.1726463Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.1738453Z [29/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:44.1750243Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.1805317Z [30/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:44.1816441Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.1892380Z [31/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:44.1904518Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.1935101Z [32/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:44.1946401Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.2003826Z [33/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:44.2015492Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.2120936Z [34/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:44.2133376Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.2224718Z [35/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:44.2236637Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.2471532Z [36/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:44.2483099Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.2585335Z [37/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:44.2599838Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.2740075Z [38/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:44.2751572Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.2904384Z [39/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:44.2915720Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.3154078Z [40/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:44.3165760Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.3297474Z [41/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:44.3309612Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.3335553Z [42/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:44.3346509Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.3554081Z [43/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:44.3564610Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.4285648Z [44/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:44.4297300Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.4650125Z [45/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:44.4697928Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.4710680Z [46/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:44.4723563Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.4975120Z [47/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:44.4987438Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.5190419Z [48/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:44.5202461Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.5475737Z [49/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:44.5486510Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.5599668Z [50/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:44.5611711Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.5854662Z [51/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:44.5867622Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.6538218Z [52/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:44.6544644Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.6949843Z [53/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:44.6956098Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.8275536Z [54/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:44.8288449Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.9609634Z [55/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:44.9622545Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:45.0129307Z [56/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -mavx512f -mavx512bw -mavx512dq -mavx512vl -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:45.0146492Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:45.0312559Z [57/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:45.0329458Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:45.0599600Z [58/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:45.0611274Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:45.1795002Z [59/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:45.1801370Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:45.2479398Z [60/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:45.2492004Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:45.3593069Z [61/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtils.cc 2025-05-07T19:50:45.3611931Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:45.7513955Z [62/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:45.7526935Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:46.3552660Z [63/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,asmjit.so -o asmjit.so CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:46.4339250Z [64/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -c /__w/FBGEMM/FBGEMM/src/Utils.cc 2025-05-07T19:50:46.4355165Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:46.8656521Z [65/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -c /__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc 2025-05-07T19:50:46.8674908Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:50.3161832Z [66/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -c /__w/FBGEMM/FBGEMM/src/RefImplementations.cc 2025-05-07T19:50:50.3178698Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:50.4860781Z [67/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -c /__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:50.4880031Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:52.3374895Z [68/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:52.3392234Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:52.3883587Z [69/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:52.3901850Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:52.4600728Z [70/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:52.4618575Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:52.4784731Z [71/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:52.4802062Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:52.6937807Z [72/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:52.6957101Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:53.2808096Z [73/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:53.2828698Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:53.9853530Z [74/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:53.9873631Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:54.9059623Z [75/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:54.9077469Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:55.6870383Z [76/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_config_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -MF CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o.d -o CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/config/feature_gates.cpp 2025-05-07T19:50:55.6889242Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:55.9247400Z [77/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc 2025-05-07T19:50:56.2632999Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:56.2649870Z [78/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_config.so -o fbgemm_gpu_config.so CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:56.9440404Z [79/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:56.9459284Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:59.3628705Z [80/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:59.3638674Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:59.6007600Z [81/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:59.6027428Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:01.4447843Z [82/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:51:01.4465706Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:03.4110722Z [83/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:51:03.4130501Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:03.9407528Z [84/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:51:03.9417243Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:04.2514050Z [85/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:51:04.2531883Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:06.6641352Z [86/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:51:06.6658768Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:08.3231725Z [87/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:51:08.3249480Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:10.1079405Z [88/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc 2025-05-07T19:51:10.1096341Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:13.5379803Z [89/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:51:13.5397544Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:14.0129820Z [90/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:51:14.0148428Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:16.1548561Z [91/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:51:16.1566516Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:17.7178761Z [92/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:51:18.9577831Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:18.9595882Z [93/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:51:18.9613943Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:19.1598996Z [94/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:51:19.1616890Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:22.6154186Z [95/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:51:22.6172803Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:23.0345774Z [96/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:51:23.0365893Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:25.4235178Z [97/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:51:25.4254854Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:27.0702555Z [98/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:27.0723068Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:28.3230530Z [99/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:28.3250695Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:28.4751575Z [100/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:28.4772229Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:31.8277876Z [101/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:31.8298246Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:40.4339029Z [102/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:51:40.4358689Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:41.0502906Z [103/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:41.0523486Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:46.7730007Z [104/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o 2025-05-07T19:51:46.7752902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.7755635Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.7756858Z ^ 2025-05-07T19:51:46.7757117Z 2025-05-07T19:51:46.7757586Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:46.7758230Z 2025-05-07T19:51:46.7759941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.7762747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.7764008Z ^ 2025-05-07T19:51:46.7764385Z 2025-05-07T19:51:46.7766162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.7768980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.7770249Z ^ 2025-05-07T19:51:46.7770507Z 2025-05-07T19:51:46.7770985Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:46.7771682Z 2025-05-07T19:51:46.7773457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.7776310Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.7777571Z ^ 2025-05-07T19:51:46.7777964Z 2025-05-07T19:51:46.7779844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.7782690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.7783922Z ^ 2025-05-07T19:51:46.7784192Z 2025-05-07T19:51:46.7784650Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:46.7785345Z 2025-05-07T19:51:46.7787123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.7790247Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.7791500Z ^ 2025-05-07T19:51:46.7791873Z 2025-05-07T19:51:46.7793789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.7796610Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.7797854Z ^ 2025-05-07T19:51:46.7798111Z 2025-05-07T19:51:46.7798559Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:46.7799262Z 2025-05-07T19:51:46.7801021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.7803565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.7804653Z ^ 2025-05-07T19:51:46.7805032Z 2025-05-07T19:51:46.7806560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.7809097Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.7810255Z ^ 2025-05-07T19:51:46.7810526Z 2025-05-07T19:51:46.7810982Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:46.7811681Z 2025-05-07T19:51:46.7813424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.7816271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.7817512Z ^ 2025-05-07T19:51:46.7817880Z 2025-05-07T19:51:46.8590251Z [105/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o 2025-05-07T19:51:46.8612605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.8615360Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.8616532Z ^ 2025-05-07T19:51:46.8616775Z 2025-05-07T19:51:46.8617210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:46.8617864Z 2025-05-07T19:51:46.8619370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.8622398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.8623565Z ^ 2025-05-07T19:51:46.8623965Z 2025-05-07T19:51:46.8625677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.8628305Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.8629435Z ^ 2025-05-07T19:51:46.8629677Z 2025-05-07T19:51:46.8630106Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:46.8630777Z 2025-05-07T19:51:46.8632505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.8635158Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.8636340Z ^ 2025-05-07T19:51:46.8636711Z 2025-05-07T19:51:46.8638229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.8640948Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.8642156Z ^ 2025-05-07T19:51:46.8642375Z 2025-05-07T19:51:46.8642749Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:46.8643404Z 2025-05-07T19:51:46.8645022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.8648109Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.8649600Z ^ 2025-05-07T19:51:46.8650000Z 2025-05-07T19:51:46.8651526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.8654094Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.8655288Z ^ 2025-05-07T19:51:46.8655556Z 2025-05-07T19:51:46.8656015Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:46.8656628Z 2025-05-07T19:51:46.8658313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.8661191Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.8662384Z ^ 2025-05-07T19:51:46.8662746Z 2025-05-07T19:51:46.8664447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.8666987Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.8668114Z ^ 2025-05-07T19:51:46.8668341Z 2025-05-07T19:51:46.8668766Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:46.8669437Z 2025-05-07T19:51:46.8671156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.8673858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.8675093Z ^ 2025-05-07T19:51:46.8675479Z 2025-05-07T19:51:46.9879033Z [106/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o 2025-05-07T19:51:46.9899406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.9901863Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.9902906Z ^ 2025-05-07T19:51:46.9903144Z 2025-05-07T19:51:46.9903567Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:46.9904165Z 2025-05-07T19:51:46.9905618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.9908117Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.9909325Z ^ 2025-05-07T19:51:46.9909654Z 2025-05-07T19:51:46.9911130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.9913541Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.9914560Z ^ 2025-05-07T19:51:46.9914797Z 2025-05-07T19:51:46.9915222Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:46.9915844Z 2025-05-07T19:51:46.9917384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.9919808Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.9920923Z ^ 2025-05-07T19:51:46.9921217Z 2025-05-07T19:51:46.9922836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.9924993Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.9925913Z ^ 2025-05-07T19:51:46.9926138Z 2025-05-07T19:51:46.9926945Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:46.9927563Z 2025-05-07T19:51:46.9929032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.9934014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.9935266Z ^ 2025-05-07T19:51:46.9935597Z 2025-05-07T19:51:46.9937041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.9939150Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.9940351Z ^ 2025-05-07T19:51:46.9940592Z 2025-05-07T19:51:46.9940987Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:46.9941605Z 2025-05-07T19:51:46.9943138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.9945561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.9946613Z ^ 2025-05-07T19:51:46.9946914Z 2025-05-07T19:51:46.9948452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.9950912Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.9951929Z ^ 2025-05-07T19:51:46.9952161Z 2025-05-07T19:51:46.9952583Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:46.9953161Z 2025-05-07T19:51:46.9954629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:46.9957073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:46.9958132Z ^ 2025-05-07T19:51:46.9958459Z 2025-05-07T19:51:47.3497483Z [107/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o 2025-05-07T19:51:47.3517956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.3520462Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.3521569Z ^ 2025-05-07T19:51:47.3521803Z 2025-05-07T19:51:47.3522496Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:47.3523105Z 2025-05-07T19:51:47.3524691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.3527083Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.3528041Z ^ 2025-05-07T19:51:47.3528383Z 2025-05-07T19:51:47.3529912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.3532381Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.3533471Z ^ 2025-05-07T19:51:47.3533708Z 2025-05-07T19:51:47.3534131Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:47.3534741Z 2025-05-07T19:51:47.3536262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.3538751Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.3539990Z ^ 2025-05-07T19:51:47.3540334Z 2025-05-07T19:51:47.3541888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.3544246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.3545635Z ^ 2025-05-07T19:51:47.3545874Z 2025-05-07T19:51:47.3546276Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:47.3546789Z 2025-05-07T19:51:47.3548485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.3550669Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.3551743Z ^ 2025-05-07T19:51:47.3552090Z 2025-05-07T19:51:47.3553614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.3556159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.3557224Z ^ 2025-05-07T19:51:47.3557481Z 2025-05-07T19:51:47.3557912Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:47.3558556Z 2025-05-07T19:51:47.3560120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.3562483Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.3563475Z ^ 2025-05-07T19:51:47.3563810Z 2025-05-07T19:51:47.3565360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.3567855Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.3568915Z ^ 2025-05-07T19:51:47.3569136Z 2025-05-07T19:51:47.3569543Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:47.3570173Z 2025-05-07T19:51:47.3571708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.3574294Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.3575435Z ^ 2025-05-07T19:51:47.3575777Z 2025-05-07T19:51:47.5266885Z [108/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o 2025-05-07T19:51:47.5289967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.5292672Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.5293862Z ^ 2025-05-07T19:51:47.5294121Z 2025-05-07T19:51:47.5294588Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:47.5295222Z 2025-05-07T19:51:47.5296974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.5299812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.5300998Z ^ 2025-05-07T19:51:47.5301355Z 2025-05-07T19:51:47.5303022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.5305684Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.5306916Z ^ 2025-05-07T19:51:47.5307185Z 2025-05-07T19:51:47.5307648Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:47.5308344Z 2025-05-07T19:51:47.5310019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.5312698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.5313896Z ^ 2025-05-07T19:51:47.5314254Z 2025-05-07T19:51:47.5315887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.5318807Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.5320016Z ^ 2025-05-07T19:51:47.5320253Z 2025-05-07T19:51:47.5320782Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:47.5321461Z 2025-05-07T19:51:47.5323171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.5325820Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.5327000Z ^ 2025-05-07T19:51:47.5327376Z 2025-05-07T19:51:47.5329021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.5331675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.5332849Z ^ 2025-05-07T19:51:47.5333128Z 2025-05-07T19:51:47.5333579Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:47.5334275Z 2025-05-07T19:51:47.5335973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.5338730Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.5340055Z ^ 2025-05-07T19:51:47.5340431Z 2025-05-07T19:51:47.5342095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.5344872Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.5346123Z ^ 2025-05-07T19:51:47.5346390Z 2025-05-07T19:51:47.5346855Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:47.5347596Z 2025-05-07T19:51:47.5349208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.5351936Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.5353139Z ^ 2025-05-07T19:51:47.5353535Z 2025-05-07T19:51:48.0595213Z [109/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o 2025-05-07T19:51:48.0618660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.0621517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.0622934Z ^ 2025-05-07T19:51:48.0623182Z 2025-05-07T19:51:48.0623649Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:48.0624308Z 2025-05-07T19:51:48.0626000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.0628786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.0630011Z ^ 2025-05-07T19:51:48.0630371Z 2025-05-07T19:51:48.0632045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.0634759Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.0635995Z ^ 2025-05-07T19:51:48.0636254Z 2025-05-07T19:51:48.0636716Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:48.0637396Z 2025-05-07T19:51:48.0639090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.0641846Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.0643091Z ^ 2025-05-07T19:51:48.0643779Z 2025-05-07T19:51:48.0645496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.0648369Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.0649611Z ^ 2025-05-07T19:51:48.0649872Z 2025-05-07T19:51:48.0650352Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:48.0651045Z 2025-05-07T19:51:48.0652774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.0655545Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.0656830Z ^ 2025-05-07T19:51:48.0657215Z 2025-05-07T19:51:48.0658919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.0661849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.0663096Z ^ 2025-05-07T19:51:48.0663387Z 2025-05-07T19:51:48.0663860Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:48.0664537Z 2025-05-07T19:51:48.0666255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.0669030Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.0670275Z ^ 2025-05-07T19:51:48.0670656Z 2025-05-07T19:51:48.0672473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.0675207Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.0676472Z ^ 2025-05-07T19:51:48.0676731Z 2025-05-07T19:51:48.0677202Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:48.0677934Z 2025-05-07T19:51:48.0679627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.0682350Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.0683580Z ^ 2025-05-07T19:51:48.0683977Z 2025-05-07T19:51:48.5807469Z [110/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o 2025-05-07T19:51:48.5831104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.5833936Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.5835155Z ^ 2025-05-07T19:51:48.5835436Z 2025-05-07T19:51:48.5835887Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:48.5836598Z 2025-05-07T19:51:48.5837970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.5840618Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.5841796Z ^ 2025-05-07T19:51:48.5842189Z 2025-05-07T19:51:48.5843931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.5846651Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.5847852Z ^ 2025-05-07T19:51:48.5848101Z 2025-05-07T19:51:48.5848574Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:48.5849239Z 2025-05-07T19:51:48.5850923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.5853666Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.5855183Z ^ 2025-05-07T19:51:48.5855551Z 2025-05-07T19:51:48.5857319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.5860176Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.5861337Z ^ 2025-05-07T19:51:48.5861629Z 2025-05-07T19:51:48.5862071Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:48.5862727Z 2025-05-07T19:51:48.5864482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.5867191Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.5868420Z ^ 2025-05-07T19:51:48.5868769Z 2025-05-07T19:51:48.5870511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.5873259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.5874481Z ^ 2025-05-07T19:51:48.5874687Z 2025-05-07T19:51:48.5875046Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:48.5875652Z 2025-05-07T19:51:48.5877089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.5879819Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.5881012Z ^ 2025-05-07T19:51:48.5881398Z 2025-05-07T19:51:48.5883049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.5885821Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.5887004Z ^ 2025-05-07T19:51:48.5887278Z 2025-05-07T19:51:48.5887749Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:48.5888427Z 2025-05-07T19:51:48.5890030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:48.5893098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:48.5894108Z ^ 2025-05-07T19:51:48.5894470Z 2025-05-07T19:51:48.6002901Z [111/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o 2025-05-07T19:51:50.3799381Z [112/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o 2025-05-07T19:51:50.3822306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.3825058Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.3826265Z ^ 2025-05-07T19:51:50.3826529Z 2025-05-07T19:51:50.3827006Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:50.3827688Z 2025-05-07T19:51:50.3829360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.3832124Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.3833334Z ^ 2025-05-07T19:51:50.3833730Z 2025-05-07T19:51:50.3835418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.3838097Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.3839244Z ^ 2025-05-07T19:51:50.3839525Z 2025-05-07T19:51:50.3839986Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:50.3840670Z 2025-05-07T19:51:50.3842362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.3845044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.3846259Z ^ 2025-05-07T19:51:50.3846625Z 2025-05-07T19:51:50.3848295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.3850955Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.3852136Z ^ 2025-05-07T19:51:50.3852391Z 2025-05-07T19:51:50.3852820Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:50.3853482Z 2025-05-07T19:51:50.3855162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.3857822Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.3858945Z ^ 2025-05-07T19:51:50.3859316Z 2025-05-07T19:51:50.3861039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.3864070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.3865236Z ^ 2025-05-07T19:51:50.3865502Z 2025-05-07T19:51:50.3865924Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:50.3866555Z 2025-05-07T19:51:50.3868376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.3871046Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.3872239Z ^ 2025-05-07T19:51:50.3872603Z 2025-05-07T19:51:50.3874258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.3876950Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.3878112Z ^ 2025-05-07T19:51:50.3878363Z 2025-05-07T19:51:50.3878790Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:50.3879473Z 2025-05-07T19:51:50.3881073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.3883736Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.3884904Z ^ 2025-05-07T19:51:50.3885265Z 2025-05-07T19:51:50.5233861Z [113/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o 2025-05-07T19:51:50.5258622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.5261520Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.5262717Z ^ 2025-05-07T19:51:50.5262994Z 2025-05-07T19:51:50.5263526Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:50.5264229Z 2025-05-07T19:51:50.5265974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.5268770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.5269907Z ^ 2025-05-07T19:51:50.5270253Z 2025-05-07T19:51:50.5271882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.5274623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.5275813Z ^ 2025-05-07T19:51:50.5276063Z 2025-05-07T19:51:50.5276518Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:50.5277198Z 2025-05-07T19:51:50.5278913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.5281652Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.5282868Z ^ 2025-05-07T19:51:50.5283248Z 2025-05-07T19:51:50.5284953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.5287705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.5288892Z ^ 2025-05-07T19:51:50.5289148Z 2025-05-07T19:51:50.5289636Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:50.5290316Z 2025-05-07T19:51:50.5292006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.5294715Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.5295938Z ^ 2025-05-07T19:51:50.5296311Z 2025-05-07T19:51:50.5298246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.5301423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.5302674Z ^ 2025-05-07T19:51:50.5302932Z 2025-05-07T19:51:50.5303409Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:50.5304119Z 2025-05-07T19:51:50.5305893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.5308602Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.5309817Z ^ 2025-05-07T19:51:50.5310191Z 2025-05-07T19:51:50.5311915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.5314721Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.5315929Z ^ 2025-05-07T19:51:50.5316197Z 2025-05-07T19:51:50.5316646Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:50.5317323Z 2025-05-07T19:51:50.5319077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.5321874Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.5323325Z ^ 2025-05-07T19:51:50.5323695Z 2025-05-07T19:51:58.9928563Z [114/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o 2025-05-07T19:51:58.9949745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.9952310Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.9953460Z ^ 2025-05-07T19:51:58.9953716Z 2025-05-07T19:51:58.9954149Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.9954763Z 2025-05-07T19:51:58.9956382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.9958975Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.9960108Z ^ 2025-05-07T19:51:58.9960429Z 2025-05-07T19:51:58.9961986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.9964500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.9965620Z ^ 2025-05-07T19:51:58.9965867Z 2025-05-07T19:51:58.9966364Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.9966984Z 2025-05-07T19:51:58.9968536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.9971001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.9972052Z ^ 2025-05-07T19:51:58.9972398Z 2025-05-07T19:51:58.9973942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.9976568Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.9977644Z ^ 2025-05-07T19:51:58.9977898Z 2025-05-07T19:51:58.9978323Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.9978962Z 2025-05-07T19:51:58.9980643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.9983198Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.9984633Z ^ 2025-05-07T19:51:58.9984978Z 2025-05-07T19:51:58.9986742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.9989257Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.9990342Z ^ 2025-05-07T19:51:58.9990575Z 2025-05-07T19:51:58.9990997Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.9991642Z 2025-05-07T19:51:58.9993205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.9995778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.9996902Z ^ 2025-05-07T19:51:58.9997255Z 2025-05-07T19:51:58.9998828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.0001336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.0002421Z ^ 2025-05-07T19:51:59.0002684Z 2025-05-07T19:51:59.0003103Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:59.0003702Z 2025-05-07T19:51:59.0005274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.0007854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.0009015Z ^ 2025-05-07T19:51:59.0009354Z 2025-05-07T19:51:59.8214561Z [115/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o 2025-05-07T19:51:59.8230727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.8232596Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.8233390Z ^ 2025-05-07T19:51:59.8233591Z 2025-05-07T19:51:59.8233914Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:59.8234364Z 2025-05-07T19:51:59.8235497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.8237293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.8238120Z ^ 2025-05-07T19:51:59.8238383Z 2025-05-07T19:51:59.8239508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.8241307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.8242117Z ^ 2025-05-07T19:51:59.8242286Z 2025-05-07T19:51:59.8242585Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:59.8243045Z 2025-05-07T19:51:59.8244238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.8246353Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.8247323Z ^ 2025-05-07T19:51:59.8247623Z 2025-05-07T19:51:59.8248928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.8251025Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.8251961Z ^ 2025-05-07T19:51:59.8252190Z 2025-05-07T19:51:59.8252542Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:59.8253059Z 2025-05-07T19:51:59.8254379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.8256836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.8257768Z ^ 2025-05-07T19:51:59.8258048Z 2025-05-07T19:51:59.8259700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.8261804Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.8262724Z ^ 2025-05-07T19:51:59.8262920Z 2025-05-07T19:51:59.8263267Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:59.8263794Z 2025-05-07T19:51:59.8265091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.8267190Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.8268125Z ^ 2025-05-07T19:51:59.8268417Z 2025-05-07T19:51:59.8269700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.8271781Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.8272678Z ^ 2025-05-07T19:51:59.8272876Z 2025-05-07T19:51:59.8273237Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:59.8273768Z 2025-05-07T19:51:59.8275066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.8277197Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.8278142Z ^ 2025-05-07T19:51:59.8278421Z 2025-05-07T19:52:03.1441017Z [116/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc 2025-05-07T19:52:03.1458969Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:03.7944316Z [117/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm.so -o fbgemm.so CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so asmjit.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T19:52:04.4276349Z [118/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_common.so -o fbgemm_gpu_tbe_common.so CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T19:52:04.4558826Z [119/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o 2025-05-07T19:52:04.4581823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4584552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4585750Z ^ 2025-05-07T19:52:04.4586010Z 2025-05-07T19:52:04.4586466Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.4587156Z 2025-05-07T19:52:04.4588852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4591592Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4592812Z ^ 2025-05-07T19:52:04.4593166Z 2025-05-07T19:52:04.4594748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4597325Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4598441Z ^ 2025-05-07T19:52:04.4598679Z 2025-05-07T19:52:04.4599091Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.4599766Z 2025-05-07T19:52:04.4601289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4603860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4604936Z ^ 2025-05-07T19:52:04.4605255Z 2025-05-07T19:52:04.4606712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4609078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4610227Z ^ 2025-05-07T19:52:04.4610498Z 2025-05-07T19:52:04.4610910Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.4611500Z 2025-05-07T19:52:04.4613021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4615532Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4616511Z ^ 2025-05-07T19:52:04.4616810Z 2025-05-07T19:52:04.4618095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4620559Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4621926Z ^ 2025-05-07T19:52:04.4622409Z 2025-05-07T19:52:04.4622935Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.4623595Z 2025-05-07T19:52:04.4625416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4628170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4629381Z ^ 2025-05-07T19:52:04.4629762Z 2025-05-07T19:52:04.4631434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4634260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4635423Z ^ 2025-05-07T19:52:04.4635645Z 2025-05-07T19:52:04.4636077Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.4636696Z 2025-05-07T19:52:04.4638232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4640707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4641822Z ^ 2025-05-07T19:52:04.4642156Z 2025-05-07T19:52:07.4956749Z [120/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o 2025-05-07T19:52:07.4978334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.4981079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.4982466Z ^ 2025-05-07T19:52:07.4982701Z 2025-05-07T19:52:07.4983132Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:07.4983842Z 2025-05-07T19:52:07.4985449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.4988084Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.4989172Z ^ 2025-05-07T19:52:07.4989525Z 2025-05-07T19:52:07.4991096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.4993663Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.4994811Z ^ 2025-05-07T19:52:07.4995051Z 2025-05-07T19:52:07.4995499Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:07.4996150Z 2025-05-07T19:52:07.4997776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.5000318Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.5001459Z ^ 2025-05-07T19:52:07.5001820Z 2025-05-07T19:52:07.5003405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.5006034Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.5007110Z ^ 2025-05-07T19:52:07.5007342Z 2025-05-07T19:52:07.5007777Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:07.5008396Z 2025-05-07T19:52:07.5010007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.5012623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.5013732Z ^ 2025-05-07T19:52:07.5014079Z 2025-05-07T19:52:07.5015752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.5018646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.5019967Z ^ 2025-05-07T19:52:07.5020215Z 2025-05-07T19:52:07.5020856Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:07.5021511Z 2025-05-07T19:52:07.5023417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.5026041Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.5027176Z ^ 2025-05-07T19:52:07.5027554Z 2025-05-07T19:52:07.5029123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.5031789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.5032962Z ^ 2025-05-07T19:52:07.5033230Z 2025-05-07T19:52:07.5033654Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:07.5034278Z 2025-05-07T19:52:07.5035900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.5038446Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.5039626Z ^ 2025-05-07T19:52:07.5039991Z 2025-05-07T19:52:11.0438123Z [121/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o 2025-05-07T19:52:11.0461409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.0464110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.0465244Z ^ 2025-05-07T19:52:11.0465483Z 2025-05-07T19:52:11.0465900Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:11.0466557Z 2025-05-07T19:52:11.0468148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.0470728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.0471855Z ^ 2025-05-07T19:52:11.0472238Z 2025-05-07T19:52:11.0473809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.0476436Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.0477618Z ^ 2025-05-07T19:52:11.0477870Z 2025-05-07T19:52:11.0478287Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:11.0478914Z 2025-05-07T19:52:11.0480495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.0483026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.0484190Z ^ 2025-05-07T19:52:11.0484529Z 2025-05-07T19:52:11.0486133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.0488672Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.0489844Z ^ 2025-05-07T19:52:11.0490089Z 2025-05-07T19:52:11.0490523Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:11.0491227Z 2025-05-07T19:52:11.0492917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.0495471Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.0496836Z ^ 2025-05-07T19:52:11.0497182Z 2025-05-07T19:52:11.0498875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.0501549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.0502655Z ^ 2025-05-07T19:52:11.0515195Z 2025-05-07T19:52:11.0515741Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:11.0516419Z 2025-05-07T19:52:11.0518058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.0520695Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.0521839Z ^ 2025-05-07T19:52:11.0522387Z 2025-05-07T19:52:11.0523969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.0526521Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.0527672Z ^ 2025-05-07T19:52:11.0527914Z 2025-05-07T19:52:11.0528339Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:11.0528975Z 2025-05-07T19:52:11.0530597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.0533298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.0534423Z ^ 2025-05-07T19:52:11.0534761Z 2025-05-07T19:52:12.6818809Z [122/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o 2025-05-07T19:52:12.6841554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6844179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6845264Z ^ 2025-05-07T19:52:12.6845515Z 2025-05-07T19:52:12.6845945Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:12.6846591Z 2025-05-07T19:52:12.6848200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6850745Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6851903Z ^ 2025-05-07T19:52:12.6852232Z 2025-05-07T19:52:12.6853843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6856436Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6857537Z ^ 2025-05-07T19:52:12.6857782Z 2025-05-07T19:52:12.6858222Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:12.6858841Z 2025-05-07T19:52:12.6860508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6863096Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6864201Z ^ 2025-05-07T19:52:12.6864556Z 2025-05-07T19:52:12.6866133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6868671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6869820Z ^ 2025-05-07T19:52:12.6870073Z 2025-05-07T19:52:12.6870503Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:12.6871546Z 2025-05-07T19:52:12.6873083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6875884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6877013Z ^ 2025-05-07T19:52:12.6877356Z 2025-05-07T19:52:12.6878955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6881522Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6882681Z ^ 2025-05-07T19:52:12.6882906Z 2025-05-07T19:52:12.6883351Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:12.6883965Z 2025-05-07T19:52:12.6885459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6887995Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6889155Z ^ 2025-05-07T19:52:12.6889511Z 2025-05-07T19:52:12.6891079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6893441Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6894613Z ^ 2025-05-07T19:52:12.6894858Z 2025-05-07T19:52:12.6895258Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:12.6895887Z 2025-05-07T19:52:12.6897419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6900059Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6901167Z ^ 2025-05-07T19:52:12.6901499Z 2025-05-07T19:52:13.2775240Z [123/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o 2025-05-07T19:52:13.2796419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.2798907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.2800023Z ^ 2025-05-07T19:52:13.2800260Z 2025-05-07T19:52:13.2800700Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.2801323Z 2025-05-07T19:52:13.2802919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.2805552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.2806632Z ^ 2025-05-07T19:52:13.2806963Z 2025-05-07T19:52:13.2808460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.2811074Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.2812260Z ^ 2025-05-07T19:52:13.2812503Z 2025-05-07T19:52:13.2812903Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.2813486Z 2025-05-07T19:52:13.2814913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.2817253Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.2818291Z ^ 2025-05-07T19:52:13.2818600Z 2025-05-07T19:52:13.2820432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.2823230Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.2824766Z ^ 2025-05-07T19:52:13.2825021Z 2025-05-07T19:52:13.2825389Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.2825980Z 2025-05-07T19:52:13.2827657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.2830137Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.2831290Z ^ 2025-05-07T19:52:13.2831607Z 2025-05-07T19:52:13.2833143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.2835497Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.2836558Z ^ 2025-05-07T19:52:13.2836807Z 2025-05-07T19:52:13.2837228Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.2837815Z 2025-05-07T19:52:13.2839428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.2841853Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.2842952Z ^ 2025-05-07T19:52:13.2843298Z 2025-05-07T19:52:13.2844872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.2847106Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.2848146Z ^ 2025-05-07T19:52:13.2848379Z 2025-05-07T19:52:13.2848713Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.2849239Z 2025-05-07T19:52:13.2850760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.2853302Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.2854347Z ^ 2025-05-07T19:52:13.2854713Z 2025-05-07T19:52:13.3171298Z [124/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cu -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o 2025-05-07T19:52:13.3184662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.3186102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.3186788Z ^ 2025-05-07T19:52:13.3186928Z 2025-05-07T19:52:13.3187188Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.3187545Z 2025-05-07T19:52:13.3188412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.3189812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.3190446Z ^ 2025-05-07T19:52:13.3190639Z 2025-05-07T19:52:13.3191486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.3192912Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.3193536Z ^ 2025-05-07T19:52:13.3193674Z 2025-05-07T19:52:13.3193919Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.3194283Z 2025-05-07T19:52:13.3195147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.3196553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.3197170Z ^ 2025-05-07T19:52:13.3197378Z 2025-05-07T19:52:13.3198219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.3199603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.3200372Z ^ 2025-05-07T19:52:13.3200507Z 2025-05-07T19:52:13.3200759Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.3201111Z 2025-05-07T19:52:13.3202051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.3203449Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.3204075Z ^ 2025-05-07T19:52:13.3204266Z 2025-05-07T19:52:13.3205120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.3206504Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.3207121Z ^ 2025-05-07T19:52:13.3207257Z 2025-05-07T19:52:13.3207490Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.3207859Z 2025-05-07T19:52:13.3208718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.3210103Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.3210721Z ^ 2025-05-07T19:52:13.3210912Z 2025-05-07T19:52:13.3211768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.3213129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.3213750Z ^ 2025-05-07T19:52:13.3213885Z 2025-05-07T19:52:13.3214132Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.3214474Z 2025-05-07T19:52:13.3215334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.3216717Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.3217350Z ^ 2025-05-07T19:52:13.3217539Z 2025-05-07T19:52:13.4175278Z [125/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o 2025-05-07T19:52:13.4198168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.4200670Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.4201857Z ^ 2025-05-07T19:52:13.4202104Z 2025-05-07T19:52:13.4202511Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.4203101Z 2025-05-07T19:52:13.4204612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.4207178Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.4208338Z ^ 2025-05-07T19:52:13.4208671Z 2025-05-07T19:52:13.4210215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.4212735Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.4213883Z ^ 2025-05-07T19:52:13.4214128Z 2025-05-07T19:52:13.4214553Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.4215181Z 2025-05-07T19:52:13.4216861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.4219673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.4220911Z ^ 2025-05-07T19:52:13.4221262Z 2025-05-07T19:52:13.4223167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.4226147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.4227400Z ^ 2025-05-07T19:52:13.4227654Z 2025-05-07T19:52:13.4228076Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.4228780Z 2025-05-07T19:52:13.4230406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.4233159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.4234350Z ^ 2025-05-07T19:52:13.4234682Z 2025-05-07T19:52:13.4236411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.4239120Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.4240229Z ^ 2025-05-07T19:52:13.4240468Z 2025-05-07T19:52:13.4240938Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.4241663Z 2025-05-07T19:52:13.4243254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.4246165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.4247335Z ^ 2025-05-07T19:52:13.4247736Z 2025-05-07T19:52:13.4249448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.4252243Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.4253380Z ^ 2025-05-07T19:52:13.4253618Z 2025-05-07T19:52:13.4254105Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.4254839Z 2025-05-07T19:52:13.4256551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.4259033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.4260356Z ^ 2025-05-07T19:52:13.4260770Z 2025-05-07T19:52:44.3938902Z [126/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o 2025-05-07T19:52:44.3962174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.3964958Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.3966180Z ^ 2025-05-07T19:52:44.3966424Z 2025-05-07T19:52:44.3966888Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:44.3967574Z 2025-05-07T19:52:44.3969245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.3971945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.3973136Z ^ 2025-05-07T19:52:44.3973499Z 2025-05-07T19:52:44.3975223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.3977903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.3979123Z ^ 2025-05-07T19:52:44.3979387Z 2025-05-07T19:52:44.3979961Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:44.3980575Z 2025-05-07T19:52:44.3982323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.3984960Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.3986338Z ^ 2025-05-07T19:52:44.3986674Z 2025-05-07T19:52:44.3988461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.3991154Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.3992352Z ^ 2025-05-07T19:52:44.3992598Z 2025-05-07T19:52:44.3993055Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:44.3993735Z 2025-05-07T19:52:44.3995353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.3998168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.3999400Z ^ 2025-05-07T19:52:44.3999770Z 2025-05-07T19:52:44.4001528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.4004030Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.4005257Z ^ 2025-05-07T19:52:44.4005515Z 2025-05-07T19:52:44.4005988Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:44.4006675Z 2025-05-07T19:52:44.4008414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.4011230Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.4012478Z ^ 2025-05-07T19:52:44.4012848Z 2025-05-07T19:52:44.4014589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.4017161Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.4018279Z ^ 2025-05-07T19:52:44.4018531Z 2025-05-07T19:52:44.4018991Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:44.4019859Z 2025-05-07T19:52:44.4021608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.4024657Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.4025894Z ^ 2025-05-07T19:52:44.4026265Z 2025-05-07T19:52:45.0497191Z [127/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_cache.so -o fbgemm_gpu_tbe_cache.so CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:52:46.2819625Z [128/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o 2025-05-07T19:52:46.2843911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.2846638Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.2847822Z ^ 2025-05-07T19:52:46.2848092Z 2025-05-07T19:52:46.2848575Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:46.2849251Z 2025-05-07T19:52:46.2850951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.2853723Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.2854938Z ^ 2025-05-07T19:52:46.2855298Z 2025-05-07T19:52:46.2857007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.2859913Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.2861131Z ^ 2025-05-07T19:52:46.2861379Z 2025-05-07T19:52:46.2861828Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:46.2862518Z 2025-05-07T19:52:46.2864225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.2866979Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.2868177Z ^ 2025-05-07T19:52:46.2868545Z 2025-05-07T19:52:46.2870164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.2873143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.2874314Z ^ 2025-05-07T19:52:46.2874562Z 2025-05-07T19:52:46.2874986Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:46.2875782Z 2025-05-07T19:52:46.2877486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.2880128Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.2881293Z ^ 2025-05-07T19:52:46.2881633Z 2025-05-07T19:52:46.2883282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.2886005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.2887185Z ^ 2025-05-07T19:52:46.2887442Z 2025-05-07T19:52:46.2887884Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:46.2888561Z 2025-05-07T19:52:46.2890242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.2892901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.2894010Z ^ 2025-05-07T19:52:46.2894355Z 2025-05-07T19:52:46.2895902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.2898406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.2899664Z ^ 2025-05-07T19:52:46.2899902Z 2025-05-07T19:52:46.2900307Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:46.2900901Z 2025-05-07T19:52:46.2902310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.2904567Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.2905678Z ^ 2025-05-07T19:52:46.2905998Z 2025-05-07T19:52:46.8459326Z [129/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_optimizers.so -o fbgemm_gpu_tbe_optimizers.so CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:52:47.5552869Z [130/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:47.5575473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.5578422Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.5579705Z ^ 2025-05-07T19:52:47.5579929Z 2025-05-07T19:52:47.5580289Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:47.5580849Z 2025-05-07T19:52:47.5582331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.5584612Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.5585654Z ^ 2025-05-07T19:52:47.5585946Z 2025-05-07T19:52:47.5587356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.5589698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.5590752Z ^ 2025-05-07T19:52:47.5590978Z 2025-05-07T19:52:47.5591361Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:47.5591952Z 2025-05-07T19:52:47.5593650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.5596053Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.5597126Z ^ 2025-05-07T19:52:47.5597488Z 2025-05-07T19:52:47.5598995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.5601704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.5602879Z ^ 2025-05-07T19:52:47.5603146Z 2025-05-07T19:52:47.5603598Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:47.5604298Z 2025-05-07T19:52:47.5606017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.5608565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.5609605Z ^ 2025-05-07T19:52:47.5609964Z 2025-05-07T19:52:47.5611656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.5614383Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.5615581Z ^ 2025-05-07T19:52:47.5616089Z 2025-05-07T19:52:47.5616545Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:47.5617216Z 2025-05-07T19:52:47.5618969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.5621761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.5623396Z ^ 2025-05-07T19:52:47.5623767Z 2025-05-07T19:52:47.5625475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.5628182Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.5629317Z ^ 2025-05-07T19:52:47.5629551Z 2025-05-07T19:52:47.5629927Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:47.5630564Z 2025-05-07T19:52:47.5632272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.5635048Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.5636268Z ^ 2025-05-07T19:52:47.5636635Z 2025-05-07T19:52:56.2988296Z [131/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o 2025-05-07T19:52:56.3010167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.3012684Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.3013703Z ^ 2025-05-07T19:52:56.3013974Z 2025-05-07T19:52:56.3014412Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:56.3015032Z 2025-05-07T19:52:56.3016723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.3019298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.3020488Z ^ 2025-05-07T19:52:56.3020850Z 2025-05-07T19:52:56.3022742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.3025363Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.3026433Z ^ 2025-05-07T19:52:56.3026674Z 2025-05-07T19:52:56.3027101Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:56.3027716Z 2025-05-07T19:52:56.3029399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.3032066Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.3033150Z ^ 2025-05-07T19:52:56.3033525Z 2025-05-07T19:52:56.3035172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.3037795Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.3038867Z ^ 2025-05-07T19:52:56.3039095Z 2025-05-07T19:52:56.3039526Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:56.3040156Z 2025-05-07T19:52:56.3041768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.3044267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.3045403Z ^ 2025-05-07T19:52:56.3045727Z 2025-05-07T19:52:56.3047158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.3049805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.3051308Z ^ 2025-05-07T19:52:56.3051517Z 2025-05-07T19:52:56.3051884Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:56.3052487Z 2025-05-07T19:52:56.3054192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.3056743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.3057907Z ^ 2025-05-07T19:52:56.3058283Z 2025-05-07T19:52:56.3060012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.3062584Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.3063633Z ^ 2025-05-07T19:52:56.3063859Z 2025-05-07T19:52:56.3064283Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:56.3064899Z 2025-05-07T19:52:56.3066555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.3069234Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.3070293Z ^ 2025-05-07T19:52:56.3070629Z 2025-05-07T19:53:01.4374486Z [132/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o 2025-05-07T19:53:01.4392304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.4394136Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.4394988Z ^ 2025-05-07T19:53:01.4395206Z 2025-05-07T19:53:01.4395548Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:01.4396038Z 2025-05-07T19:53:01.4397313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.4399446Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.4400425Z ^ 2025-05-07T19:53:01.4400718Z 2025-05-07T19:53:01.4402115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.4404246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.4405249Z ^ 2025-05-07T19:53:01.4405460Z 2025-05-07T19:53:01.4405844Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:01.4406371Z 2025-05-07T19:53:01.4407699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.4409643Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.4410479Z ^ 2025-05-07T19:53:01.4410761Z 2025-05-07T19:53:01.4412014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.4413951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.4414810Z ^ 2025-05-07T19:53:01.4415033Z 2025-05-07T19:53:01.4415371Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:01.4415889Z 2025-05-07T19:53:01.4417228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.4419322Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.4420384Z ^ 2025-05-07T19:53:01.4420665Z 2025-05-07T19:53:01.4422230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.4424552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.4425385Z ^ 2025-05-07T19:53:01.4425728Z 2025-05-07T19:53:01.4426054Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:01.4426533Z 2025-05-07T19:53:01.4427707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.4429606Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.4430447Z ^ 2025-05-07T19:53:01.4430717Z 2025-05-07T19:53:01.4431905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.4434049Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.4435032Z ^ 2025-05-07T19:53:01.4435229Z 2025-05-07T19:53:01.4435609Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:01.4436169Z 2025-05-07T19:53:01.4437364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.4439281Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.4440157Z ^ 2025-05-07T19:53:01.4440435Z 2025-05-07T19:53:02.9251998Z [133/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o 2025-05-07T19:53:02.9276449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.9279371Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.9280618Z ^ 2025-05-07T19:53:02.9280885Z 2025-05-07T19:53:02.9281377Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:02.9282083Z 2025-05-07T19:53:02.9283870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.9286754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.9288040Z ^ 2025-05-07T19:53:02.9288426Z 2025-05-07T19:53:02.9290169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.9293041Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.9294322Z ^ 2025-05-07T19:53:02.9294599Z 2025-05-07T19:53:02.9295070Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:02.9295785Z 2025-05-07T19:53:02.9297574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.9300653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.9301929Z ^ 2025-05-07T19:53:02.9302333Z 2025-05-07T19:53:02.9304115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.9306975Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.9308232Z ^ 2025-05-07T19:53:02.9308503Z 2025-05-07T19:53:02.9308992Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:02.9309699Z 2025-05-07T19:53:02.9311448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.9314317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.9315904Z ^ 2025-05-07T19:53:02.9316305Z 2025-05-07T19:53:02.9318199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.9321013Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.9322491Z ^ 2025-05-07T19:53:02.9322774Z 2025-05-07T19:53:02.9323235Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:02.9323919Z 2025-05-07T19:53:02.9325669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.9328370Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.9329596Z ^ 2025-05-07T19:53:02.9329961Z 2025-05-07T19:53:02.9331612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.9334366Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.9335625Z ^ 2025-05-07T19:53:02.9335895Z 2025-05-07T19:53:02.9336353Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:02.9337062Z 2025-05-07T19:53:02.9338821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.9341847Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.9343097Z ^ 2025-05-07T19:53:02.9343486Z 2025-05-07T19:53:05.3906390Z [134/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:05.3929769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.3932555Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.3933747Z ^ 2025-05-07T19:53:05.3934000Z 2025-05-07T19:53:05.3934453Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.3935145Z 2025-05-07T19:53:05.3936837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.3939639Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.3940846Z ^ 2025-05-07T19:53:05.3941222Z 2025-05-07T19:53:05.3942911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.3945653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.3946836Z ^ 2025-05-07T19:53:05.3947099Z 2025-05-07T19:53:05.3947544Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.3948224Z 2025-05-07T19:53:05.3950016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.3952733Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.3953927Z ^ 2025-05-07T19:53:05.3954288Z 2025-05-07T19:53:05.3955983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.3958678Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.3959861Z ^ 2025-05-07T19:53:05.3960107Z 2025-05-07T19:53:05.3960518Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.3961189Z 2025-05-07T19:53:05.3962880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.3966881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.3968131Z ^ 2025-05-07T19:53:05.3968437Z 2025-05-07T19:53:05.3969935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.3972423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.3973573Z ^ 2025-05-07T19:53:05.3973844Z 2025-05-07T19:53:05.3974290Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.3974955Z 2025-05-07T19:53:05.3976459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.3978777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.3980012Z ^ 2025-05-07T19:53:05.3980357Z 2025-05-07T19:53:05.3981970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.3984549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.3985773Z ^ 2025-05-07T19:53:05.3986023Z 2025-05-07T19:53:05.3986477Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.3987121Z 2025-05-07T19:53:05.3988588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.3991205Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.3992438Z ^ 2025-05-07T19:53:05.3992810Z 2025-05-07T19:53:07.2258621Z [135/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o 2025-05-07T19:53:07.2283043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.2285992Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.2287243Z ^ 2025-05-07T19:53:07.2287484Z 2025-05-07T19:53:07.2287934Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:07.2288630Z 2025-05-07T19:53:07.2290300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.2293092Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.2294325Z ^ 2025-05-07T19:53:07.2294699Z 2025-05-07T19:53:07.2296412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.2299137Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.2300524Z ^ 2025-05-07T19:53:07.2300801Z 2025-05-07T19:53:07.2301241Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:07.2301948Z 2025-05-07T19:53:07.2303680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.2306509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.2307756Z ^ 2025-05-07T19:53:07.2308121Z 2025-05-07T19:53:07.2309788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.2312557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.2314049Z ^ 2025-05-07T19:53:07.2314309Z 2025-05-07T19:53:07.2314767Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:07.2315485Z 2025-05-07T19:53:07.2317285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.2319997Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.2321231Z ^ 2025-05-07T19:53:07.2321622Z 2025-05-07T19:53:07.2323632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.2326409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.2327626Z ^ 2025-05-07T19:53:07.2327897Z 2025-05-07T19:53:07.2328340Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:07.2329000Z 2025-05-07T19:53:07.2330822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.2333691Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.2334970Z ^ 2025-05-07T19:53:07.2335348Z 2025-05-07T19:53:07.2337128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.2340134Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.2341221Z ^ 2025-05-07T19:53:07.2341468Z 2025-05-07T19:53:07.2341902Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:07.2342617Z 2025-05-07T19:53:07.2344394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.2347240Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.2348330Z ^ 2025-05-07T19:53:07.2348648Z 2025-05-07T19:53:13.6757146Z [136/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o 2025-05-07T19:53:13.6780771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:13.6783394Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:13.6784536Z ^ 2025-05-07T19:53:13.6784786Z 2025-05-07T19:53:13.6785166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:13.6785746Z 2025-05-07T19:53:13.6787363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:13.6789888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:13.6791037Z ^ 2025-05-07T19:53:13.6791386Z 2025-05-07T19:53:13.6793023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:13.6795733Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:13.6796918Z ^ 2025-05-07T19:53:13.6797170Z 2025-05-07T19:53:13.6797622Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:13.6798304Z 2025-05-07T19:53:13.6800002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:13.6802728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:13.6803911Z ^ 2025-05-07T19:53:13.6804282Z 2025-05-07T19:53:13.6805954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:13.6808933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:13.6810121Z ^ 2025-05-07T19:53:13.6810364Z 2025-05-07T19:53:13.6810982Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:13.6811647Z 2025-05-07T19:53:13.6813328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:13.6816011Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:13.6817124Z ^ 2025-05-07T19:53:13.6817442Z 2025-05-07T19:53:13.6818765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:13.6821416Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:13.6822990Z ^ 2025-05-07T19:53:13.6823263Z 2025-05-07T19:53:13.6823695Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:13.6824353Z 2025-05-07T19:53:13.6826029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:13.6828820Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:13.6830079Z ^ 2025-05-07T19:53:13.6830441Z 2025-05-07T19:53:13.6832157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:13.6834823Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:13.6836054Z ^ 2025-05-07T19:53:13.6836310Z 2025-05-07T19:53:13.6836793Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:13.6837441Z 2025-05-07T19:53:13.6839160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:13.6841899Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:13.6843127Z ^ 2025-05-07T19:53:13.6843509Z 2025-05-07T19:53:18.2493106Z [137/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o 2025-05-07T19:53:18.2518487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.2521383Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.2522892Z ^ 2025-05-07T19:53:18.2523162Z 2025-05-07T19:53:18.2523653Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:18.2524345Z 2025-05-07T19:53:18.2526097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.2528973Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.2530269Z ^ 2025-05-07T19:53:18.2530649Z 2025-05-07T19:53:18.2532332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.2534477Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.2535067Z ^ 2025-05-07T19:53:18.2535416Z 2025-05-07T19:53:18.2537087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.2539231Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.2539927Z ^ 2025-05-07T19:53:18.2540260Z 2025-05-07T19:53:18.2541919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.2546424Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.2547026Z ^ 2025-05-07T19:53:18.2547337Z 2025-05-07T19:53:18.2549266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.2552087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.2553372Z ^ 2025-05-07T19:53:18.2553639Z 2025-05-07T19:53:18.2554140Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:18.2554855Z 2025-05-07T19:53:18.2556599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.2559486Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.2560753Z ^ 2025-05-07T19:53:18.2561134Z 2025-05-07T19:53:18.2562815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.2564945Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.2565535Z ^ 2025-05-07T19:53:18.2565865Z 2025-05-07T19:53:18.2567531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.2569662Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.2570259Z ^ 2025-05-07T19:53:18.2570584Z 2025-05-07T19:53:18.2572258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.2574349Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.2574910Z ^ 2025-05-07T19:53:18.2575205Z 2025-05-07T19:53:18.2576961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.2579875Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.2581109Z ^ 2025-05-07T19:53:18.2581364Z 2025-05-07T19:53:18.2581828Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:18.2582532Z 2025-05-07T19:53:18.2584276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.2587111Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.2588324Z ^ 2025-05-07T19:53:18.2588727Z 2025-05-07T19:53:18.2590408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.2592730Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.2593325Z ^ 2025-05-07T19:53:18.2593664Z 2025-05-07T19:53:18.2595485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.2597617Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.2598213Z ^ 2025-05-07T19:53:18.2598511Z 2025-05-07T19:53:18.2600191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.2602293Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.2602878Z ^ 2025-05-07T19:53:18.2603179Z 2025-05-07T19:53:18.2604930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.2607730Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.2608971Z ^ 2025-05-07T19:53:18.2609226Z 2025-05-07T19:53:18.2609695Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:18.2610387Z 2025-05-07T19:53:18.2612123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.2614976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.2616194Z ^ 2025-05-07T19:53:18.2616579Z 2025-05-07T19:53:18.2618272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.2620519Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.2621117Z ^ 2025-05-07T19:53:18.2621437Z 2025-05-07T19:53:18.2623324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.2625434Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.2626002Z ^ 2025-05-07T19:53:18.2626304Z 2025-05-07T19:53:18.2627998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.2630093Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.2630670Z ^ 2025-05-07T19:53:18.2630972Z 2025-05-07T19:53:18.2632734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.2635512Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.2637105Z ^ 2025-05-07T19:53:18.2637361Z 2025-05-07T19:53:18.2637816Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:18.2638532Z 2025-05-07T19:53:18.2640417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.2643236Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.2644462Z ^ 2025-05-07T19:53:18.2644842Z 2025-05-07T19:53:18.2646511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.2648595Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.2649179Z ^ 2025-05-07T19:53:18.2649492Z 2025-05-07T19:53:18.2651171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.2653262Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.2653832Z ^ 2025-05-07T19:53:18.2654131Z 2025-05-07T19:53:18.2655823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.2657901Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.2658470Z ^ 2025-05-07T19:53:18.2658772Z 2025-05-07T19:53:21.2919404Z [138/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:21.2944024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.2946960Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.2948229Z ^ 2025-05-07T19:53:21.2948529Z 2025-05-07T19:53:21.2949002Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:21.2949695Z 2025-05-07T19:53:21.2951512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.2954233Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.2955479Z ^ 2025-05-07T19:53:21.2955856Z 2025-05-07T19:53:21.2957561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.2960343Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.2961614Z ^ 2025-05-07T19:53:21.2961883Z 2025-05-07T19:53:21.2962380Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:21.2963098Z 2025-05-07T19:53:21.2964889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.2967794Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.2969061Z ^ 2025-05-07T19:53:21.2969474Z 2025-05-07T19:53:21.2971241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.2974036Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.2975217Z ^ 2025-05-07T19:53:21.2975487Z 2025-05-07T19:53:21.2975960Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:21.2976653Z 2025-05-07T19:53:21.2978461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.2981481Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.2982650Z ^ 2025-05-07T19:53:21.2982964Z 2025-05-07T19:53:21.2984491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.2987647Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.2989050Z ^ 2025-05-07T19:53:21.2989319Z 2025-05-07T19:53:21.2989786Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:21.2990498Z 2025-05-07T19:53:21.2992225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.2995008Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.2996238Z ^ 2025-05-07T19:53:21.2996630Z 2025-05-07T19:53:21.2998299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.3001123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.3002357Z ^ 2025-05-07T19:53:21.3002639Z 2025-05-07T19:53:21.3003074Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:21.3003696Z 2025-05-07T19:53:21.3005406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.3008410Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.3009667Z ^ 2025-05-07T19:53:21.3010041Z 2025-05-07T19:53:26.8939695Z [139/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:26.8962144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.8964725Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.8965823Z ^ 2025-05-07T19:53:26.8966067Z 2025-05-07T19:53:26.8966508Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:26.8967107Z 2025-05-07T19:53:26.8968664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.8971179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.8972318Z ^ 2025-05-07T19:53:26.8972691Z 2025-05-07T19:53:26.8974135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.8976094Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:26.8976878Z ^ 2025-05-07T19:53:26.8977181Z 2025-05-07T19:53:26.8978555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.8980472Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.8981001Z ^ 2025-05-07T19:53:26.8981269Z 2025-05-07T19:53:26.8982573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.8984385Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.8984923Z ^ 2025-05-07T19:53:26.8985176Z 2025-05-07T19:53:26.8986565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.8988323Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.8988833Z ^ 2025-05-07T19:53:26.8989100Z 2025-05-07T19:53:26.8990613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.8993096Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.8994534Z ^ 2025-05-07T19:53:26.8994804Z 2025-05-07T19:53:26.8995205Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:26.8995824Z 2025-05-07T19:53:26.8997592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.9000118Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.9001248Z ^ 2025-05-07T19:53:26.9001582Z 2025-05-07T19:53:26.9003007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9005020Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:26.9005765Z ^ 2025-05-07T19:53:26.9006034Z 2025-05-07T19:53:26.9007435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9009203Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.9009754Z ^ 2025-05-07T19:53:26.9010029Z 2025-05-07T19:53:26.9011438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9013181Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.9013701Z ^ 2025-05-07T19:53:26.9013970Z 2025-05-07T19:53:26.9015357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9017204Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.9017719Z ^ 2025-05-07T19:53:26.9017987Z 2025-05-07T19:53:26.9019485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.9022391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.9023484Z ^ 2025-05-07T19:53:26.9023730Z 2025-05-07T19:53:26.9024202Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:26.9024819Z 2025-05-07T19:53:26.9026370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.9028871Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.9029974Z ^ 2025-05-07T19:53:26.9030318Z 2025-05-07T19:53:26.9031735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9033677Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:26.9034912Z ^ 2025-05-07T19:53:26.9035188Z 2025-05-07T19:53:26.9036616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9038625Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.9039148Z ^ 2025-05-07T19:53:26.9039457Z 2025-05-07T19:53:26.9040979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9042899Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.9043445Z ^ 2025-05-07T19:53:26.9043735Z 2025-05-07T19:53:26.9045239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9047203Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.9047751Z ^ 2025-05-07T19:53:26.9048036Z 2025-05-07T19:53:26.9049784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.9052458Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.9053693Z ^ 2025-05-07T19:53:26.9053953Z 2025-05-07T19:53:26.9054409Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:26.9055122Z 2025-05-07T19:53:26.9056869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.9059288Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.9060572Z ^ 2025-05-07T19:53:26.9060885Z 2025-05-07T19:53:26.9062242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9064212Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:26.9064894Z ^ 2025-05-07T19:53:26.9065144Z 2025-05-07T19:53:26.9066525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9068334Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.9068854Z ^ 2025-05-07T19:53:26.9069146Z 2025-05-07T19:53:26.9070475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9072277Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.9072797Z ^ 2025-05-07T19:53:26.9073092Z 2025-05-07T19:53:26.9074542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9076261Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.9077041Z ^ 2025-05-07T19:53:26.9077296Z 2025-05-07T19:53:26.9078737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.9081380Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.9082478Z ^ 2025-05-07T19:53:26.9082705Z 2025-05-07T19:53:26.9083145Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:26.9083723Z 2025-05-07T19:53:26.9085219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.9087700Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.9088806Z ^ 2025-05-07T19:53:26.9089151Z 2025-05-07T19:53:26.9090538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9092508Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:26.9093199Z ^ 2025-05-07T19:53:26.9093496Z 2025-05-07T19:53:26.9094857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9096630Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.9097163Z ^ 2025-05-07T19:53:26.9097454Z 2025-05-07T19:53:26.9098873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9100860Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.9101368Z ^ 2025-05-07T19:53:26.9101620Z 2025-05-07T19:53:26.9103018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.9104810Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.9105336Z ^ 2025-05-07T19:53:26.9105600Z 2025-05-07T19:53:27.6144491Z [140/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:27.6166894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6169433Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.6170697Z ^ 2025-05-07T19:53:27.6170939Z 2025-05-07T19:53:27.6171429Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:27.6172059Z 2025-05-07T19:53:27.6173654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6176221Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.6177401Z ^ 2025-05-07T19:53:27.6177762Z 2025-05-07T19:53:27.6179394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6182174Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.6183380Z ^ 2025-05-07T19:53:27.6183636Z 2025-05-07T19:53:27.6184075Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:27.6184746Z 2025-05-07T19:53:27.6186429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6189001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.6190136Z ^ 2025-05-07T19:53:27.6190496Z 2025-05-07T19:53:27.6192125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6195035Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.6196245Z ^ 2025-05-07T19:53:27.6196506Z 2025-05-07T19:53:27.6196989Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:27.6197831Z 2025-05-07T19:53:27.6199522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6202118Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.6203379Z ^ 2025-05-07T19:53:27.6203764Z 2025-05-07T19:53:27.6205483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6208085Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.6209203Z ^ 2025-05-07T19:53:27.6209484Z 2025-05-07T19:53:27.6209925Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:27.6210575Z 2025-05-07T19:53:27.6212197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6214693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.6215855Z ^ 2025-05-07T19:53:27.6216223Z 2025-05-07T19:53:27.6217839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6220485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.6221618Z ^ 2025-05-07T19:53:27.6221868Z 2025-05-07T19:53:27.6222521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:27.6223192Z 2025-05-07T19:53:27.6224770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6227366Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.6228504Z ^ 2025-05-07T19:53:27.6228875Z 2025-05-07T19:53:28.3280233Z [141/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:28.3304971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.3307864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.3309108Z ^ 2025-05-07T19:53:28.3309374Z 2025-05-07T19:53:28.3309842Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:28.3310520Z 2025-05-07T19:53:28.3312242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.3315048Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.3316278Z ^ 2025-05-07T19:53:28.3316664Z 2025-05-07T19:53:28.3318193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3320329Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:28.3321134Z ^ 2025-05-07T19:53:28.3321434Z 2025-05-07T19:53:28.3323536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3325482Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.3325954Z ^ 2025-05-07T19:53:28.3326206Z 2025-05-07T19:53:28.3327656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3344135Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.3344942Z ^ 2025-05-07T19:53:28.3345228Z 2025-05-07T19:53:28.3348116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3350258Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.3350862Z ^ 2025-05-07T19:53:28.3351181Z 2025-05-07T19:53:28.3353107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.3356006Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.3357297Z ^ 2025-05-07T19:53:28.3357592Z 2025-05-07T19:53:28.3358071Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:28.3358783Z 2025-05-07T19:53:28.3360619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.3363335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.3364442Z ^ 2025-05-07T19:53:28.3364788Z 2025-05-07T19:53:28.3366271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3368347Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:28.3369060Z ^ 2025-05-07T19:53:28.3369340Z 2025-05-07T19:53:28.3370910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3372947Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.3373556Z ^ 2025-05-07T19:53:28.3373846Z 2025-05-07T19:53:28.3375414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3377468Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.3377991Z ^ 2025-05-07T19:53:28.3378311Z 2025-05-07T19:53:28.3380010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3382016Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.3382596Z ^ 2025-05-07T19:53:28.3382903Z 2025-05-07T19:53:28.3384590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.3387368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.3388576Z ^ 2025-05-07T19:53:28.3388837Z 2025-05-07T19:53:28.3389333Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:28.3390276Z 2025-05-07T19:53:28.3391911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.3394639Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.3395922Z ^ 2025-05-07T19:53:28.3396297Z 2025-05-07T19:53:28.3397829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3399794Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:28.3400462Z ^ 2025-05-07T19:53:28.3400721Z 2025-05-07T19:53:28.3401957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3403530Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.3404018Z ^ 2025-05-07T19:53:28.3404290Z 2025-05-07T19:53:28.3405507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3407223Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.3407736Z ^ 2025-05-07T19:53:28.3408009Z 2025-05-07T19:53:28.3409446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3411290Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.3411798Z ^ 2025-05-07T19:53:28.3412056Z 2025-05-07T19:53:28.3413825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.3416419Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.3417576Z ^ 2025-05-07T19:53:28.3417847Z 2025-05-07T19:53:28.3418326Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:28.3419056Z 2025-05-07T19:53:28.3420988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.3424185Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.3425456Z ^ 2025-05-07T19:53:28.3425865Z 2025-05-07T19:53:28.3427491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3429772Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:28.3430562Z ^ 2025-05-07T19:53:28.3430898Z 2025-05-07T19:53:28.3432514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3434926Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.3435516Z ^ 2025-05-07T19:53:28.3435814Z 2025-05-07T19:53:28.3437626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3439674Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.3440286Z ^ 2025-05-07T19:53:28.3440577Z 2025-05-07T19:53:28.3442230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3444117Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.3444678Z ^ 2025-05-07T19:53:28.3444955Z 2025-05-07T19:53:28.3446646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.3449428Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.3450688Z ^ 2025-05-07T19:53:28.3450952Z 2025-05-07T19:53:28.3451401Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:28.3452125Z 2025-05-07T19:53:28.3453859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.3456682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.3457920Z ^ 2025-05-07T19:53:28.3458295Z 2025-05-07T19:53:28.3460062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3462230Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:28.3462940Z ^ 2025-05-07T19:53:28.3463214Z 2025-05-07T19:53:28.3464735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3466543Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.3467070Z ^ 2025-05-07T19:53:28.3467361Z 2025-05-07T19:53:28.3468848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3470644Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.3471152Z ^ 2025-05-07T19:53:28.3471399Z 2025-05-07T19:53:28.3472702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.3474757Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.3475338Z ^ 2025-05-07T19:53:28.3475650Z 2025-05-07T19:53:31.8980972Z [142/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o 2025-05-07T19:53:31.9003211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:31.9006111Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:31.9007276Z ^ 2025-05-07T19:53:31.9007541Z 2025-05-07T19:53:31.9008021Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:31.9008692Z 2025-05-07T19:53:31.9010367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:31.9013164Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:31.9014445Z ^ 2025-05-07T19:53:31.9014824Z 2025-05-07T19:53:31.9016590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:31.9019684Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:31.9020870Z ^ 2025-05-07T19:53:31.9021137Z 2025-05-07T19:53:31.9021591Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:31.9022597Z 2025-05-07T19:53:31.9024313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:31.9027570Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:31.9029046Z ^ 2025-05-07T19:53:31.9029456Z 2025-05-07T19:53:31.9031115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:31.9033876Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:31.9035078Z ^ 2025-05-07T19:53:31.9035312Z 2025-05-07T19:53:31.9035740Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:31.9036439Z 2025-05-07T19:53:31.9038156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:31.9041020Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:31.9042313Z ^ 2025-05-07T19:53:31.9042694Z 2025-05-07T19:53:31.9044362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:31.9047148Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:31.9048364Z ^ 2025-05-07T19:53:31.9048620Z 2025-05-07T19:53:31.9049086Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:31.9049789Z 2025-05-07T19:53:31.9051580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:31.9054309Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:31.9055493Z ^ 2025-05-07T19:53:31.9055833Z 2025-05-07T19:53:31.9057441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:31.9060221Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:31.9061430Z ^ 2025-05-07T19:53:31.9061689Z 2025-05-07T19:53:31.9062190Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:31.9062884Z 2025-05-07T19:53:31.9064456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:31.9067201Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:31.9068421Z ^ 2025-05-07T19:53:31.9068782Z 2025-05-07T19:53:32.5631894Z [143/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_utils.so -o fbgemm_gpu_tbe_utils.so CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:53:32.6099843Z [144/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o 2025-05-07T19:53:32.6124582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.6127501Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.6128752Z ^ 2025-05-07T19:53:32.6129055Z 2025-05-07T19:53:32.6129524Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.6130205Z 2025-05-07T19:53:32.6131850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.6134641Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.6135709Z ^ 2025-05-07T19:53:32.6136088Z 2025-05-07T19:53:32.6137905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.6140885Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.6142165Z ^ 2025-05-07T19:53:32.6142433Z 2025-05-07T19:53:32.6142876Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.6143545Z 2025-05-07T19:53:32.6145056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.6147690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.6148892Z ^ 2025-05-07T19:53:32.6149291Z 2025-05-07T19:53:32.6150998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.6153728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.6154939Z ^ 2025-05-07T19:53:32.6155227Z 2025-05-07T19:53:32.6155692Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.6156769Z 2025-05-07T19:53:32.6158504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.6161463Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.6162733Z ^ 2025-05-07T19:53:32.6163113Z 2025-05-07T19:53:32.6164829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.6167516Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.6168719Z ^ 2025-05-07T19:53:32.6168998Z 2025-05-07T19:53:32.6169471Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.6170183Z 2025-05-07T19:53:32.6171930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.6174735Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.6175996Z ^ 2025-05-07T19:53:32.6176400Z 2025-05-07T19:53:32.6178131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.6181078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.6182316Z ^ 2025-05-07T19:53:32.6182614Z 2025-05-07T19:53:32.6183085Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.6183790Z 2025-05-07T19:53:32.6185597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.6188434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.6189726Z ^ 2025-05-07T19:53:32.6190076Z 2025-05-07T19:53:33.1713437Z [145/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_sparse_async_cumsum.so -o fbgemm_gpu_sparse_async_cumsum.so CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:53:34.0405310Z [146/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o 2025-05-07T19:53:34.0429807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:34.0432789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:34.0433992Z ^ 2025-05-07T19:53:34.0434299Z 2025-05-07T19:53:34.0435248Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:34.0435877Z 2025-05-07T19:53:34.0437532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:34.0440585Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:34.0441772Z ^ 2025-05-07T19:53:34.0442133Z 2025-05-07T19:53:34.0443823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:34.0446723Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:34.0447985Z ^ 2025-05-07T19:53:34.0448255Z 2025-05-07T19:53:34.0448741Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:34.0449411Z 2025-05-07T19:53:34.0451071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:34.0453841Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:34.0455065Z ^ 2025-05-07T19:53:34.0455415Z 2025-05-07T19:53:34.0457003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:34.0459831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:34.0460997Z ^ 2025-05-07T19:53:34.0461241Z 2025-05-07T19:53:34.0461659Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:34.0462295Z 2025-05-07T19:53:34.0463907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:34.0466522Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:34.0467735Z ^ 2025-05-07T19:53:34.0468111Z 2025-05-07T19:53:34.0469795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:34.0472583Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:34.0473851Z ^ 2025-05-07T19:53:34.0474116Z 2025-05-07T19:53:34.0474610Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:34.0475311Z 2025-05-07T19:53:34.0477070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:34.0479903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:34.0481427Z ^ 2025-05-07T19:53:34.0481795Z 2025-05-07T19:53:34.0483409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:34.0486374Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:34.0487611Z ^ 2025-05-07T19:53:34.0487900Z 2025-05-07T19:53:34.0488376Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:34.0489068Z 2025-05-07T19:53:34.0490831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:34.0493572Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:34.0494803Z ^ 2025-05-07T19:53:34.0495193Z 2025-05-07T19:53:39.7778434Z [147/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:39.7801876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7805236Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.7806590Z ^ 2025-05-07T19:53:39.7806869Z 2025-05-07T19:53:39.7807349Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:39.7808411Z 2025-05-07T19:53:39.7810178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7813260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.7814609Z ^ 2025-05-07T19:53:39.7814965Z 2025-05-07T19:53:39.7816755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7819921Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.7821213Z ^ 2025-05-07T19:53:39.7821504Z 2025-05-07T19:53:39.7822235Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:39.7822893Z 2025-05-07T19:53:39.7824475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7827214Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.7828481Z ^ 2025-05-07T19:53:39.7828855Z 2025-05-07T19:53:39.7830446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7833486Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.7834745Z ^ 2025-05-07T19:53:39.7835021Z 2025-05-07T19:53:39.7835506Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:39.7836227Z 2025-05-07T19:53:39.7837940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7840789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.7841986Z ^ 2025-05-07T19:53:39.7842348Z 2025-05-07T19:53:39.7843944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7846469Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.7847605Z ^ 2025-05-07T19:53:39.7847840Z 2025-05-07T19:53:39.7848288Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:39.7848921Z 2025-05-07T19:53:39.7850620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7853691Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.7854843Z ^ 2025-05-07T19:53:39.7855215Z 2025-05-07T19:53:39.7856994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7859970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.7861130Z ^ 2025-05-07T19:53:39.7861426Z 2025-05-07T19:53:39.7861897Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:39.7862579Z 2025-05-07T19:53:39.7864318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7866965Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.7868130Z ^ 2025-05-07T19:53:39.7868520Z 2025-05-07T19:53:39.8066098Z [148/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:39.8090436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.8093579Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.8094594Z ^ 2025-05-07T19:53:39.8094857Z 2025-05-07T19:53:39.8095330Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:39.8095942Z 2025-05-07T19:53:39.8097562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.8100368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.8101534Z ^ 2025-05-07T19:53:39.8101880Z 2025-05-07T19:53:39.8103445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.8105971Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.8107114Z ^ 2025-05-07T19:53:39.8107358Z 2025-05-07T19:53:39.8107780Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:39.8108433Z 2025-05-07T19:53:39.8109976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.8112549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.8113677Z ^ 2025-05-07T19:53:39.8114011Z 2025-05-07T19:53:39.8115582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.8118092Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.8119308Z ^ 2025-05-07T19:53:39.8119570Z 2025-05-07T19:53:39.8120050Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:39.8120728Z 2025-05-07T19:53:39.8122850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.8125662Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.8126876Z ^ 2025-05-07T19:53:39.8127213Z 2025-05-07T19:53:39.8128742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.8131287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.8132396Z ^ 2025-05-07T19:53:39.8134891Z 2025-05-07T19:53:39.8135293Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:39.8135897Z 2025-05-07T19:53:39.8137487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.8140687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.8141850Z ^ 2025-05-07T19:53:39.8142173Z 2025-05-07T19:53:39.8143693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.8146341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.8147494Z ^ 2025-05-07T19:53:39.8147757Z 2025-05-07T19:53:39.8148222Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:39.8148905Z 2025-05-07T19:53:39.8150620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.8153471Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.8154717Z ^ 2025-05-07T19:53:39.8155127Z 2025-05-07T19:53:52.5932586Z [149/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:53:52.5956591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.5959176Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:52.5960263Z ^ 2025-05-07T19:53:52.5960507Z 2025-05-07T19:53:52.5960972Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:52.5961620Z 2025-05-07T19:53:52.5963235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.5965837Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:52.5967027Z ^ 2025-05-07T19:53:52.5967377Z 2025-05-07T19:53:52.5968951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.5971550Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:52.5972671Z ^ 2025-05-07T19:53:52.5972910Z 2025-05-07T19:53:52.5973339Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:52.5973983Z 2025-05-07T19:53:52.5975566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.5977941Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:52.5978981Z ^ 2025-05-07T19:53:52.5979496Z 2025-05-07T19:53:52.5981101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.5983764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:52.5984978Z ^ 2025-05-07T19:53:52.5985212Z 2025-05-07T19:53:52.5985634Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:52.5986282Z 2025-05-07T19:53:52.5988023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.5990707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:52.5991882Z ^ 2025-05-07T19:53:52.5992240Z 2025-05-07T19:53:52.5993894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.5996849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:52.5997965Z ^ 2025-05-07T19:53:52.5998210Z 2025-05-07T19:53:52.5998729Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:52.5999333Z 2025-05-07T19:53:52.6000954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.6003649Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:52.6004896Z ^ 2025-05-07T19:53:52.6005266Z 2025-05-07T19:53:52.6007020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.6009585Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:52.6010665Z ^ 2025-05-07T19:53:52.6010892Z 2025-05-07T19:53:52.6011350Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:52.6011988Z 2025-05-07T19:53:52.6013642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.6016479Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:52.6017729Z ^ 2025-05-07T19:53:52.6018115Z 2025-05-07T19:53:54.7281528Z [150/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:53:54.7304735Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:58.0519980Z [151/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:53:58.0537295Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:02.0968596Z [152/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:02.0991004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.0993501Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:02.0994621Z ^ 2025-05-07T19:54:02.0994844Z 2025-05-07T19:54:02.0995245Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:02.0995852Z 2025-05-07T19:54:02.0997349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.0999826Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:02.1000924Z ^ 2025-05-07T19:54:02.1001267Z 2025-05-07T19:54:02.1002791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.1005317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:02.1006427Z ^ 2025-05-07T19:54:02.1006657Z 2025-05-07T19:54:02.1007067Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:02.1007716Z 2025-05-07T19:54:02.1009300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.1011785Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:02.1012924Z ^ 2025-05-07T19:54:02.1013293Z 2025-05-07T19:54:02.1014880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.1017424Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:02.1018532Z ^ 2025-05-07T19:54:02.1019139Z 2025-05-07T19:54:02.1019783Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:02.1020357Z 2025-05-07T19:54:02.1022197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.1024601Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:02.1025665Z ^ 2025-05-07T19:54:02.1026029Z 2025-05-07T19:54:02.1027527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.1029997Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:02.1031077Z ^ 2025-05-07T19:54:02.1031351Z 2025-05-07T19:54:02.1031760Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:02.1032372Z 2025-05-07T19:54:02.1033915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.1036430Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:02.1037520Z ^ 2025-05-07T19:54:02.1037849Z 2025-05-07T19:54:02.1039400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.1041829Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:02.1042924Z ^ 2025-05-07T19:54:02.1043173Z 2025-05-07T19:54:02.1043617Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:02.1044259Z 2025-05-07T19:54:02.1045705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.1048223Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:02.1049242Z ^ 2025-05-07T19:54:02.1049605Z 2025-05-07T19:54:03.4841744Z [153/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:54:03.4860609Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:07.2240416Z [154/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:07.2261067Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:10.5051213Z [155/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:54:10.5070896Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:15.1435832Z [156/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:54:15.1456867Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:18.9580900Z [157/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:54:18.9600976Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:19.9501781Z [158/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:54:19.9522239Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.0345685Z [159/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:20.0369004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.0371498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.0372696Z ^ 2025-05-07T19:54:20.0372949Z 2025-05-07T19:54:20.0373920Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:20.0374588Z 2025-05-07T19:54:20.0376225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.0379143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.0380452Z ^ 2025-05-07T19:54:20.0380776Z 2025-05-07T19:54:20.0382356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.0384984Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.0386173Z ^ 2025-05-07T19:54:20.0386439Z 2025-05-07T19:54:20.0386881Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:20.0387538Z 2025-05-07T19:54:20.0389218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.0391959Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.0393153Z ^ 2025-05-07T19:54:20.0393512Z 2025-05-07T19:54:20.0395194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.0397770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.0398842Z ^ 2025-05-07T19:54:20.0399092Z 2025-05-07T19:54:20.0399525Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:20.0400214Z 2025-05-07T19:54:20.0401843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.0404547Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.0405675Z ^ 2025-05-07T19:54:20.0406024Z 2025-05-07T19:54:20.0407557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.0410249Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.0411438Z ^ 2025-05-07T19:54:20.0411695Z 2025-05-07T19:54:20.0412119Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:20.0412749Z 2025-05-07T19:54:20.0414375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.0416966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.0418146Z ^ 2025-05-07T19:54:20.0418858Z 2025-05-07T19:54:20.0420669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.0423933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.0425122Z ^ 2025-05-07T19:54:20.0425389Z 2025-05-07T19:54:20.0425831Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:20.0426497Z 2025-05-07T19:54:20.0428209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.0430857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.0432053Z ^ 2025-05-07T19:54:20.0432401Z 2025-05-07T19:54:22.1494618Z [160/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:54:22.1514395Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:23.2642692Z [161/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:23.2663066Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:23.5330037Z [162/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:54:23.5350492Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:27.2708808Z [163/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:27.2731641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.2734476Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.2735654Z ^ 2025-05-07T19:54:27.2735922Z 2025-05-07T19:54:27.2736408Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:27.2737028Z 2025-05-07T19:54:27.2738604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.2741171Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.2742213Z ^ 2025-05-07T19:54:27.2742520Z 2025-05-07T19:54:27.2744042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.2747054Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.2748376Z ^ 2025-05-07T19:54:27.2748629Z 2025-05-07T19:54:27.2749061Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:27.2749709Z 2025-05-07T19:54:27.2751364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.2753935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.2755150Z ^ 2025-05-07T19:54:27.2755508Z 2025-05-07T19:54:27.2757219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.2759821Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.2761009Z ^ 2025-05-07T19:54:27.2761257Z 2025-05-07T19:54:27.2761709Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:27.2762359Z 2025-05-07T19:54:27.2764027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.2766649Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.2767796Z ^ 2025-05-07T19:54:27.2768196Z 2025-05-07T19:54:27.2769849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.2772564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.2773586Z ^ 2025-05-07T19:54:27.2773827Z 2025-05-07T19:54:27.2774231Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:27.2774839Z 2025-05-07T19:54:27.2776446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.2779510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.2780754Z ^ 2025-05-07T19:54:27.2781115Z 2025-05-07T19:54:27.2782774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.2785475Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.2786677Z ^ 2025-05-07T19:54:27.2786940Z 2025-05-07T19:54:27.2787400Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:27.2788410Z 2025-05-07T19:54:27.2790114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.2792880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.2794112Z ^ 2025-05-07T19:54:27.2794494Z 2025-05-07T19:54:27.8005453Z [164/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:27.8026552Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:27.8360657Z [165/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:27.8380946Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:27.8711016Z [166/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:27.8730958Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:27.9072839Z [167/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:27.9091618Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:27.9424791Z [168/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:27.9446610Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:27.9775779Z [169/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:27.9796404Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:28.0127098Z [170/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:54:28.0147117Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:28.0481833Z [171/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:28.0502781Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:28.0829520Z [172/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:28.0848874Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:28.1184479Z [173/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:28.1205786Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:28.1533659Z [174/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:28.1554168Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:28.1881721Z [175/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:28.1900639Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:28.2238204Z [176/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:28.2258608Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:28.2591211Z [177/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:28.2612589Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:30.8624468Z [178/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:30.8647956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8650713Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.8651946Z ^ 2025-05-07T19:54:30.8652212Z 2025-05-07T19:54:30.8652686Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:30.8653425Z 2025-05-07T19:54:30.8655163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8657910Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.8659113Z ^ 2025-05-07T19:54:30.8659672Z 2025-05-07T19:54:30.8661332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8664221Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.8665433Z ^ 2025-05-07T19:54:30.8665710Z 2025-05-07T19:54:30.8666214Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:30.8666929Z 2025-05-07T19:54:30.8668438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8670778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.8671921Z ^ 2025-05-07T19:54:30.8672258Z 2025-05-07T19:54:30.8673877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8676557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.8677776Z ^ 2025-05-07T19:54:30.8678032Z 2025-05-07T19:54:30.8678476Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:30.8679128Z 2025-05-07T19:54:30.8680752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8683393Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.8684958Z ^ 2025-05-07T19:54:30.8685336Z 2025-05-07T19:54:30.8690260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8692958Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.8694003Z ^ 2025-05-07T19:54:30.8694241Z 2025-05-07T19:54:30.8694714Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:30.8695414Z 2025-05-07T19:54:30.8696930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8699183Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.8700384Z ^ 2025-05-07T19:54:30.8700752Z 2025-05-07T19:54:30.8702293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8705165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.8706295Z ^ 2025-05-07T19:54:30.8706578Z 2025-05-07T19:54:30.8707001Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:30.8707616Z 2025-05-07T19:54:30.8709305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8712113Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.8713383Z ^ 2025-05-07T19:54:30.8713760Z 2025-05-07T19:54:31.2183315Z [179/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:31.2205220Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:32.2226487Z [180/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:54:32.2245591Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:32.6365319Z [181/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:32.6385963Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:35.9600896Z [182/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:35.9621194Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:36.6580937Z [183/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:36.6597648Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:37.6205172Z [184/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:37.6226246Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:37.7162511Z [185/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o 2025-05-07T19:54:37.7185982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.7188518Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.7189600Z ^ 2025-05-07T19:54:37.7189835Z 2025-05-07T19:54:37.7190254Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.7191355Z 2025-05-07T19:54:37.7192869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.7195757Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.7196899Z ^ 2025-05-07T19:54:37.7197253Z 2025-05-07T19:54:37.7198446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7200212Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7200976Z ^ 2025-05-07T19:54:37.7204534Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:37.7207906Z 2025-05-07T19:54:37.7209233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7211147Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7211972Z ^ 2025-05-07T19:54:37.7215245Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:37.7218304Z 2025-05-07T19:54:37.7219732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7221704Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7222868Z ^ 2025-05-07T19:54:37.7226437Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:37.7229676Z 2025-05-07T19:54:37.7231005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7233061Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7233861Z ^ 2025-05-07T19:54:37.7237589Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:37.7241066Z 2025-05-07T19:54:37.7242545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7244571Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7245455Z ^ 2025-05-07T19:54:37.7248991Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:37.7252368Z 2025-05-07T19:54:37.7253713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7255761Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7256730Z ^ 2025-05-07T19:54:37.7260634Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:37.7263975Z 2025-05-07T19:54:37.7265381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7267509Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7268469Z ^ 2025-05-07T19:54:37.7272274Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:37.7275604Z 2025-05-07T19:54:37.7276911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7278860Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7279750Z ^ 2025-05-07T19:54:37.7283321Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:37.7287020Z 2025-05-07T19:54:37.7288528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7290738Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7291738Z ^ 2025-05-07T19:54:37.7295360Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:37.7298929Z 2025-05-07T19:54:37.7300575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7302702Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7303482Z ^ 2025-05-07T19:54:37.7306809Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:37.7310144Z 2025-05-07T19:54:37.7311429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7313366Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7314208Z ^ 2025-05-07T19:54:37.7317671Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:37.7321080Z 2025-05-07T19:54:37.7322742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7324917Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7325889Z ^ 2025-05-07T19:54:37.7329573Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:37.7333514Z 2025-05-07T19:54:37.7334930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7337040Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7338004Z ^ 2025-05-07T19:54:37.7341715Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:37.7345252Z 2025-05-07T19:54:37.7346682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7348705Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7349644Z ^ 2025-05-07T19:54:37.7353186Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:37.7356552Z 2025-05-07T19:54:37.7357973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7360049Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7361026Z ^ 2025-05-07T19:54:37.7364748Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:37.7368135Z 2025-05-07T19:54:37.7369476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7371602Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7372405Z ^ 2025-05-07T19:54:37.7375826Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:37.7379570Z 2025-05-07T19:54:37.7380914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7383246Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7384185Z ^ 2025-05-07T19:54:37.7387661Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:37.7391073Z 2025-05-07T19:54:37.7392470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7394653Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7395579Z ^ 2025-05-07T19:54:37.7399239Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:37.7402716Z 2025-05-07T19:54:37.7404187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7406451Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7407443Z ^ 2025-05-07T19:54:37.7411348Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:37.7415113Z 2025-05-07T19:54:37.7416551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7418718Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7419854Z ^ 2025-05-07T19:54:37.7423743Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:37.7427704Z 2025-05-07T19:54:37.7428943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7431193Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7432163Z ^ 2025-05-07T19:54:37.7435722Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:37.7438910Z 2025-05-07T19:54:37.7440178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7442324Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7443306Z ^ 2025-05-07T19:54:37.7446968Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:37.7450289Z 2025-05-07T19:54:37.7451665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7453821Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7454813Z ^ 2025-05-07T19:54:37.7458707Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:37.7461983Z 2025-05-07T19:54:37.7463285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7465223Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7466121Z ^ 2025-05-07T19:54:37.7469711Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:37.7473570Z 2025-05-07T19:54:37.7475394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.7478300Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.7479602Z ^ 2025-05-07T19:54:37.7479866Z 2025-05-07T19:54:37.7480305Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.7480941Z 2025-05-07T19:54:37.7482675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.7485449Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.7486655Z ^ 2025-05-07T19:54:37.7487033Z 2025-05-07T19:54:37.7488413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7490551Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7491426Z ^ 2025-05-07T19:54:37.7494957Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:37.7498388Z 2025-05-07T19:54:37.7499928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7502292Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7503198Z ^ 2025-05-07T19:54:37.7506869Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:37.7510407Z 2025-05-07T19:54:37.7511813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7513989Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7514933Z ^ 2025-05-07T19:54:37.7518449Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:37.7522472Z 2025-05-07T19:54:37.7523863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7526122Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7526960Z ^ 2025-05-07T19:54:37.7530408Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:37.7534050Z 2025-05-07T19:54:37.7535354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7537415Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7538389Z ^ 2025-05-07T19:54:37.7541921Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:37.7544997Z 2025-05-07T19:54:37.7546239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7548234Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7549113Z ^ 2025-05-07T19:54:37.7552533Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:37.7555716Z 2025-05-07T19:54:37.7557004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7559011Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7559957Z ^ 2025-05-07T19:54:37.7563649Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:37.7567369Z 2025-05-07T19:54:37.7568664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7570646Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7571677Z ^ 2025-05-07T19:54:37.7575190Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:37.7578477Z 2025-05-07T19:54:37.7580194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7582271Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7583235Z ^ 2025-05-07T19:54:37.7586701Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:37.7589929Z 2025-05-07T19:54:37.7591405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7593538Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7594503Z ^ 2025-05-07T19:54:37.7598178Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:37.7601708Z 2025-05-07T19:54:37.7603096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7605191Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7606107Z ^ 2025-05-07T19:54:37.7609746Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:37.7612933Z 2025-05-07T19:54:37.7614171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7616383Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7617311Z ^ 2025-05-07T19:54:37.7623693Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:37.7627192Z 2025-05-07T19:54:37.7628495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7630605Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7631579Z ^ 2025-05-07T19:54:37.7635376Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:37.7638572Z 2025-05-07T19:54:37.7639854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7641925Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7642842Z ^ 2025-05-07T19:54:37.7646511Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:37.7649955Z 2025-05-07T19:54:37.7651289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7653361Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7654332Z ^ 2025-05-07T19:54:37.7658223Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:37.7661997Z 2025-05-07T19:54:37.7663400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7665948Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7666817Z ^ 2025-05-07T19:54:37.7670853Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:37.7674218Z 2025-05-07T19:54:37.7675523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7677425Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7678264Z ^ 2025-05-07T19:54:37.7681988Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:37.7685382Z 2025-05-07T19:54:37.7686605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7688715Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7689636Z ^ 2025-05-07T19:54:37.7693489Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:37.7697169Z 2025-05-07T19:54:37.7698353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7700478Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7701372Z ^ 2025-05-07T19:54:37.7704933Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:37.7708314Z 2025-05-07T19:54:37.7709697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7712093Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7713045Z ^ 2025-05-07T19:54:37.7716835Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:37.7737722Z 2025-05-07T19:54:37.7739165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7741321Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7742282Z ^ 2025-05-07T19:54:37.7746110Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:37.7749735Z 2025-05-07T19:54:37.7751157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7753328Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7754213Z ^ 2025-05-07T19:54:37.7757707Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:37.7761308Z 2025-05-07T19:54:37.7762625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7764572Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7765477Z ^ 2025-05-07T19:54:37.7769074Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:37.7772450Z 2025-05-07T19:54:37.7773805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7776411Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7777371Z ^ 2025-05-07T19:54:37.7781125Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:37.7784404Z 2025-05-07T19:54:37.7786087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.7788798Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.7789979Z ^ 2025-05-07T19:54:37.7790217Z 2025-05-07T19:54:37.7790635Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.7791330Z 2025-05-07T19:54:37.7792881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.7795599Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.7796773Z ^ 2025-05-07T19:54:37.7797125Z 2025-05-07T19:54:37.7798469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7800509Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7801694Z ^ 2025-05-07T19:54:37.7805522Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:37.7809019Z 2025-05-07T19:54:37.7810324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7812403Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7813305Z ^ 2025-05-07T19:54:37.7816824Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:37.7820711Z 2025-05-07T19:54:37.7822255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7824866Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7825803Z ^ 2025-05-07T19:54:37.7829755Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:37.7833030Z 2025-05-07T19:54:37.7834406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7836319Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7837218Z ^ 2025-05-07T19:54:37.7840717Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:37.7844133Z 2025-05-07T19:54:37.7845474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7847515Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7848434Z ^ 2025-05-07T19:54:37.7852263Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:37.7855515Z 2025-05-07T19:54:37.7856838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7858893Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7859788Z ^ 2025-05-07T19:54:37.7863368Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:37.7866676Z 2025-05-07T19:54:37.7867990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7870214Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7871148Z ^ 2025-05-07T19:54:37.7874821Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:37.7878270Z 2025-05-07T19:54:37.7879642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7881568Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7882504Z ^ 2025-05-07T19:54:37.7886151Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:37.7889554Z 2025-05-07T19:54:37.7890937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7892969Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7893898Z ^ 2025-05-07T19:54:37.7897202Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:37.7900621Z 2025-05-07T19:54:37.7901995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7904128Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7905093Z ^ 2025-05-07T19:54:37.7908813Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:37.7912181Z 2025-05-07T19:54:37.7913559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7915606Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7916840Z ^ 2025-05-07T19:54:37.7920741Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:37.7924631Z 2025-05-07T19:54:37.7925966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7928211Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7929132Z ^ 2025-05-07T19:54:37.7932775Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:37.7936287Z 2025-05-07T19:54:37.7937537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7939775Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7940963Z ^ 2025-05-07T19:54:37.7944706Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:37.7948181Z 2025-05-07T19:54:37.7949427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7951402Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7952292Z ^ 2025-05-07T19:54:37.7955947Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:37.7959552Z 2025-05-07T19:54:37.7960958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7963068Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7964427Z ^ 2025-05-07T19:54:37.7968080Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:37.7971631Z 2025-05-07T19:54:37.7973081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7975270Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7976240Z ^ 2025-05-07T19:54:37.7980192Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:37.7983802Z 2025-05-07T19:54:37.7985242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7987425Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7988404Z ^ 2025-05-07T19:54:37.7992046Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:37.7995391Z 2025-05-07T19:54:37.7996661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.7998630Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.7999546Z ^ 2025-05-07T19:54:37.8003149Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:37.8006525Z 2025-05-07T19:54:37.8007891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8010016Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8010986Z ^ 2025-05-07T19:54:37.8015159Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:37.8018690Z 2025-05-07T19:54:37.8020226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8022721Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8023620Z ^ 2025-05-07T19:54:37.8027423Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:37.8030796Z 2025-05-07T19:54:37.8032116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8034017Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8034923Z ^ 2025-05-07T19:54:37.8038800Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:37.8042506Z 2025-05-07T19:54:37.8043796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8045847Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8046793Z ^ 2025-05-07T19:54:37.8050938Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:37.8054536Z 2025-05-07T19:54:37.8055900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8058024Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8058960Z ^ 2025-05-07T19:54:37.8063681Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:37.8067298Z 2025-05-07T19:54:37.8068599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8070877Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8071840Z ^ 2025-05-07T19:54:37.8075815Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:37.8079571Z 2025-05-07T19:54:37.8081384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.8084168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.8085456Z ^ 2025-05-07T19:54:37.8085732Z 2025-05-07T19:54:37.8086210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.8086929Z 2025-05-07T19:54:37.8088773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.8091520Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.8092823Z ^ 2025-05-07T19:54:37.8093232Z 2025-05-07T19:54:37.8094618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8096693Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8097689Z ^ 2025-05-07T19:54:37.8101672Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:37.8104983Z 2025-05-07T19:54:37.8106328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8108293Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8109561Z ^ 2025-05-07T19:54:37.8113577Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:37.8117118Z 2025-05-07T19:54:37.8118502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8120540Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8121469Z ^ 2025-05-07T19:54:37.8125503Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:37.8129145Z 2025-05-07T19:54:37.8130579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8132704Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8133566Z ^ 2025-05-07T19:54:37.8136955Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:37.8140352Z 2025-05-07T19:54:37.8141685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8143700Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8144594Z ^ 2025-05-07T19:54:37.8148020Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:37.8151296Z 2025-05-07T19:54:37.8152566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8154624Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8155586Z ^ 2025-05-07T19:54:37.8160028Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:37.8163595Z 2025-05-07T19:54:37.8164969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8167005Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8167922Z ^ 2025-05-07T19:54:37.8171492Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:37.8174859Z 2025-05-07T19:54:37.8176294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8178254Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8179198Z ^ 2025-05-07T19:54:37.8183123Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:37.8186646Z 2025-05-07T19:54:37.8188032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8190147Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8191008Z ^ 2025-05-07T19:54:37.8194616Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:37.8198087Z 2025-05-07T19:54:37.8199439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8201483Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8202355Z ^ 2025-05-07T19:54:37.8206275Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:37.8209760Z 2025-05-07T19:54:37.8211054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8213132Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8214016Z ^ 2025-05-07T19:54:37.8217233Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:37.8220644Z 2025-05-07T19:54:37.8221928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8224037Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8224911Z ^ 2025-05-07T19:54:37.8228393Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:37.8231631Z 2025-05-07T19:54:37.8233003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8235038Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8235866Z ^ 2025-05-07T19:54:37.8239327Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:37.8242686Z 2025-05-07T19:54:37.8244040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8246127Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8247077Z ^ 2025-05-07T19:54:37.8250495Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:37.8257395Z 2025-05-07T19:54:37.8258792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8261057Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8261984Z ^ 2025-05-07T19:54:37.8265632Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:37.8269124Z 2025-05-07T19:54:37.8270443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8272525Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8273476Z ^ 2025-05-07T19:54:37.8277275Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:37.8280776Z 2025-05-07T19:54:37.8282327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8284298Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8285190Z ^ 2025-05-07T19:54:37.8288775Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:37.8292075Z 2025-05-07T19:54:37.8293387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8295456Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8296294Z ^ 2025-05-07T19:54:37.8300183Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:37.8303869Z 2025-05-07T19:54:37.8305255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8307230Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8308115Z ^ 2025-05-07T19:54:37.8312085Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:37.8315704Z 2025-05-07T19:54:37.8317079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8319167Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8320025Z ^ 2025-05-07T19:54:37.8323948Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:37.8327671Z 2025-05-07T19:54:37.8329141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8331554Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8332486Z ^ 2025-05-07T19:54:37.8336306Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:37.8340108Z 2025-05-07T19:54:37.8341768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8343947Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8344912Z ^ 2025-05-07T19:54:37.8348381Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:37.8352180Z 2025-05-07T19:54:37.8353666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8355703Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8356654Z ^ 2025-05-07T19:54:37.8360294Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:37.8363845Z 2025-05-07T19:54:37.8365263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8367456Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8368427Z ^ 2025-05-07T19:54:37.8372358Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:37.8376023Z 2025-05-07T19:54:37.8377861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.8380899Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.8382469Z ^ 2025-05-07T19:54:37.8382757Z 2025-05-07T19:54:37.8383255Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.8384007Z 2025-05-07T19:54:37.8385804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.8388743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.8389924Z ^ 2025-05-07T19:54:37.8390465Z 2025-05-07T19:54:37.8391876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8394015Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8395021Z ^ 2025-05-07T19:54:37.8399002Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:37.8402675Z 2025-05-07T19:54:37.8404247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8406602Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8407597Z ^ 2025-05-07T19:54:37.8411413Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:37.8414932Z 2025-05-07T19:54:37.8416350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8418553Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8419663Z ^ 2025-05-07T19:54:37.8423764Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:37.8427215Z 2025-05-07T19:54:37.8428637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8430832Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8431808Z ^ 2025-05-07T19:54:37.8435733Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:37.8439298Z 2025-05-07T19:54:37.8440648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8442737Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8443702Z ^ 2025-05-07T19:54:37.8447442Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:37.8451325Z 2025-05-07T19:54:37.8452857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8455034Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8456019Z ^ 2025-05-07T19:54:37.8459924Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:37.8463261Z 2025-05-07T19:54:37.8464556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8466434Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8467350Z ^ 2025-05-07T19:54:37.8471118Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:37.8474557Z 2025-05-07T19:54:37.8475934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8477887Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8478786Z ^ 2025-05-07T19:54:37.8482450Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:37.8485988Z 2025-05-07T19:54:37.8487399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8489440Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8490315Z ^ 2025-05-07T19:54:37.8493479Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:37.8496978Z 2025-05-07T19:54:37.8498301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8500349Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8501183Z ^ 2025-05-07T19:54:37.8504523Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:37.8507793Z 2025-05-07T19:54:37.8509128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8511234Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8512169Z ^ 2025-05-07T19:54:37.8515751Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:37.8519001Z 2025-05-07T19:54:37.8520435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8522555Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8523469Z ^ 2025-05-07T19:54:37.8526976Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:37.8530405Z 2025-05-07T19:54:37.8531900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8533898Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8534812Z ^ 2025-05-07T19:54:37.8538212Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:37.8542068Z 2025-05-07T19:54:37.8543503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8545906Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8546882Z ^ 2025-05-07T19:54:37.8550521Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:37.8554002Z 2025-05-07T19:54:37.8555449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8557431Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8558320Z ^ 2025-05-07T19:54:37.8561790Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:37.8564799Z 2025-05-07T19:54:37.8566041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8567971Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8568907Z ^ 2025-05-07T19:54:37.8572527Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:37.8575861Z 2025-05-07T19:54:37.8577148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8579123Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8580199Z ^ 2025-05-07T19:54:37.8584010Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:37.8587756Z 2025-05-07T19:54:37.8589011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8591113Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8592069Z ^ 2025-05-07T19:54:37.8595806Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:37.8599263Z 2025-05-07T19:54:37.8600618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8602726Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8603785Z ^ 2025-05-07T19:54:37.8607672Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:37.8611083Z 2025-05-07T19:54:37.8612469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8614621Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8615502Z ^ 2025-05-07T19:54:37.8619060Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:37.8622909Z 2025-05-07T19:54:37.8624207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8626215Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8627093Z ^ 2025-05-07T19:54:37.8630636Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:37.8634619Z 2025-05-07T19:54:37.8635948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8637895Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8639027Z ^ 2025-05-07T19:54:37.8642911Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:37.8646422Z 2025-05-07T19:54:37.8647793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8649682Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8650534Z ^ 2025-05-07T19:54:37.8654029Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:37.8657244Z 2025-05-07T19:54:37.8658561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:37.8660840Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:37.8661814Z ^ 2025-05-07T19:54:37.8665590Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:37.8669016Z 2025-05-07T19:54:37.8762453Z [186/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o 2025-05-07T19:54:37.8784929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.8787664Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.8788802Z ^ 2025-05-07T19:54:37.8789100Z 2025-05-07T19:54:37.8789536Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.8790178Z 2025-05-07T19:54:37.8791720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.8794409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.8795657Z ^ 2025-05-07T19:54:37.8796030Z 2025-05-07T19:54:37.8797817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.8800529Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.8801780Z ^ 2025-05-07T19:54:37.8802033Z 2025-05-07T19:54:37.8802491Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.8803164Z 2025-05-07T19:54:37.8804904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.8807744Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.8808974Z ^ 2025-05-07T19:54:37.8809371Z 2025-05-07T19:54:37.8810987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.8814026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.8815210Z ^ 2025-05-07T19:54:37.8815477Z 2025-05-07T19:54:37.8816086Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.8816745Z 2025-05-07T19:54:37.8818287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.8821129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.8822525Z ^ 2025-05-07T19:54:37.8822873Z 2025-05-07T19:54:37.8824551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.8827051Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.8828188Z ^ 2025-05-07T19:54:37.8828443Z 2025-05-07T19:54:37.8828858Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.8829478Z 2025-05-07T19:54:37.8831077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.8833900Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.8835243Z ^ 2025-05-07T19:54:37.8835618Z 2025-05-07T19:54:37.8837204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.8839783Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.8841033Z ^ 2025-05-07T19:54:37.8841280Z 2025-05-07T19:54:37.8841739Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.8842425Z 2025-05-07T19:54:37.8844211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.8847138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.8848407Z ^ 2025-05-07T19:54:37.8848796Z 2025-05-07T19:54:40.7282280Z [187/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:40.7300504Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:40.7593062Z [188/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:40.7611255Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:40.7901497Z [189/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:40.7921650Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:40.8207980Z [190/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:54:40.8228737Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:40.8514841Z [191/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:40.8532921Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:40.8822403Z [192/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:40.8840297Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:40.9126643Z [193/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:40.9144641Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:45.3325894Z [194/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:45.3344972Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:46.8933263Z [195/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o 2025-05-07T19:54:46.8956151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8958718Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8959807Z ^ 2025-05-07T19:54:46.8960025Z 2025-05-07T19:54:46.8960433Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.8961060Z 2025-05-07T19:54:46.8962536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8964967Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8966151Z ^ 2025-05-07T19:54:46.8966509Z 2025-05-07T19:54:46.8968189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8970862Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8971983Z ^ 2025-05-07T19:54:46.8972239Z 2025-05-07T19:54:46.8972691Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.8973303Z 2025-05-07T19:54:46.8974781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8977311Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8978538Z ^ 2025-05-07T19:54:46.8978909Z 2025-05-07T19:54:46.8980748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8983125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8984133Z ^ 2025-05-07T19:54:46.8984342Z 2025-05-07T19:54:46.8984737Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.8985673Z 2025-05-07T19:54:46.8987171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8989688Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8990756Z ^ 2025-05-07T19:54:46.8991092Z 2025-05-07T19:54:46.8992605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8995273Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8996478Z ^ 2025-05-07T19:54:46.8996729Z 2025-05-07T19:54:46.8997194Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.8997861Z 2025-05-07T19:54:46.8999523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.9002223Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.9003407Z ^ 2025-05-07T19:54:46.9003779Z 2025-05-07T19:54:46.9005512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.9007925Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.9009090Z ^ 2025-05-07T19:54:46.9009332Z 2025-05-07T19:54:46.9009778Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.9010373Z 2025-05-07T19:54:46.9011910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.9014524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.9015739Z ^ 2025-05-07T19:54:46.9016094Z 2025-05-07T19:54:47.0522672Z [196/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:47.0546414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.0549282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.0550376Z ^ 2025-05-07T19:54:47.0550639Z 2025-05-07T19:54:47.0551038Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:47.0551665Z 2025-05-07T19:54:47.0553246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.0555868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.0557108Z ^ 2025-05-07T19:54:47.0557480Z 2025-05-07T19:54:47.0559224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.0561900Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.0563031Z ^ 2025-05-07T19:54:47.0563270Z 2025-05-07T19:54:47.0563721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:47.0564376Z 2025-05-07T19:54:47.0566131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.0568950Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.0570157Z ^ 2025-05-07T19:54:47.0570548Z 2025-05-07T19:54:47.0572023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.0578674Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.0579850Z ^ 2025-05-07T19:54:47.0580085Z 2025-05-07T19:54:47.0580464Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:47.0581065Z 2025-05-07T19:54:47.0582932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.0585581Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.0586799Z ^ 2025-05-07T19:54:47.0587169Z 2025-05-07T19:54:47.0588859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.0591441Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.0592496Z ^ 2025-05-07T19:54:47.0592731Z 2025-05-07T19:54:47.0593174Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:47.0593755Z 2025-05-07T19:54:47.0595203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.0597786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.0598892Z ^ 2025-05-07T19:54:47.0599247Z 2025-05-07T19:54:47.0600913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.0603646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.0604715Z ^ 2025-05-07T19:54:47.0604940Z 2025-05-07T19:54:47.0605380Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:47.0606010Z 2025-05-07T19:54:47.0607494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.0610059Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.0611208Z ^ 2025-05-07T19:54:47.0611571Z 2025-05-07T19:54:48.6334304Z [197/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:48.6354288Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:48.6651591Z [198/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:54:48.6675067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.6678005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.6679185Z ^ 2025-05-07T19:54:48.6679469Z 2025-05-07T19:54:48.6679925Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:48.6680608Z 2025-05-07T19:54:48.6682351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.6685044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.6686107Z ^ 2025-05-07T19:54:48.6686440Z 2025-05-07T19:54:48.6687977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.6690603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.6691777Z ^ 2025-05-07T19:54:48.6692028Z 2025-05-07T19:54:48.6692475Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:48.6693155Z 2025-05-07T19:54:48.6694863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.6697582Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.6698763Z ^ 2025-05-07T19:54:48.6699140Z 2025-05-07T19:54:48.6700881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.6703410Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.6704478Z ^ 2025-05-07T19:54:48.6704701Z 2025-05-07T19:54:48.6705086Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:48.6705721Z 2025-05-07T19:54:48.6707388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.6710115Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.6711252Z ^ 2025-05-07T19:54:48.6711596Z 2025-05-07T19:54:48.6713202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.6715940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.6717162Z ^ 2025-05-07T19:54:48.6717693Z 2025-05-07T19:54:48.6718156Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:48.6718774Z 2025-05-07T19:54:48.6720450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.6723481Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.6724651Z ^ 2025-05-07T19:54:48.6725041Z 2025-05-07T19:54:48.6726618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.6728851Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.6730047Z ^ 2025-05-07T19:54:48.6730303Z 2025-05-07T19:54:48.6730756Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:48.6731371Z 2025-05-07T19:54:48.6732907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.6735750Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.6737015Z ^ 2025-05-07T19:54:48.6737396Z 2025-05-07T19:54:48.7109036Z [199/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:48.7132639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.7135366Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.7136402Z ^ 2025-05-07T19:54:48.7136664Z 2025-05-07T19:54:48.7137123Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:48.7137796Z 2025-05-07T19:54:48.7139296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.7142099Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.7143263Z ^ 2025-05-07T19:54:48.7143643Z 2025-05-07T19:54:48.7145250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.7147904Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.7148847Z ^ 2025-05-07T19:54:48.7149086Z 2025-05-07T19:54:48.7149468Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:48.7150088Z 2025-05-07T19:54:48.7151782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.7154499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.7155732Z ^ 2025-05-07T19:54:48.7156103Z 2025-05-07T19:54:48.7157819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.7160507Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.7161603Z ^ 2025-05-07T19:54:48.7161854Z 2025-05-07T19:54:48.7162289Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:48.7162942Z 2025-05-07T19:54:48.7164439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.7166908Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.7168113Z ^ 2025-05-07T19:54:48.7168458Z 2025-05-07T19:54:48.7170023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.7173076Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.7173918Z ^ 2025-05-07T19:54:48.7174123Z 2025-05-07T19:54:48.7174649Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:48.7175287Z 2025-05-07T19:54:48.7176888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.7179686Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.7181068Z ^ 2025-05-07T19:54:48.7181440Z 2025-05-07T19:54:48.7183093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.7185783Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.7186953Z ^ 2025-05-07T19:54:48.7187169Z 2025-05-07T19:54:48.7187537Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:48.7188199Z 2025-05-07T19:54:48.7189686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.7192192Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:48.7193402Z ^ 2025-05-07T19:54:48.7193774Z 2025-05-07T19:54:49.5000153Z [200/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o 2025-05-07T19:54:49.5024661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.5027281Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.5028396Z ^ 2025-05-07T19:54:49.5028635Z 2025-05-07T19:54:49.5029060Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:49.5029706Z 2025-05-07T19:54:49.5031278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.5033853Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.5035000Z ^ 2025-05-07T19:54:49.5035371Z 2025-05-07T19:54:49.5037021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5038930Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.5039475Z ^ 2025-05-07T19:54:49.5039780Z 2025-05-07T19:54:49.5041408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5043419Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.5043970Z ^ 2025-05-07T19:54:49.5044369Z 2025-05-07T19:54:49.5046022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5048107Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.5048664Z ^ 2025-05-07T19:54:49.5048959Z 2025-05-07T19:54:49.5050711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.5053444Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.5054673Z ^ 2025-05-07T19:54:49.5054928Z 2025-05-07T19:54:49.5055408Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:49.5056093Z 2025-05-07T19:54:49.5057830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.5061268Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.5062467Z ^ 2025-05-07T19:54:49.5062849Z 2025-05-07T19:54:49.5064682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5066578Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.5067137Z ^ 2025-05-07T19:54:49.5067441Z 2025-05-07T19:54:49.5069041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5071026Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.5071578Z ^ 2025-05-07T19:54:49.5071873Z 2025-05-07T19:54:49.5073547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5075576Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.5076153Z ^ 2025-05-07T19:54:49.5076458Z 2025-05-07T19:54:49.5078173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.5080819Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.5082043Z ^ 2025-05-07T19:54:49.5082301Z 2025-05-07T19:54:49.5082756Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:49.5083426Z 2025-05-07T19:54:49.5085121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.5087852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.5089061Z ^ 2025-05-07T19:54:49.5089450Z 2025-05-07T19:54:49.5091099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5093147Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.5093705Z ^ 2025-05-07T19:54:49.5094015Z 2025-05-07T19:54:49.5095613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5097670Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.5098180Z ^ 2025-05-07T19:54:49.5098478Z 2025-05-07T19:54:49.5100185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5102110Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.5102608Z ^ 2025-05-07T19:54:49.5102872Z 2025-05-07T19:54:49.5104712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.5107690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.5109019Z ^ 2025-05-07T19:54:49.5109284Z 2025-05-07T19:54:49.5109756Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:49.5110471Z 2025-05-07T19:54:49.5112223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.5115014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.5116264Z ^ 2025-05-07T19:54:49.5116626Z 2025-05-07T19:54:49.5118226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5120336Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.5120923Z ^ 2025-05-07T19:54:49.5121255Z 2025-05-07T19:54:49.5123211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5125349Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.5125923Z ^ 2025-05-07T19:54:49.5126231Z 2025-05-07T19:54:49.5127948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5130147Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.5130748Z ^ 2025-05-07T19:54:49.5131054Z 2025-05-07T19:54:49.5132857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.5135645Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.5136828Z ^ 2025-05-07T19:54:49.5137082Z 2025-05-07T19:54:49.5137551Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:49.5138273Z 2025-05-07T19:54:49.5140094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.5143106Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.5144272Z ^ 2025-05-07T19:54:49.5144644Z 2025-05-07T19:54:49.5146171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5148142Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.5148733Z ^ 2025-05-07T19:54:49.5149049Z 2025-05-07T19:54:49.5150798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5153594Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.5154196Z ^ 2025-05-07T19:54:49.5154508Z 2025-05-07T19:54:49.5156432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.5158599Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.5159197Z ^ 2025-05-07T19:54:49.5159512Z 2025-05-07T19:54:50.4797121Z [201/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:50.4815870Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:51.5033679Z [202/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:51.5056940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.5060041Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.5061289Z ^ 2025-05-07T19:54:51.5061547Z 2025-05-07T19:54:51.5062002Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:51.5062683Z 2025-05-07T19:54:51.5064430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.5067031Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.5068226Z ^ 2025-05-07T19:54:51.5068576Z 2025-05-07T19:54:51.5070285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.5072831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.5073938Z ^ 2025-05-07T19:54:51.5074183Z 2025-05-07T19:54:51.5074678Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:51.5075347Z 2025-05-07T19:54:51.5076890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.5079820Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.5081087Z ^ 2025-05-07T19:54:51.5081459Z 2025-05-07T19:54:51.5082985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.5085806Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.5087023Z ^ 2025-05-07T19:54:51.5087270Z 2025-05-07T19:54:51.5087686Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:51.5088294Z 2025-05-07T19:54:51.5089878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.5092445Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.5093666Z ^ 2025-05-07T19:54:51.5094040Z 2025-05-07T19:54:51.5095579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.5098134Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.5099254Z ^ 2025-05-07T19:54:51.5099697Z 2025-05-07T19:54:51.5100166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:51.5100803Z 2025-05-07T19:54:51.5102430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.5105141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.5106339Z ^ 2025-05-07T19:54:51.5106729Z 2025-05-07T19:54:51.5108435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.5111153Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.5112291Z ^ 2025-05-07T19:54:51.5112509Z 2025-05-07T19:54:51.5112922Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:51.5113566Z 2025-05-07T19:54:51.5115186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.5117837Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.5119028Z ^ 2025-05-07T19:54:51.5119393Z 2025-05-07T19:54:51.6065625Z [203/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:51.6085637Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:52.3531115Z [204/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:52.3551686Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:52.6754068Z [205/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:54:52.6773667Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:53.7903554Z [206/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:53.7924999Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.4575882Z [207/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:54:55.4596621Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.4845434Z [208/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:56.4867329Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.7693293Z [209/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:57.7714845Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.1000918Z [210/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:55:01.1019156Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.1492707Z [211/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:55:01.1511783Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.4491231Z [212/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:55:01.4508875Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.4681751Z [213/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:55:02.4700522Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.6542898Z [214/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:55:02.6560392Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:03.2180783Z [215/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:55:03.2201280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.2203789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:03.2204846Z ^ 2025-05-07T19:55:03.2205073Z 2025-05-07T19:55:03.2205457Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.2206049Z 2025-05-07T19:55:03.2207534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.2209965Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:03.2210994Z ^ 2025-05-07T19:55:03.2211343Z 2025-05-07T19:55:03.2212830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.2215221Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:03.2216325Z ^ 2025-05-07T19:55:03.2216562Z 2025-05-07T19:55:03.2216964Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.2217526Z 2025-05-07T19:55:03.2218997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.2221818Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:03.2223108Z ^ 2025-05-07T19:55:03.2223686Z 2025-05-07T19:55:03.2225130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.2227441Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:03.2228471Z ^ 2025-05-07T19:55:03.2228704Z 2025-05-07T19:55:03.2229099Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.2229734Z 2025-05-07T19:55:03.2231249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.2233616Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:03.2234658Z ^ 2025-05-07T19:55:03.2234997Z 2025-05-07T19:55:03.2236441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.2238829Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:03.2239882Z ^ 2025-05-07T19:55:03.2240118Z 2025-05-07T19:55:03.2240527Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.2241131Z 2025-05-07T19:55:03.2242604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.2245051Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:03.2246165Z ^ 2025-05-07T19:55:03.2246468Z 2025-05-07T19:55:03.2247844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.2250110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:03.2251117Z ^ 2025-05-07T19:55:03.2251354Z 2025-05-07T19:55:03.2251724Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.2252282Z 2025-05-07T19:55:03.2253719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.2256024Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:03.2257119Z ^ 2025-05-07T19:55:03.2257466Z 2025-05-07T19:55:03.3724648Z [216/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:55:03.3742546Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:04.2151838Z [217/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:55:04.2173703Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:04.3128261Z [218/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:55:04.3146377Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:05.0773280Z [219/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:55:05.0791346Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:05.4085490Z [220/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:05.4107963Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:05.7043501Z [221/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:55:05.7061903Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:06.3795493Z [222/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:55:06.3814950Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:06.6249749Z [223/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:55:06.6268153Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:06.9718022Z [224/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:55:06.9735713Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:07.1212533Z [225/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_pt2.so -o fbgemm_gpu_tbe_training_backward_pt2.so CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T19:55:07.5604441Z [226/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:07.5622635Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:08.6886007Z [227/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:08.6905418Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:09.2549876Z [228/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:09.2570292Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:10.3035109Z [229/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:10.3054338Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:11.4322941Z [230/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o 2025-05-07T19:55:11.4347591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:11.4350340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:11.4351517Z ^ 2025-05-07T19:55:11.4351769Z 2025-05-07T19:55:11.4352205Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:11.4352823Z 2025-05-07T19:55:11.4354487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:11.4357098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:11.4358319Z ^ 2025-05-07T19:55:11.4358684Z 2025-05-07T19:55:11.4360280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:11.4362272Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:11.4362866Z ^ 2025-05-07T19:55:11.4363175Z 2025-05-07T19:55:11.4364848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:11.4366903Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:11.4367505Z ^ 2025-05-07T19:55:11.4367804Z 2025-05-07T19:55:11.4369443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:11.4371506Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:11.4372070Z ^ 2025-05-07T19:55:11.4372389Z 2025-05-07T19:55:11.4374099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:11.4376806Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:11.4377995Z ^ 2025-05-07T19:55:11.4378289Z 2025-05-07T19:55:11.4378752Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:11.4379550Z 2025-05-07T19:55:11.4381307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:11.4384109Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:11.4385289Z ^ 2025-05-07T19:55:11.4385669Z 2025-05-07T19:55:11.4387447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:11.4389549Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:11.4390249Z ^ 2025-05-07T19:55:11.4390566Z 2025-05-07T19:55:11.4392229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:11.4394273Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:11.4394878Z ^ 2025-05-07T19:55:11.4395167Z 2025-05-07T19:55:11.4396799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:11.4398830Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:11.4399396Z ^ 2025-05-07T19:55:11.4399736Z 2025-05-07T19:55:11.4401360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:11.4404129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:11.4405273Z ^ 2025-05-07T19:55:11.4405562Z 2025-05-07T19:55:11.4406011Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:11.4406702Z 2025-05-07T19:55:11.4408405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:11.4411341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:11.4412474Z ^ 2025-05-07T19:55:11.4412848Z 2025-05-07T19:55:11.4414476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:11.4416579Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:11.4417182Z ^ 2025-05-07T19:55:11.4417463Z 2025-05-07T19:55:11.4418787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:11.4420913Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:11.4421488Z ^ 2025-05-07T19:55:11.4421819Z 2025-05-07T19:55:11.4423568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:11.4425598Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:11.4426151Z ^ 2025-05-07T19:55:11.4426682Z 2025-05-07T19:55:11.4428274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:11.4431040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:11.4432235Z ^ 2025-05-07T19:55:11.4432490Z 2025-05-07T19:55:11.4432976Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:11.4433662Z 2025-05-07T19:55:11.4435321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:11.4438213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:11.4439463Z ^ 2025-05-07T19:55:11.4439843Z 2025-05-07T19:55:11.4441471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:11.4443549Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:11.4444161Z ^ 2025-05-07T19:55:11.4444470Z 2025-05-07T19:55:11.4446048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:11.4448055Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:11.4448628Z ^ 2025-05-07T19:55:11.4448959Z 2025-05-07T19:55:11.4450497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:11.4452416Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:11.4452983Z ^ 2025-05-07T19:55:11.4453320Z 2025-05-07T19:55:11.4454963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:11.4457746Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:11.4458969Z ^ 2025-05-07T19:55:11.4459241Z 2025-05-07T19:55:11.4459846Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:11.4460509Z 2025-05-07T19:55:11.4462148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:11.4464827Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:11.4466081Z ^ 2025-05-07T19:55:11.4466462Z 2025-05-07T19:55:11.4468112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:11.4470210Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:11.4470796Z ^ 2025-05-07T19:55:11.4471115Z 2025-05-07T19:55:11.4472835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:11.4474903Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:11.4475473Z ^ 2025-05-07T19:55:11.4475795Z 2025-05-07T19:55:11.4477587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:11.4479672Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:11.4480322Z ^ 2025-05-07T19:55:11.4480620Z 2025-05-07T19:55:13.6731776Z [231/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:13.6754277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.6756832Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.6757920Z ^ 2025-05-07T19:55:13.6758165Z 2025-05-07T19:55:13.6758588Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.6759558Z 2025-05-07T19:55:13.6761140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.6763944Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.6764861Z ^ 2025-05-07T19:55:13.6765175Z 2025-05-07T19:55:13.6766644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6768841Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:13.6769588Z ^ 2025-05-07T19:55:13.6769896Z 2025-05-07T19:55:13.6771367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6773248Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.6773813Z ^ 2025-05-07T19:55:13.6774096Z 2025-05-07T19:55:13.6775538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6777422Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.6777923Z ^ 2025-05-07T19:55:13.6778205Z 2025-05-07T19:55:13.6779800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6781726Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.6782231Z ^ 2025-05-07T19:55:13.6782486Z 2025-05-07T19:55:13.6784022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.6786621Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.6787622Z ^ 2025-05-07T19:55:13.6787888Z 2025-05-07T19:55:13.6788340Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.6788999Z 2025-05-07T19:55:13.6790628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.6793288Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.6794466Z ^ 2025-05-07T19:55:13.6794828Z 2025-05-07T19:55:13.6796373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6798456Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:13.6799204Z ^ 2025-05-07T19:55:13.6799482Z 2025-05-07T19:55:13.6800998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6803021Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.6803547Z ^ 2025-05-07T19:55:13.6803819Z 2025-05-07T19:55:13.6805513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6807434Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.6807974Z ^ 2025-05-07T19:55:13.6808256Z 2025-05-07T19:55:13.6809704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6811663Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.6812197Z ^ 2025-05-07T19:55:13.6812468Z 2025-05-07T19:55:13.6814037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.6816522Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.6817659Z ^ 2025-05-07T19:55:13.6817903Z 2025-05-07T19:55:13.6818358Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.6818980Z 2025-05-07T19:55:13.6820728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.6823512Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.6824657Z ^ 2025-05-07T19:55:13.6825011Z 2025-05-07T19:55:13.6826491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6828423Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:13.6829110Z ^ 2025-05-07T19:55:13.6829413Z 2025-05-07T19:55:13.6830866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6832776Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.6833284Z ^ 2025-05-07T19:55:13.6833570Z 2025-05-07T19:55:13.6834996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6836834Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.6837382Z ^ 2025-05-07T19:55:13.6837644Z 2025-05-07T19:55:13.6839118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6841009Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.6841550Z ^ 2025-05-07T19:55:13.6841804Z 2025-05-07T19:55:13.6843355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.6846110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.6847200Z ^ 2025-05-07T19:55:13.6847439Z 2025-05-07T19:55:13.6848147Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.6848797Z 2025-05-07T19:55:13.6850339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.6853120Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.6854280Z ^ 2025-05-07T19:55:13.6854663Z 2025-05-07T19:55:13.6856165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6858279Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:13.6859015Z ^ 2025-05-07T19:55:13.6859293Z 2025-05-07T19:55:13.6860800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6862660Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.6863223Z ^ 2025-05-07T19:55:13.6863502Z 2025-05-07T19:55:13.6865059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6866838Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.6867366Z ^ 2025-05-07T19:55:13.6867624Z 2025-05-07T19:55:13.6869058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6870833Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.6871337Z ^ 2025-05-07T19:55:13.6871582Z 2025-05-07T19:55:13.6873085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.6875641Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.6876712Z ^ 2025-05-07T19:55:13.6876967Z 2025-05-07T19:55:13.6877389Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.6878016Z 2025-05-07T19:55:13.6879638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.6882198Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.6883360Z ^ 2025-05-07T19:55:13.6883688Z 2025-05-07T19:55:13.6884988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6887182Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:13.6887917Z ^ 2025-05-07T19:55:13.6888191Z 2025-05-07T19:55:13.6889880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6891731Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.6892215Z ^ 2025-05-07T19:55:13.6892488Z 2025-05-07T19:55:13.6894057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6895920Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.6896462Z ^ 2025-05-07T19:55:13.6896727Z 2025-05-07T19:55:13.6898207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.6900281Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.6900794Z ^ 2025-05-07T19:55:13.6901066Z 2025-05-07T19:55:13.7221606Z [232/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:13.7241304Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:13.9313537Z [233/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:55:13.9332518Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:15.1272655Z [234/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:55:15.1290865Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:16.6884270Z [235/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:55:16.6905221Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:16.7050970Z [236/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:55:16.7072136Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:20.2565237Z [237/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:20.2588014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:20.2590998Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:20.2592147Z ^ 2025-05-07T19:55:20.2592371Z 2025-05-07T19:55:20.2592785Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:20.2593423Z 2025-05-07T19:55:20.2595420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:20.2598011Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:20.2599346Z ^ 2025-05-07T19:55:20.2599719Z 2025-05-07T19:55:20.2601150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2603146Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:20.2603838Z ^ 2025-05-07T19:55:20.2604137Z 2025-05-07T19:55:20.2605594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2607484Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:20.2608034Z ^ 2025-05-07T19:55:20.2608318Z 2025-05-07T19:55:20.2609814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2611673Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:20.2612223Z ^ 2025-05-07T19:55:20.2612504Z 2025-05-07T19:55:20.2613992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2615880Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:20.2616418Z ^ 2025-05-07T19:55:20.2616694Z 2025-05-07T19:55:20.2618291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:20.2621074Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:20.2622461Z ^ 2025-05-07T19:55:20.2622717Z 2025-05-07T19:55:20.2623143Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:20.2623785Z 2025-05-07T19:55:20.2625414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:20.2627944Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:20.2629061Z ^ 2025-05-07T19:55:20.2629426Z 2025-05-07T19:55:20.2630850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2632868Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:20.2633816Z ^ 2025-05-07T19:55:20.2634086Z 2025-05-07T19:55:20.2635545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2637632Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:20.2638166Z ^ 2025-05-07T19:55:20.2638448Z 2025-05-07T19:55:20.2639936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2641980Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:20.2642504Z ^ 2025-05-07T19:55:20.2642776Z 2025-05-07T19:55:20.2644215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2646095Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:20.2646645Z ^ 2025-05-07T19:55:20.2646915Z 2025-05-07T19:55:20.2648502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:20.2650996Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:20.2652109Z ^ 2025-05-07T19:55:20.2652350Z 2025-05-07T19:55:20.2652737Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:20.2653327Z 2025-05-07T19:55:20.2654891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:20.2657422Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:20.2658590Z ^ 2025-05-07T19:55:20.2658940Z 2025-05-07T19:55:20.2660523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2662452Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:20.2663172Z ^ 2025-05-07T19:55:20.2663448Z 2025-05-07T19:55:20.2664865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2666773Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:20.2667294Z ^ 2025-05-07T19:55:20.2667570Z 2025-05-07T19:55:20.2669085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2670931Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:20.2671473Z ^ 2025-05-07T19:55:20.2671748Z 2025-05-07T19:55:20.2673235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2675245Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:20.2675767Z ^ 2025-05-07T19:55:20.2676044Z 2025-05-07T19:55:20.2677772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:20.2680331Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:20.2681420Z ^ 2025-05-07T19:55:20.2681662Z 2025-05-07T19:55:20.2682067Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:20.2682839Z 2025-05-07T19:55:20.2684416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:20.2686903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:20.2687936Z ^ 2025-05-07T19:55:20.2688246Z 2025-05-07T19:55:20.2689663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2691690Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:20.2692419Z ^ 2025-05-07T19:55:20.2692701Z 2025-05-07T19:55:20.2694160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2695853Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:20.2696355Z ^ 2025-05-07T19:55:20.2696605Z 2025-05-07T19:55:20.2698013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2700026Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:20.2700543Z ^ 2025-05-07T19:55:20.2700801Z 2025-05-07T19:55:20.2702092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2703906Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:20.2704397Z ^ 2025-05-07T19:55:20.2704655Z 2025-05-07T19:55:20.2706210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:20.2708689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:20.2709793Z ^ 2025-05-07T19:55:20.2710054Z 2025-05-07T19:55:20.2710468Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:20.2711070Z 2025-05-07T19:55:20.2712618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:20.2715182Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:20.2716504Z ^ 2025-05-07T19:55:20.2716848Z 2025-05-07T19:55:20.2718323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2720421Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:20.2721098Z ^ 2025-05-07T19:55:20.2721365Z 2025-05-07T19:55:20.2723057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2724942Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:20.2725454Z ^ 2025-05-07T19:55:20.2725696Z 2025-05-07T19:55:20.2727042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2728804Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:20.2729255Z ^ 2025-05-07T19:55:20.2729514Z 2025-05-07T19:55:20.2730942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:20.2732709Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:20.2733236Z ^ 2025-05-07T19:55:20.2733511Z 2025-05-07T19:55:25.8481495Z [238/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o 2025-05-07T19:55:25.8505135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:25.8507930Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:25.8509232Z ^ 2025-05-07T19:55:25.8509513Z 2025-05-07T19:55:25.8509938Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:25.8510576Z 2025-05-07T19:55:25.8512236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:25.8514905Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:25.8516094Z ^ 2025-05-07T19:55:25.8516458Z 2025-05-07T19:55:25.8518083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:25.8520749Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:25.8522241Z ^ 2025-05-07T19:55:25.8522492Z 2025-05-07T19:55:25.8522906Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:25.8523570Z 2025-05-07T19:55:25.8525203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:25.8527887Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:25.8529090Z ^ 2025-05-07T19:55:25.8529470Z 2025-05-07T19:55:25.8531009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:25.8533693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:25.8534828Z ^ 2025-05-07T19:55:25.8535067Z 2025-05-07T19:55:25.8535525Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:25.8536173Z 2025-05-07T19:55:25.8537793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:25.8540616Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:25.8541791Z ^ 2025-05-07T19:55:25.8542161Z 2025-05-07T19:55:25.8543766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:25.8546697Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:25.8547899Z ^ 2025-05-07T19:55:25.8548147Z 2025-05-07T19:55:25.8548567Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:25.8549233Z 2025-05-07T19:55:25.8551112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:25.8553830Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:25.8555082Z ^ 2025-05-07T19:55:25.8555457Z 2025-05-07T19:55:25.8557122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:25.8559729Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:25.8560895Z ^ 2025-05-07T19:55:25.8561145Z 2025-05-07T19:55:25.8561613Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:25.8562251Z 2025-05-07T19:55:25.8563904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:25.8566580Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:25.8567756Z ^ 2025-05-07T19:55:25.8568137Z 2025-05-07T19:55:30.4717487Z [239/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:30.4739964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.4742640Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.4743787Z ^ 2025-05-07T19:55:30.4744057Z 2025-05-07T19:55:30.4744514Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:30.4745120Z 2025-05-07T19:55:30.4746725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.4749270Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.4750427Z ^ 2025-05-07T19:55:30.4750792Z 2025-05-07T19:55:30.4752214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4754240Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:30.4754959Z ^ 2025-05-07T19:55:30.4755240Z 2025-05-07T19:55:30.4756631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4758355Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.4758844Z ^ 2025-05-07T19:55:30.4759095Z 2025-05-07T19:55:30.4760445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4762258Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.4762746Z ^ 2025-05-07T19:55:30.4763030Z 2025-05-07T19:55:30.4764385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4766197Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.4766726Z ^ 2025-05-07T19:55:30.4766976Z 2025-05-07T19:55:30.4768496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.4770985Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.4772092Z ^ 2025-05-07T19:55:30.4772332Z 2025-05-07T19:55:30.4772757Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:30.4773678Z 2025-05-07T19:55:30.4775188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.4777957Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.4779088Z ^ 2025-05-07T19:55:30.4779433Z 2025-05-07T19:55:30.4780943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4783042Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:30.4783678Z ^ 2025-05-07T19:55:30.4783995Z 2025-05-07T19:55:30.4785411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4787282Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.4787828Z ^ 2025-05-07T19:55:30.4788096Z 2025-05-07T19:55:30.4789513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4791370Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.4791898Z ^ 2025-05-07T19:55:30.4792161Z 2025-05-07T19:55:30.4793596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4795422Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.4795964Z ^ 2025-05-07T19:55:30.4796225Z 2025-05-07T19:55:30.4797632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.4800127Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.4801224Z ^ 2025-05-07T19:55:30.4801482Z 2025-05-07T19:55:30.4801898Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:30.4802549Z 2025-05-07T19:55:30.4804070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.4806670Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.4807729Z ^ 2025-05-07T19:55:30.4808103Z 2025-05-07T19:55:30.4809524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4811533Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:30.4812270Z ^ 2025-05-07T19:55:30.4812524Z 2025-05-07T19:55:30.4813949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4815959Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.4816544Z ^ 2025-05-07T19:55:30.4816833Z 2025-05-07T19:55:30.4818451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4820376Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.4820924Z ^ 2025-05-07T19:55:30.4821196Z 2025-05-07T19:55:30.4822816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4824825Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.4825364Z ^ 2025-05-07T19:55:30.4825647Z 2025-05-07T19:55:30.4827190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.4829610Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.4830711Z ^ 2025-05-07T19:55:30.4830968Z 2025-05-07T19:55:30.4831382Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:30.4831980Z 2025-05-07T19:55:30.4833545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.4836042Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.4837172Z ^ 2025-05-07T19:55:30.4837494Z 2025-05-07T19:55:30.4838965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4840930Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:30.4841662Z ^ 2025-05-07T19:55:30.4841919Z 2025-05-07T19:55:30.4843328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4845081Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.4845645Z ^ 2025-05-07T19:55:30.4845895Z 2025-05-07T19:55:30.4847338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4849128Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.4849633Z ^ 2025-05-07T19:55:30.4849919Z 2025-05-07T19:55:30.4851321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4853172Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.4853712Z ^ 2025-05-07T19:55:30.4854006Z 2025-05-07T19:55:30.4855536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.4858293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.4859347Z ^ 2025-05-07T19:55:30.4859667Z 2025-05-07T19:55:30.4860298Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:30.4860910Z 2025-05-07T19:55:30.4862420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.4865007Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.4866063Z ^ 2025-05-07T19:55:30.4866431Z 2025-05-07T19:55:30.4867870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4869843Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:30.4870545Z ^ 2025-05-07T19:55:30.4870811Z 2025-05-07T19:55:30.4872248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4874023Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.4874557Z ^ 2025-05-07T19:55:30.4874862Z 2025-05-07T19:55:30.4876268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4878134Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.4878643Z ^ 2025-05-07T19:55:30.4878926Z 2025-05-07T19:55:30.4880372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.4882232Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.4882780Z ^ 2025-05-07T19:55:30.4883040Z 2025-05-07T19:55:33.8181393Z [240/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:33.8201290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.8203681Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.8204800Z ^ 2025-05-07T19:55:33.8205043Z 2025-05-07T19:55:33.8205449Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:33.8206092Z 2025-05-07T19:55:33.8207642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.8210126Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.8211286Z ^ 2025-05-07T19:55:33.8211705Z 2025-05-07T19:55:33.8213280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8215395Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:33.8216111Z ^ 2025-05-07T19:55:33.8216385Z 2025-05-07T19:55:33.8217951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8220036Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8220624Z ^ 2025-05-07T19:55:33.8220920Z 2025-05-07T19:55:33.8222728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8224756Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8225357Z ^ 2025-05-07T19:55:33.8225642Z 2025-05-07T19:55:33.8227274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8229294Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8230068Z ^ 2025-05-07T19:55:33.8230336Z 2025-05-07T19:55:33.8232052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.8234978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.8236201Z ^ 2025-05-07T19:55:33.8236453Z 2025-05-07T19:55:33.8236908Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:33.8237659Z 2025-05-07T19:55:33.8239335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.8241966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.8243176Z ^ 2025-05-07T19:55:33.8243548Z 2025-05-07T19:55:33.8245202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8247316Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:33.8248076Z ^ 2025-05-07T19:55:33.8248354Z 2025-05-07T19:55:33.8249947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8251679Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8252260Z ^ 2025-05-07T19:55:33.8252540Z 2025-05-07T19:55:33.8254077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8256055Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8256599Z ^ 2025-05-07T19:55:33.8256868Z 2025-05-07T19:55:33.8258437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8260521Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8261040Z ^ 2025-05-07T19:55:33.8261321Z 2025-05-07T19:55:33.8262960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.8265746Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.8266944Z ^ 2025-05-07T19:55:33.8267205Z 2025-05-07T19:55:33.8267672Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:33.8268318Z 2025-05-07T19:55:33.8270004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.8272721Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.8273896Z ^ 2025-05-07T19:55:33.8274401Z 2025-05-07T19:55:33.8275947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8279468Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:33.8280232Z ^ 2025-05-07T19:55:33.8280500Z 2025-05-07T19:55:33.8282025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8284059Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8284633Z ^ 2025-05-07T19:55:33.8284914Z 2025-05-07T19:55:33.8286510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8288469Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8288977Z ^ 2025-05-07T19:55:33.8289261Z 2025-05-07T19:55:33.8290805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8292769Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8293313Z ^ 2025-05-07T19:55:33.8293591Z 2025-05-07T19:55:33.8295293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.8298015Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.8299124Z ^ 2025-05-07T19:55:33.8299358Z 2025-05-07T19:55:33.8299970Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:33.8300621Z 2025-05-07T19:55:33.8302274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.8304997Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.8306193Z ^ 2025-05-07T19:55:33.8306550Z 2025-05-07T19:55:33.8308139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8310251Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:33.8310967Z ^ 2025-05-07T19:55:33.8311266Z 2025-05-07T19:55:33.8312800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8314785Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8315326Z ^ 2025-05-07T19:55:33.8315626Z 2025-05-07T19:55:33.8317187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8319191Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8319861Z ^ 2025-05-07T19:55:33.8320128Z 2025-05-07T19:55:33.8321703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8324165Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8324717Z ^ 2025-05-07T19:55:33.8324963Z 2025-05-07T19:55:33.8326583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.8329310Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.8330459Z ^ 2025-05-07T19:55:33.8330700Z 2025-05-07T19:55:33.8331143Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:33.8331802Z 2025-05-07T19:55:33.8333477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.8336196Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.8337349Z ^ 2025-05-07T19:55:33.8337714Z 2025-05-07T19:55:33.8339290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8341504Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:33.8342228Z ^ 2025-05-07T19:55:33.8342526Z 2025-05-07T19:55:33.8344069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8346050Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8346600Z ^ 2025-05-07T19:55:33.8346880Z 2025-05-07T19:55:33.8348475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8350359Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8350904Z ^ 2025-05-07T19:55:33.8351160Z 2025-05-07T19:55:33.8352663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8354633Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8355173Z ^ 2025-05-07T19:55:33.8355431Z 2025-05-07T19:55:50.6236218Z [241/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o 2025-05-07T19:55:50.6258742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.6261766Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.6262917Z ^ 2025-05-07T19:55:50.6263153Z 2025-05-07T19:55:50.6263632Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:50.6264351Z 2025-05-07T19:55:50.6266112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.6269139Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.6270423Z ^ 2025-05-07T19:55:50.6270804Z 2025-05-07T19:55:50.6272492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.6275139Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.6276345Z ^ 2025-05-07T19:55:50.6276600Z 2025-05-07T19:55:50.6277018Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:50.6277721Z 2025-05-07T19:55:50.6279533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.6282511Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.6284055Z ^ 2025-05-07T19:55:50.6284449Z 2025-05-07T19:55:50.6286248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.6290893Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.6292243Z ^ 2025-05-07T19:55:50.6292570Z 2025-05-07T19:55:50.6293073Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:50.6293902Z 2025-05-07T19:55:50.6295776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.6298848Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.6300317Z ^ 2025-05-07T19:55:50.6300706Z 2025-05-07T19:55:50.6302609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.6305567Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.6306878Z ^ 2025-05-07T19:55:50.6307143Z 2025-05-07T19:55:50.6307656Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:50.6308414Z 2025-05-07T19:55:50.6310273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.6313315Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.6314621Z ^ 2025-05-07T19:55:50.6315009Z 2025-05-07T19:55:50.6316799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.6319707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.6320922Z ^ 2025-05-07T19:55:50.6321204Z 2025-05-07T19:55:50.6321566Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:50.6322548Z 2025-05-07T19:55:50.6324352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.6326976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.6328096Z ^ 2025-05-07T19:55:50.6328455Z 2025-05-07T19:55:52.0230055Z [242/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o 2025-05-07T19:55:52.0254942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0257771Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.0258988Z ^ 2025-05-07T19:55:52.0259269Z 2025-05-07T19:55:52.0259840Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:52.0260557Z 2025-05-07T19:55:52.0262323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0265091Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.0266289Z ^ 2025-05-07T19:55:52.0266670Z 2025-05-07T19:55:52.0268363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:52.0270371Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:52.0270946Z ^ 2025-05-07T19:55:52.0271258Z 2025-05-07T19:55:52.0272878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:52.0275071Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:52.0275659Z ^ 2025-05-07T19:55:52.0275944Z 2025-05-07T19:55:52.0277911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:52.0279955Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:52.0280546Z ^ 2025-05-07T19:55:52.0280837Z 2025-05-07T19:55:52.0282560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0285485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.0286756Z ^ 2025-05-07T19:55:52.0287024Z 2025-05-07T19:55:52.0287505Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:52.0288239Z 2025-05-07T19:55:52.0290029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0292867Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.0294148Z ^ 2025-05-07T19:55:52.0294544Z 2025-05-07T19:55:52.0296229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:52.0298257Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:52.0298818Z ^ 2025-05-07T19:55:52.0299107Z 2025-05-07T19:55:52.0300948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:52.0303088Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:52.0303682Z ^ 2025-05-07T19:55:52.0303995Z 2025-05-07T19:55:52.0305698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:52.0307807Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:52.0308407Z ^ 2025-05-07T19:55:52.0308713Z 2025-05-07T19:55:52.0310426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0313243Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.0314465Z ^ 2025-05-07T19:55:52.0314693Z 2025-05-07T19:55:52.0315144Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:52.0315784Z 2025-05-07T19:55:52.0317639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0320598Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.0322371Z ^ 2025-05-07T19:55:52.0322778Z 2025-05-07T19:55:52.0324832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:52.0326926Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:52.0327538Z ^ 2025-05-07T19:55:52.0327838Z 2025-05-07T19:55:52.0329575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:52.0331829Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:52.0332441Z ^ 2025-05-07T19:55:52.0332743Z 2025-05-07T19:55:52.0334432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:52.0336534Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:52.0337152Z ^ 2025-05-07T19:55:52.0337486Z 2025-05-07T19:55:52.0339316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0342420Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.0343625Z ^ 2025-05-07T19:55:52.0343871Z 2025-05-07T19:55:52.0344323Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:52.0345042Z 2025-05-07T19:55:52.0346876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0349790Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.0351067Z ^ 2025-05-07T19:55:52.0351458Z 2025-05-07T19:55:52.0353187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:52.0355373Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:52.0355962Z ^ 2025-05-07T19:55:52.0356287Z 2025-05-07T19:55:52.0357987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:52.0360259Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:52.0360868Z ^ 2025-05-07T19:55:52.0361190Z 2025-05-07T19:55:52.0362956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:52.0365224Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:52.0365859Z ^ 2025-05-07T19:55:52.0366182Z 2025-05-07T19:55:52.0367967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0371199Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.0372450Z ^ 2025-05-07T19:55:52.0372755Z 2025-05-07T19:55:52.0373259Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:52.0374190Z 2025-05-07T19:55:52.0376120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0379296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.0380738Z ^ 2025-05-07T19:55:52.0381144Z 2025-05-07T19:55:52.0382933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:52.0385128Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:52.0385763Z ^ 2025-05-07T19:55:52.0386094Z 2025-05-07T19:55:52.0387898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:52.0390098Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:52.0390732Z ^ 2025-05-07T19:55:52.0391059Z 2025-05-07T19:55:52.0392715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:52.0394899Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:52.0395533Z ^ 2025-05-07T19:55:52.0395851Z 2025-05-07T19:56:02.2617834Z [243/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_grad_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o 2025-05-07T19:56:02.2641535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.2644182Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.2645344Z ^ 2025-05-07T19:56:02.2645611Z 2025-05-07T19:56:02.2646094Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.2646764Z 2025-05-07T19:56:02.2648472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.2651208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.2652403Z ^ 2025-05-07T19:56:02.2652771Z 2025-05-07T19:56:02.2654457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.2657200Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.2658337Z ^ 2025-05-07T19:56:02.2658575Z 2025-05-07T19:56:02.2659034Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.2659810Z 2025-05-07T19:56:02.2661517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.2664263Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.2665472Z ^ 2025-05-07T19:56:02.2665817Z 2025-05-07T19:56:02.2667446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.2670164Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.2671378Z ^ 2025-05-07T19:56:02.2671627Z 2025-05-07T19:56:02.2672070Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.2672775Z 2025-05-07T19:56:02.2674439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.2677207Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.2678731Z ^ 2025-05-07T19:56:02.2679100Z 2025-05-07T19:56:02.2680735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.2683453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.2684567Z ^ 2025-05-07T19:56:02.2684842Z 2025-05-07T19:56:02.2685290Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.2686045Z 2025-05-07T19:56:02.2687735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.2690422Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.2691632Z ^ 2025-05-07T19:56:02.2691998Z 2025-05-07T19:56:02.2693198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.2695000Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.2695809Z ^ 2025-05-07T19:56:02.2695983Z 2025-05-07T19:56:02.2696288Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.2696754Z 2025-05-07T19:56:02.2697871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.2699838Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.2700668Z ^ 2025-05-07T19:56:02.2700929Z 2025-05-07T19:56:04.5182285Z [244/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o 2025-05-07T19:56:04.5211900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:04.5231979Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:04.5233572Z ^ 2025-05-07T19:56:04.5233914Z 2025-05-07T19:56:04.5234484Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:04.5235351Z 2025-05-07T19:56:04.5237492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:04.5241005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:04.5242524Z ^ 2025-05-07T19:56:04.5242947Z 2025-05-07T19:56:04.5244941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:04.5248230Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:04.5249685Z ^ 2025-05-07T19:56:04.5249992Z 2025-05-07T19:56:04.5250513Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:04.5251285Z 2025-05-07T19:56:04.5253341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:04.5256648Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:04.5258111Z ^ 2025-05-07T19:56:04.5258564Z 2025-05-07T19:56:04.5260746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:04.5264018Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:04.5265441Z ^ 2025-05-07T19:56:04.5265754Z 2025-05-07T19:56:04.5266297Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:04.5267097Z 2025-05-07T19:56:04.5269098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:04.5272787Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:04.5274577Z ^ 2025-05-07T19:56:04.5275048Z 2025-05-07T19:56:04.5277173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:04.5283723Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:04.5285166Z ^ 2025-05-07T19:56:04.5285462Z 2025-05-07T19:56:04.5285986Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:04.5286808Z 2025-05-07T19:56:04.5288889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:04.5292346Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:04.5293904Z ^ 2025-05-07T19:56:04.5294392Z 2025-05-07T19:56:04.5296513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:04.5300173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:04.5301564Z ^ 2025-05-07T19:56:04.5301852Z 2025-05-07T19:56:04.5302371Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:04.5303175Z 2025-05-07T19:56:04.5305248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:04.5308442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:04.5309863Z ^ 2025-05-07T19:56:04.5310316Z 2025-05-07T19:56:08.7560310Z [245/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o 2025-05-07T19:56:08.7580705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.7583277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.7584415Z ^ 2025-05-07T19:56:08.7584652Z 2025-05-07T19:56:08.7585048Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:08.7585664Z 2025-05-07T19:56:08.7587189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.7589708Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.7590820Z ^ 2025-05-07T19:56:08.7591184Z 2025-05-07T19:56:08.7592703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.7595052Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.7596151Z ^ 2025-05-07T19:56:08.7596378Z 2025-05-07T19:56:08.7596788Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:08.7597422Z 2025-05-07T19:56:08.7598927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.7601086Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.7602143Z ^ 2025-05-07T19:56:08.7602492Z 2025-05-07T19:56:08.7603985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.7606444Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.7607519Z ^ 2025-05-07T19:56:08.7609324Z 2025-05-07T19:56:08.7609758Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:08.7610375Z 2025-05-07T19:56:08.7611980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.7614406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.7615505Z ^ 2025-05-07T19:56:08.7615953Z 2025-05-07T19:56:08.7617418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.7619963Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.7621037Z ^ 2025-05-07T19:56:08.7621267Z 2025-05-07T19:56:08.7621668Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:08.7622641Z 2025-05-07T19:56:08.7624155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.7626603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.7627690Z ^ 2025-05-07T19:56:08.7628036Z 2025-05-07T19:56:08.7629545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.7632082Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.7633202Z ^ 2025-05-07T19:56:08.7633437Z 2025-05-07T19:56:08.7633855Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:08.7634458Z 2025-05-07T19:56:08.7635931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.7638342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.7639443Z ^ 2025-05-07T19:56:08.7639776Z 2025-05-07T19:56:10.2790157Z [246/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o 2025-05-07T19:56:10.2813300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.2815847Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.2816895Z ^ 2025-05-07T19:56:10.2817163Z 2025-05-07T19:56:10.2817596Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:10.2818249Z 2025-05-07T19:56:10.2820107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.2822861Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.2824055Z ^ 2025-05-07T19:56:10.2824434Z 2025-05-07T19:56:10.2825811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2827744Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2828530Z ^ 2025-05-07T19:56:10.2831646Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:10.2834715Z 2025-05-07T19:56:10.2835874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2837601Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2838606Z ^ 2025-05-07T19:56:10.2841617Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:10.2844785Z 2025-05-07T19:56:10.2846117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2848263Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2849112Z ^ 2025-05-07T19:56:10.2852309Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:10.2855466Z 2025-05-07T19:56:10.2856818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2858907Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2859983Z ^ 2025-05-07T19:56:10.2863407Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:10.2866722Z 2025-05-07T19:56:10.2867987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2869906Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2870788Z ^ 2025-05-07T19:56:10.2874081Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:10.2877271Z 2025-05-07T19:56:10.2878634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2880709Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2881647Z ^ 2025-05-07T19:56:10.2885411Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:10.2888726Z 2025-05-07T19:56:10.2890024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2892079Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2892878Z ^ 2025-05-07T19:56:10.2896016Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:10.2899022Z 2025-05-07T19:56:10.2900548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2902444Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2903353Z ^ 2025-05-07T19:56:10.2906553Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:10.2909738Z 2025-05-07T19:56:10.2910988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2912902Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2913784Z ^ 2025-05-07T19:56:10.2916944Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:10.2920171Z 2025-05-07T19:56:10.2921460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2923687Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2924581Z ^ 2025-05-07T19:56:10.2927922Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:10.2931352Z 2025-05-07T19:56:10.2932846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2934862Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2935738Z ^ 2025-05-07T19:56:10.2939277Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:10.2942527Z 2025-05-07T19:56:10.2943709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2945573Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2946429Z ^ 2025-05-07T19:56:10.2949902Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:10.2953205Z 2025-05-07T19:56:10.2954547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2956601Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2957518Z ^ 2025-05-07T19:56:10.2961005Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:10.2964068Z 2025-05-07T19:56:10.2965353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2967297Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2968183Z ^ 2025-05-07T19:56:10.2971648Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:10.2975146Z 2025-05-07T19:56:10.2976655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2978755Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2979673Z ^ 2025-05-07T19:56:10.2983276Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:10.2986446Z 2025-05-07T19:56:10.2987702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2989419Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.2990241Z ^ 2025-05-07T19:56:10.2993643Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:10.2996653Z 2025-05-07T19:56:10.2997900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.2999962Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3000822Z ^ 2025-05-07T19:56:10.3004028Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:10.3007136Z 2025-05-07T19:56:10.3008456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3010481Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3011345Z ^ 2025-05-07T19:56:10.3014684Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:10.3018031Z 2025-05-07T19:56:10.3019254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3021446Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3022537Z ^ 2025-05-07T19:56:10.3025718Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:10.3029083Z 2025-05-07T19:56:10.3030409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3032463Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3033395Z ^ 2025-05-07T19:56:10.3036969Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:10.3040374Z 2025-05-07T19:56:10.3041742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3043699Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3044611Z ^ 2025-05-07T19:56:10.3047991Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:10.3051220Z 2025-05-07T19:56:10.3052553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3054639Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3055570Z ^ 2025-05-07T19:56:10.3059167Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:10.3062653Z 2025-05-07T19:56:10.3063993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3066186Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3067240Z ^ 2025-05-07T19:56:10.3070515Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:10.3073478Z 2025-05-07T19:56:10.3074725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3076763Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3077652Z ^ 2025-05-07T19:56:10.3081053Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:10.3084243Z 2025-05-07T19:56:10.3085880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.3088671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.3089920Z ^ 2025-05-07T19:56:10.3090178Z 2025-05-07T19:56:10.3090637Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:10.3091355Z 2025-05-07T19:56:10.3093120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.3095969Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.3097196Z ^ 2025-05-07T19:56:10.3097572Z 2025-05-07T19:56:10.3098927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3101054Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3101949Z ^ 2025-05-07T19:56:10.3105328Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:10.3108645Z 2025-05-07T19:56:10.3110036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3112109Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3113187Z ^ 2025-05-07T19:56:10.3116748Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:10.3120110Z 2025-05-07T19:56:10.3121418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3123619Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3124418Z ^ 2025-05-07T19:56:10.3127593Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:10.3130616Z 2025-05-07T19:56:10.3131772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3133510Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3134394Z ^ 2025-05-07T19:56:10.3137552Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:10.3140638Z 2025-05-07T19:56:10.3141920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3143923Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3144784Z ^ 2025-05-07T19:56:10.3148212Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:10.3151476Z 2025-05-07T19:56:10.3152810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3155080Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3155864Z ^ 2025-05-07T19:56:10.3159366Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:10.3162483Z 2025-05-07T19:56:10.3163741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3165679Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3166545Z ^ 2025-05-07T19:56:10.3169663Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:10.3172834Z 2025-05-07T19:56:10.3174160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3176201Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3177133Z ^ 2025-05-07T19:56:10.3180801Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:10.3184122Z 2025-05-07T19:56:10.3185502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3187448Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3188350Z ^ 2025-05-07T19:56:10.3191713Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:10.3194871Z 2025-05-07T19:56:10.3196133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3198308Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3199191Z ^ 2025-05-07T19:56:10.3203473Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:10.3206878Z 2025-05-07T19:56:10.3208249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3210324Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3211266Z ^ 2025-05-07T19:56:10.3214695Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:10.3217872Z 2025-05-07T19:56:10.3219149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3221301Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3222484Z ^ 2025-05-07T19:56:10.3225923Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:10.3229287Z 2025-05-07T19:56:10.3230660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3232713Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3233631Z ^ 2025-05-07T19:56:10.3237167Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:10.3240335Z 2025-05-07T19:56:10.3241659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3243575Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3244620Z ^ 2025-05-07T19:56:10.3248114Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:10.3251277Z 2025-05-07T19:56:10.3252605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3254636Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3255435Z ^ 2025-05-07T19:56:10.3258565Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:10.3261893Z 2025-05-07T19:56:10.3263098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3265080Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3265989Z ^ 2025-05-07T19:56:10.3269205Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:10.3271904Z 2025-05-07T19:56:10.3272951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3274599Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3275356Z ^ 2025-05-07T19:56:10.3278351Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:10.3281073Z 2025-05-07T19:56:10.3282271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3284146Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3284942Z ^ 2025-05-07T19:56:10.3288526Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:10.3291701Z 2025-05-07T19:56:10.3293046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3295229Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3296164Z ^ 2025-05-07T19:56:10.3299889Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:10.3303254Z 2025-05-07T19:56:10.3304507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3306523Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3307390Z ^ 2025-05-07T19:56:10.3310671Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:10.3313533Z 2025-05-07T19:56:10.3314758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3316680Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3317499Z ^ 2025-05-07T19:56:10.3320807Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:10.3324116Z 2025-05-07T19:56:10.3325356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3327297Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3328163Z ^ 2025-05-07T19:56:10.3331731Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:10.3335079Z 2025-05-07T19:56:10.3336270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3338134Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3339088Z ^ 2025-05-07T19:56:10.3342476Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:10.3345615Z 2025-05-07T19:56:10.3346738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3348660Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3349566Z ^ 2025-05-07T19:56:10.3352932Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:10.3356085Z 2025-05-07T19:56:10.3357795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.3360478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.3361668Z ^ 2025-05-07T19:56:10.3361926Z 2025-05-07T19:56:10.3362384Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:10.3363010Z 2025-05-07T19:56:10.3364638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.3367344Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.3368538Z ^ 2025-05-07T19:56:10.3368871Z 2025-05-07T19:56:10.3370171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3371962Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3372780Z ^ 2025-05-07T19:56:10.3376380Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:10.3379593Z 2025-05-07T19:56:10.3381077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3383074Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3383880Z ^ 2025-05-07T19:56:10.3387195Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:10.3390333Z 2025-05-07T19:56:10.3391667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3393600Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3394466Z ^ 2025-05-07T19:56:10.3397707Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:10.3400939Z 2025-05-07T19:56:10.3402299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3404372Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3405280Z ^ 2025-05-07T19:56:10.3408839Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:10.3412094Z 2025-05-07T19:56:10.3413415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3415385Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3416306Z ^ 2025-05-07T19:56:10.3419647Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:10.3423494Z 2025-05-07T19:56:10.3425108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3427210Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3428182Z ^ 2025-05-07T19:56:10.3431754Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:10.3434974Z 2025-05-07T19:56:10.3436227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3438266Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3439131Z ^ 2025-05-07T19:56:10.3442438Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:10.3445683Z 2025-05-07T19:56:10.3447015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3449083Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3449989Z ^ 2025-05-07T19:56:10.3453517Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:10.3456791Z 2025-05-07T19:56:10.3458136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3460323Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3461205Z ^ 2025-05-07T19:56:10.3464644Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:10.3467996Z 2025-05-07T19:56:10.3469443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3471321Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3472188Z ^ 2025-05-07T19:56:10.3475355Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:10.3478554Z 2025-05-07T19:56:10.3479864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3481753Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3482671Z ^ 2025-05-07T19:56:10.3485995Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:10.3488758Z 2025-05-07T19:56:10.3490025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3491936Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3492808Z ^ 2025-05-07T19:56:10.3496161Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:10.3499497Z 2025-05-07T19:56:10.3501015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3503080Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3504013Z ^ 2025-05-07T19:56:10.3507560Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:10.3510871Z 2025-05-07T19:56:10.3512178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3514312Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3515193Z ^ 2025-05-07T19:56:10.3518497Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:10.3521691Z 2025-05-07T19:56:10.3523279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3525387Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3526291Z ^ 2025-05-07T19:56:10.3529861Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:10.3533218Z 2025-05-07T19:56:10.3534556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3536553Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3537438Z ^ 2025-05-07T19:56:10.3540856Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:10.3543677Z 2025-05-07T19:56:10.3544915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3546768Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3547632Z ^ 2025-05-07T19:56:10.3550998Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:10.3554226Z 2025-05-07T19:56:10.3555402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3557401Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3558497Z ^ 2025-05-07T19:56:10.3561805Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:10.3565054Z 2025-05-07T19:56:10.3566218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3568093Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3568972Z ^ 2025-05-07T19:56:10.3572001Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:10.3574765Z 2025-05-07T19:56:10.3576077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3578134Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3579023Z ^ 2025-05-07T19:56:10.3582413Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:10.3585587Z 2025-05-07T19:56:10.3586880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3588858Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3589759Z ^ 2025-05-07T19:56:10.3593212Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:10.3596446Z 2025-05-07T19:56:10.3597729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3599667Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3600550Z ^ 2025-05-07T19:56:10.3604158Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:10.3607274Z 2025-05-07T19:56:10.3608581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3610473Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3611370Z ^ 2025-05-07T19:56:10.3614682Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:10.3617944Z 2025-05-07T19:56:10.3619218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3621338Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3622429Z ^ 2025-05-07T19:56:10.3625854Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:10.3629100Z 2025-05-07T19:56:10.3630799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.3633498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.3634725Z ^ 2025-05-07T19:56:10.3634979Z 2025-05-07T19:56:10.3635445Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:10.3636115Z 2025-05-07T19:56:10.3637805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.3640348Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.3641512Z ^ 2025-05-07T19:56:10.3642105Z 2025-05-07T19:56:10.3643401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3645344Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3646457Z ^ 2025-05-07T19:56:10.3649798Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:10.3653102Z 2025-05-07T19:56:10.3654402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3656333Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3657067Z ^ 2025-05-07T19:56:10.3660574Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:10.3663654Z 2025-05-07T19:56:10.3664929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3666972Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3667851Z ^ 2025-05-07T19:56:10.3671060Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:10.3674188Z 2025-05-07T19:56:10.3675498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3677453Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3678334Z ^ 2025-05-07T19:56:10.3681705Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:10.3684708Z 2025-05-07T19:56:10.3685892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3688007Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3688906Z ^ 2025-05-07T19:56:10.3692379Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:10.3695619Z 2025-05-07T19:56:10.3696972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3698970Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3700024Z ^ 2025-05-07T19:56:10.3703426Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:10.3706535Z 2025-05-07T19:56:10.3707594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3709550Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3710459Z ^ 2025-05-07T19:56:10.3713855Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:10.3716992Z 2025-05-07T19:56:10.3718294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3720320Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3721235Z ^ 2025-05-07T19:56:10.3724955Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:10.3728181Z 2025-05-07T19:56:10.3729529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3731572Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3732431Z ^ 2025-05-07T19:56:10.3735969Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:10.3738808Z 2025-05-07T19:56:10.3740239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3742028Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3742911Z ^ 2025-05-07T19:56:10.3746068Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:10.3749213Z 2025-05-07T19:56:10.3750510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3752535Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3753457Z ^ 2025-05-07T19:56:10.3756981Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:10.3760179Z 2025-05-07T19:56:10.3761448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3763438Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3764245Z ^ 2025-05-07T19:56:10.3767555Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:10.3770714Z 2025-05-07T19:56:10.3771905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3773912Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3776106Z ^ 2025-05-07T19:56:10.3779660Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:10.3782971Z 2025-05-07T19:56:10.3784231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3786347Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3787230Z ^ 2025-05-07T19:56:10.3790709Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:10.3793930Z 2025-05-07T19:56:10.3795269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3797285Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3798178Z ^ 2025-05-07T19:56:10.3801681Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:10.3804657Z 2025-05-07T19:56:10.3805977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3807922Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3808797Z ^ 2025-05-07T19:56:10.3812181Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:10.3815004Z 2025-05-07T19:56:10.3816298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3818365Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3819201Z ^ 2025-05-07T19:56:10.3823437Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:10.3826417Z 2025-05-07T19:56:10.3827548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3829494Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3830338Z ^ 2025-05-07T19:56:10.3833787Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:10.3837036Z 2025-05-07T19:56:10.3838376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3840042Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3840837Z ^ 2025-05-07T19:56:10.3844095Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:10.3847118Z 2025-05-07T19:56:10.3848375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3850227Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3851055Z ^ 2025-05-07T19:56:10.3854463Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:10.3857327Z 2025-05-07T19:56:10.3858479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3860543Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3861357Z ^ 2025-05-07T19:56:10.3864973Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:10.3868372Z 2025-05-07T19:56:10.3869477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3871425Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3872390Z ^ 2025-05-07T19:56:10.3875849Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:10.3878775Z 2025-05-07T19:56:10.3880085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3882103Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3882993Z ^ 2025-05-07T19:56:10.3886387Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:10.3889553Z 2025-05-07T19:56:10.3890923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3892854Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3893743Z ^ 2025-05-07T19:56:10.3897077Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:10.3900466Z 2025-05-07T19:56:10.3902178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.3904862Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.3906066Z ^ 2025-05-07T19:56:10.3906323Z 2025-05-07T19:56:10.3906792Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:10.3907464Z 2025-05-07T19:56:10.3909368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.3912165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.3913366Z ^ 2025-05-07T19:56:10.3913750Z 2025-05-07T19:56:10.3915067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3917234Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3918145Z ^ 2025-05-07T19:56:10.3921680Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:10.3925218Z 2025-05-07T19:56:10.3926525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3928546Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3929373Z ^ 2025-05-07T19:56:10.3932690Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:10.3935741Z 2025-05-07T19:56:10.3936850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3938716Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3939639Z ^ 2025-05-07T19:56:10.3943123Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:10.3946298Z 2025-05-07T19:56:10.3947582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3949309Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3950109Z ^ 2025-05-07T19:56:10.3952990Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:10.3955916Z 2025-05-07T19:56:10.3957165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3958826Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3959751Z ^ 2025-05-07T19:56:10.3962870Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:10.3965728Z 2025-05-07T19:56:10.3966997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3968941Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3969842Z ^ 2025-05-07T19:56:10.3973274Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:10.3976303Z 2025-05-07T19:56:10.3977568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3979565Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3980621Z ^ 2025-05-07T19:56:10.3984071Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:10.3986792Z 2025-05-07T19:56:10.3988086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.3989920Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.3990763Z ^ 2025-05-07T19:56:10.3994069Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:10.3997324Z 2025-05-07T19:56:10.3998538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4000413Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4001264Z ^ 2025-05-07T19:56:10.4004634Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:10.4007890Z 2025-05-07T19:56:10.4009143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4010961Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4011811Z ^ 2025-05-07T19:56:10.4015068Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:10.4017713Z 2025-05-07T19:56:10.4018837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4020952Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4021908Z ^ 2025-05-07T19:56:10.4025551Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:10.4028546Z 2025-05-07T19:56:10.4029868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4031839Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4032792Z ^ 2025-05-07T19:56:10.4036233Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:10.4039701Z 2025-05-07T19:56:10.4041060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4043193Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4044005Z ^ 2025-05-07T19:56:10.4047399Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:10.4050610Z 2025-05-07T19:56:10.4051785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4053794Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4054662Z ^ 2025-05-07T19:56:10.4057933Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:10.4061175Z 2025-05-07T19:56:10.4062518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4064529Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4065430Z ^ 2025-05-07T19:56:10.4068768Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:10.4071970Z 2025-05-07T19:56:10.4073271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4075313Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4076210Z ^ 2025-05-07T19:56:10.4079664Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:10.4083094Z 2025-05-07T19:56:10.4084315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4086153Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4087169Z ^ 2025-05-07T19:56:10.4090534Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:10.4093792Z 2025-05-07T19:56:10.4095091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4097098Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4098007Z ^ 2025-05-07T19:56:10.4101579Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:10.4104475Z 2025-05-07T19:56:10.4105810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4107814Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4108707Z ^ 2025-05-07T19:56:10.4112132Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:10.4115303Z 2025-05-07T19:56:10.4116536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4118420Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4119325Z ^ 2025-05-07T19:56:10.4123070Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:10.4126233Z 2025-05-07T19:56:10.4127528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4129610Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4130427Z ^ 2025-05-07T19:56:10.4133933Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:10.4137193Z 2025-05-07T19:56:10.4138495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4140669Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4141536Z ^ 2025-05-07T19:56:10.4145037Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:10.4148367Z 2025-05-07T19:56:10.4149659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4151631Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4152524Z ^ 2025-05-07T19:56:10.4155675Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:10.4158949Z 2025-05-07T19:56:10.4160260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:10.4162240Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:10.4163135Z ^ 2025-05-07T19:56:10.4166617Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:10.4169903Z 2025-05-07T19:56:20.8141666Z [247/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o 2025-05-07T19:56:20.8159978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:20.8162006Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:20.8162871Z ^ 2025-05-07T19:56:20.8163091Z 2025-05-07T19:56:20.8163515Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:20.8163993Z 2025-05-07T19:56:20.8165246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:20.8167265Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:20.8168165Z ^ 2025-05-07T19:56:20.8168444Z 2025-05-07T19:56:20.8169616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:20.8171092Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:20.8171773Z ^ 2025-05-07T19:56:20.8172025Z 2025-05-07T19:56:20.8173263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:20.8174991Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:20.8175490Z ^ 2025-05-07T19:56:20.8175737Z 2025-05-07T19:56:20.8176988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:20.8178642Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:20.8179088Z ^ 2025-05-07T19:56:20.8179311Z 2025-05-07T19:56:20.8180654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:20.8182607Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:20.8183598Z ^ 2025-05-07T19:56:20.8183898Z 2025-05-07T19:56:20.8184350Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:20.8185044Z 2025-05-07T19:56:20.8186508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:20.8188642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:20.8189562Z ^ 2025-05-07T19:56:20.8189877Z 2025-05-07T19:56:20.8191194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:20.8192806Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:20.8193277Z ^ 2025-05-07T19:56:20.8193525Z 2025-05-07T19:56:20.8194748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:20.8196366Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:20.8196852Z ^ 2025-05-07T19:56:20.8197085Z 2025-05-07T19:56:20.8198241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:20.8199726Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:20.8200149Z ^ 2025-05-07T19:56:20.8200370Z 2025-05-07T19:56:20.8201572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:20.8203787Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:20.8204688Z ^ 2025-05-07T19:56:20.8204899Z 2025-05-07T19:56:20.8205249Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:20.8205777Z 2025-05-07T19:56:20.8207165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:20.8209492Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:20.8210562Z ^ 2025-05-07T19:56:20.8210841Z 2025-05-07T19:56:20.8212132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:20.8213682Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:20.8214107Z ^ 2025-05-07T19:56:20.8214324Z 2025-05-07T19:56:20.8215451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:20.8216840Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:20.8217276Z ^ 2025-05-07T19:56:20.8217488Z 2025-05-07T19:56:20.8218729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:20.8220397Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:20.8220801Z ^ 2025-05-07T19:56:20.8221040Z 2025-05-07T19:56:20.8222503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:20.8224678Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:20.8225672Z ^ 2025-05-07T19:56:20.8225904Z 2025-05-07T19:56:20.8226276Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:20.8226784Z 2025-05-07T19:56:20.8228175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:20.8230127Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:20.8231002Z ^ 2025-05-07T19:56:20.8231260Z 2025-05-07T19:56:20.8232363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:20.8233748Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:20.8234189Z ^ 2025-05-07T19:56:20.8234431Z 2025-05-07T19:56:20.8235668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:20.8237080Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:20.8237478Z ^ 2025-05-07T19:56:20.8237685Z 2025-05-07T19:56:20.8238795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:20.8240248Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:20.8240924Z ^ 2025-05-07T19:56:20.8241146Z 2025-05-07T19:56:20.8242306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:20.8244630Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:20.8245588Z ^ 2025-05-07T19:56:20.8245805Z 2025-05-07T19:56:20.8246164Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:20.8246740Z 2025-05-07T19:56:20.8248036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:20.8250102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:20.8251107Z ^ 2025-05-07T19:56:20.8251394Z 2025-05-07T19:56:20.8252675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:20.8254251Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:20.8254710Z ^ 2025-05-07T19:56:20.8254954Z 2025-05-07T19:56:20.8256163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:20.8257808Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:20.8258215Z ^ 2025-05-07T19:56:20.8258452Z 2025-05-07T19:56:20.8259591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:20.8261386Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:20.8261826Z ^ 2025-05-07T19:56:20.8262072Z 2025-05-07T19:56:38.8298874Z [248/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:38.8322941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.8325739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.8326898Z ^ 2025-05-07T19:56:38.8327181Z 2025-05-07T19:56:38.8327631Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:38.8328314Z 2025-05-07T19:56:38.8330025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.8332765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.8333890Z ^ 2025-05-07T19:56:38.8334203Z 2025-05-07T19:56:38.8335456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8337127Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:38.8337734Z ^ 2025-05-07T19:56:38.8337955Z 2025-05-07T19:56:38.8339179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8340986Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.8341476Z ^ 2025-05-07T19:56:38.8341710Z 2025-05-07T19:56:38.8342936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8344770Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.8345272Z ^ 2025-05-07T19:56:38.8345523Z 2025-05-07T19:56:38.8346945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8348782Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.8349282Z ^ 2025-05-07T19:56:38.8349558Z 2025-05-07T19:56:38.8351152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.8353909Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.8355268Z ^ 2025-05-07T19:56:38.8355501Z 2025-05-07T19:56:38.8355964Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:38.8356615Z 2025-05-07T19:56:38.8358156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.8360814Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.8361984Z ^ 2025-05-07T19:56:38.8362329Z 2025-05-07T19:56:38.8363800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8365827Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:38.8366587Z ^ 2025-05-07T19:56:38.8366887Z 2025-05-07T19:56:38.8368464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8370261Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.8370827Z ^ 2025-05-07T19:56:38.8371099Z 2025-05-07T19:56:38.8372430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8374388Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.8374921Z ^ 2025-05-07T19:56:38.8375196Z 2025-05-07T19:56:38.8376776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8378784Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.8379328Z ^ 2025-05-07T19:56:38.8379609Z 2025-05-07T19:56:38.8381428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.8384194Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.8385386Z ^ 2025-05-07T19:56:38.8385635Z 2025-05-07T19:56:38.8386085Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:38.8386780Z 2025-05-07T19:56:38.8388471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.8391088Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.8392196Z ^ 2025-05-07T19:56:38.8392654Z 2025-05-07T19:56:38.8394246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8396748Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:38.8397521Z ^ 2025-05-07T19:56:38.8397824Z 2025-05-07T19:56:38.8399565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8401588Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.8402159Z ^ 2025-05-07T19:56:38.8402504Z 2025-05-07T19:56:38.8404062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8406108Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.8406683Z ^ 2025-05-07T19:56:38.8406968Z 2025-05-07T19:56:38.8408634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8410669Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.8411222Z ^ 2025-05-07T19:56:38.8411497Z 2025-05-07T19:56:38.8413195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.8415978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.8417191Z ^ 2025-05-07T19:56:38.8417453Z 2025-05-07T19:56:38.8417899Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:38.8418566Z 2025-05-07T19:56:38.8420431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.8423487Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.8424702Z ^ 2025-05-07T19:56:38.8425066Z 2025-05-07T19:56:38.8426661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8428799Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:38.8429562Z ^ 2025-05-07T19:56:38.8429828Z 2025-05-07T19:56:38.8431425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8433431Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.8433981Z ^ 2025-05-07T19:56:38.8434256Z 2025-05-07T19:56:38.8435870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8437899Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.8438474Z ^ 2025-05-07T19:56:38.8438751Z 2025-05-07T19:56:38.8440369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8442614Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.8443151Z ^ 2025-05-07T19:56:38.8443444Z 2025-05-07T19:56:38.8445408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.8448114Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.8449380Z ^ 2025-05-07T19:56:38.8449644Z 2025-05-07T19:56:38.8450091Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:38.8450740Z 2025-05-07T19:56:38.8452415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.8455113Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.8456339Z ^ 2025-05-07T19:56:38.8456709Z 2025-05-07T19:56:38.8458350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8460667Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:38.8461440Z ^ 2025-05-07T19:56:38.8461706Z 2025-05-07T19:56:38.8463318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8465297Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.8465863Z ^ 2025-05-07T19:56:38.8466155Z 2025-05-07T19:56:38.8467758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8469756Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.8470318Z ^ 2025-05-07T19:56:38.8470575Z 2025-05-07T19:56:38.8472088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.8474088Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.8474651Z ^ 2025-05-07T19:56:38.8474947Z 2025-05-07T19:56:40.8498432Z [249/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:40.8521053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.8538169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:40.8539252Z ^ 2025-05-07T19:56:40.8539521Z 2025-05-07T19:56:40.8540074Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:40.8540750Z 2025-05-07T19:56:40.8542352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.8544676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:40.8545758Z ^ 2025-05-07T19:56:40.8546100Z 2025-05-07T19:56:40.8547550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8549562Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:40.8550288Z ^ 2025-05-07T19:56:40.8550554Z 2025-05-07T19:56:40.8551968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8553726Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:40.8554259Z ^ 2025-05-07T19:56:40.8554517Z 2025-05-07T19:56:40.8555946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8558165Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:40.8558663Z ^ 2025-05-07T19:56:40.8558930Z 2025-05-07T19:56:40.8560575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8562339Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:40.8562825Z ^ 2025-05-07T19:56:40.8563090Z 2025-05-07T19:56:40.8564586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.8567266Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:40.8568375Z ^ 2025-05-07T19:56:40.8568628Z 2025-05-07T19:56:40.8569089Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:40.8569703Z 2025-05-07T19:56:40.8571276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.8573782Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:40.8574901Z ^ 2025-05-07T19:56:40.8575232Z 2025-05-07T19:56:40.8576385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8578269Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:40.8578962Z ^ 2025-05-07T19:56:40.8579216Z 2025-05-07T19:56:40.8580565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8582050Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:40.8582466Z ^ 2025-05-07T19:56:40.8582691Z 2025-05-07T19:56:40.8584021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8585815Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:40.8586316Z ^ 2025-05-07T19:56:40.8586643Z 2025-05-07T19:56:40.8588094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8589931Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:40.8590439Z ^ 2025-05-07T19:56:40.8590695Z 2025-05-07T19:56:40.8592271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.8594806Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:40.8595898Z ^ 2025-05-07T19:56:40.8596133Z 2025-05-07T19:56:40.8596566Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:40.8597359Z 2025-05-07T19:56:40.8598899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.8601576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:40.8602715Z ^ 2025-05-07T19:56:40.8603036Z 2025-05-07T19:56:40.8604458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8606595Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:40.8607329Z ^ 2025-05-07T19:56:40.8607617Z 2025-05-07T19:56:40.8609079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8610924Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:40.8611423Z ^ 2025-05-07T19:56:40.8611703Z 2025-05-07T19:56:40.8613172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8614994Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:40.8615544Z ^ 2025-05-07T19:56:40.8615789Z 2025-05-07T19:56:40.8617206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8619011Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:40.8619549Z ^ 2025-05-07T19:56:40.8619809Z 2025-05-07T19:56:40.8621509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.8623927Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:40.8624827Z ^ 2025-05-07T19:56:40.8625033Z 2025-05-07T19:56:40.8625379Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:40.8625992Z 2025-05-07T19:56:40.8627451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.8629828Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:40.8630860Z ^ 2025-05-07T19:56:40.8631194Z 2025-05-07T19:56:40.8632637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8634500Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:40.8635194Z ^ 2025-05-07T19:56:40.8635454Z 2025-05-07T19:56:40.8636888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8639037Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:40.8639523Z ^ 2025-05-07T19:56:40.8639792Z 2025-05-07T19:56:40.8641366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8643174Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:40.8643699Z ^ 2025-05-07T19:56:40.8643947Z 2025-05-07T19:56:40.8645316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8647209Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:40.8647725Z ^ 2025-05-07T19:56:40.8648004Z 2025-05-07T19:56:40.8649562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.8651912Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:40.8652795Z ^ 2025-05-07T19:56:40.8653031Z 2025-05-07T19:56:40.8653445Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:40.8654051Z 2025-05-07T19:56:40.8655338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.8657703Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:40.8658832Z ^ 2025-05-07T19:56:40.8659165Z 2025-05-07T19:56:40.8660752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8662732Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:40.8663425Z ^ 2025-05-07T19:56:40.8663690Z 2025-05-07T19:56:40.8665145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8666928Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:40.8667460Z ^ 2025-05-07T19:56:40.8667716Z 2025-05-07T19:56:40.8669133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8670949Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:40.8671465Z ^ 2025-05-07T19:56:40.8671738Z 2025-05-07T19:56:40.8673169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:40.8674968Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:40.8675471Z ^ 2025-05-07T19:56:40.8675744Z 2025-05-07T19:56:43.9461075Z [250/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:43.9484204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9487056Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9488255Z ^ 2025-05-07T19:56:43.9488511Z 2025-05-07T19:56:43.9488976Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:43.9489641Z 2025-05-07T19:56:43.9491225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9493847Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9495055Z ^ 2025-05-07T19:56:43.9495420Z 2025-05-07T19:56:43.9496907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9499028Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:43.9499750Z ^ 2025-05-07T19:56:43.9500430Z 2025-05-07T19:56:43.9501574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9503225Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9503954Z ^ 2025-05-07T19:56:43.9504272Z 2025-05-07T19:56:43.9505717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9507780Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9508301Z ^ 2025-05-07T19:56:43.9508585Z 2025-05-07T19:56:43.9510085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9511994Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9512576Z ^ 2025-05-07T19:56:43.9512852Z 2025-05-07T19:56:43.9514485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9517105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9518234Z ^ 2025-05-07T19:56:43.9518495Z 2025-05-07T19:56:43.9518928Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:43.9519607Z 2025-05-07T19:56:43.9521218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9524170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9525304Z ^ 2025-05-07T19:56:43.9525696Z 2025-05-07T19:56:43.9527172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9529203Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:43.9529954Z ^ 2025-05-07T19:56:43.9530235Z 2025-05-07T19:56:43.9531717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9533569Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9534148Z ^ 2025-05-07T19:56:43.9534422Z 2025-05-07T19:56:43.9535932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9537770Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9538345Z ^ 2025-05-07T19:56:43.9538620Z 2025-05-07T19:56:43.9540174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9542035Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9542578Z ^ 2025-05-07T19:56:43.9543110Z 2025-05-07T19:56:43.9544699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9547570Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9548762Z ^ 2025-05-07T19:56:43.9549018Z 2025-05-07T19:56:43.9549454Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:43.9550092Z 2025-05-07T19:56:43.9551719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9554494Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9555691Z ^ 2025-05-07T19:56:43.9556037Z 2025-05-07T19:56:43.9557564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9559587Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:43.9560335Z ^ 2025-05-07T19:56:43.9560615Z 2025-05-07T19:56:43.9562029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9563888Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9564452Z ^ 2025-05-07T19:56:43.9564741Z 2025-05-07T19:56:43.9566170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9568027Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9568590Z ^ 2025-05-07T19:56:43.9568853Z 2025-05-07T19:56:43.9570285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9572140Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9572664Z ^ 2025-05-07T19:56:43.9572961Z 2025-05-07T19:56:43.9574509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9577122Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9578236Z ^ 2025-05-07T19:56:43.9578523Z 2025-05-07T19:56:43.9578964Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:43.9579606Z 2025-05-07T19:56:43.9581322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9583917Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9585087Z ^ 2025-05-07T19:56:43.9585443Z 2025-05-07T19:56:43.9587062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9589120Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:43.9589877Z ^ 2025-05-07T19:56:43.9592531Z 2025-05-07T19:56:43.9594100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9595999Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9596678Z ^ 2025-05-07T19:56:43.9597000Z 2025-05-07T19:56:43.9598473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9600333Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9600900Z ^ 2025-05-07T19:56:43.9601219Z 2025-05-07T19:56:43.9602651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9604506Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9605001Z ^ 2025-05-07T19:56:43.9605263Z 2025-05-07T19:56:43.9606773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9609490Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9610724Z ^ 2025-05-07T19:56:43.9610952Z 2025-05-07T19:56:43.9611309Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:43.9611816Z 2025-05-07T19:56:43.9613231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9615646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9616659Z ^ 2025-05-07T19:56:43.9616957Z 2025-05-07T19:56:43.9618199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9620281Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:43.9621018Z ^ 2025-05-07T19:56:43.9621463Z 2025-05-07T19:56:43.9623130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9624883Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9625442Z ^ 2025-05-07T19:56:43.9625720Z 2025-05-07T19:56:43.9627095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9628919Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9629453Z ^ 2025-05-07T19:56:43.9630049Z 2025-05-07T19:56:43.9631500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9633375Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9633910Z ^ 2025-05-07T19:56:43.9634454Z 2025-05-07T19:56:44.2393006Z [251/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:44.2414692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.2417348Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:44.2418492Z ^ 2025-05-07T19:56:44.2418740Z 2025-05-07T19:56:44.2419162Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:44.2419812Z 2025-05-07T19:56:44.2421556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.2424290Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:44.2425636Z ^ 2025-05-07T19:56:44.2425994Z 2025-05-07T19:56:44.2427691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2429726Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:44.2430547Z ^ 2025-05-07T19:56:44.2430819Z 2025-05-07T19:56:44.2432271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2434283Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:44.2434805Z ^ 2025-05-07T19:56:44.2435101Z 2025-05-07T19:56:44.2436444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2438080Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:44.2438581Z ^ 2025-05-07T19:56:44.2438859Z 2025-05-07T19:56:44.2440263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2442020Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:44.2442528Z ^ 2025-05-07T19:56:44.2442763Z 2025-05-07T19:56:44.2444171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.2446573Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:44.2447633Z ^ 2025-05-07T19:56:44.2447877Z 2025-05-07T19:56:44.2448319Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:44.2448929Z 2025-05-07T19:56:44.2450406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.2452981Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:44.2453964Z ^ 2025-05-07T19:56:44.2454252Z 2025-05-07T19:56:44.2455507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2457395Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:44.2458054Z ^ 2025-05-07T19:56:44.2458309Z 2025-05-07T19:56:44.2459620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2461570Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:44.2462071Z ^ 2025-05-07T19:56:44.2462334Z 2025-05-07T19:56:44.2463805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2465629Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:44.2466165Z ^ 2025-05-07T19:56:44.2466408Z 2025-05-07T19:56:44.2467957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2469721Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:44.2470222Z ^ 2025-05-07T19:56:44.2470462Z 2025-05-07T19:56:44.2471921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.2474478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:44.2475544Z ^ 2025-05-07T19:56:44.2475762Z 2025-05-07T19:56:44.2476158Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:44.2476742Z 2025-05-07T19:56:44.2478275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.2480802Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:44.2481933Z ^ 2025-05-07T19:56:44.2482253Z 2025-05-07T19:56:44.2483648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2485559Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:44.2486075Z ^ 2025-05-07T19:56:44.2486332Z 2025-05-07T19:56:44.2487727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2489521Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:44.2490055Z ^ 2025-05-07T19:56:44.2490304Z 2025-05-07T19:56:44.2491688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2493507Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:44.2494033Z ^ 2025-05-07T19:56:44.2494293Z 2025-05-07T19:56:44.2495657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2497456Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:44.2497989Z ^ 2025-05-07T19:56:44.2498252Z 2025-05-07T19:56:44.2499572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.2502215Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:44.2503285Z ^ 2025-05-07T19:56:44.2503539Z 2025-05-07T19:56:44.2503967Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:44.2504744Z 2025-05-07T19:56:44.2506332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.2508964Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:44.2510068Z ^ 2025-05-07T19:56:44.2510405Z 2025-05-07T19:56:44.2511766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2513777Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:44.2514476Z ^ 2025-05-07T19:56:44.2514727Z 2025-05-07T19:56:44.2516099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2517889Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:44.2518434Z ^ 2025-05-07T19:56:44.2518694Z 2025-05-07T19:56:44.2520004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2521766Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:44.2522461Z ^ 2025-05-07T19:56:44.2522709Z 2025-05-07T19:56:44.2524062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2525857Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:44.2526334Z ^ 2025-05-07T19:56:44.2526594Z 2025-05-07T19:56:44.2528061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.2530530Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:44.2531619Z ^ 2025-05-07T19:56:44.2531890Z 2025-05-07T19:56:44.2532336Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:44.2532923Z 2025-05-07T19:56:44.2534349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.2536897Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:44.2538005Z ^ 2025-05-07T19:56:44.2538354Z 2025-05-07T19:56:44.2539745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2541908Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:44.2542626Z ^ 2025-05-07T19:56:44.2542879Z 2025-05-07T19:56:44.2544287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2546380Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:44.2546876Z ^ 2025-05-07T19:56:44.2547152Z 2025-05-07T19:56:44.2548752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2550476Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:44.2550969Z ^ 2025-05-07T19:56:44.2551234Z 2025-05-07T19:56:44.2552646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:44.2554529Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:44.2554990Z ^ 2025-05-07T19:56:44.2555234Z 2025-05-07T19:56:50.2499474Z [252/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:50.2523776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2526429Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2527958Z ^ 2025-05-07T19:56:50.2528212Z 2025-05-07T19:56:50.2528649Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:50.2529291Z 2025-05-07T19:56:50.2531254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2533902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2535301Z ^ 2025-05-07T19:56:50.2535664Z 2025-05-07T19:56:50.2537433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2540572Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2541775Z ^ 2025-05-07T19:56:50.2542029Z 2025-05-07T19:56:50.2542501Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:50.2543253Z 2025-05-07T19:56:50.2545037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2547787Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2548943Z ^ 2025-05-07T19:56:50.2549316Z 2025-05-07T19:56:50.2551138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2553914Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2555154Z ^ 2025-05-07T19:56:50.2555400Z 2025-05-07T19:56:50.2555869Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:50.2556536Z 2025-05-07T19:56:50.2558314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2561282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2562546Z ^ 2025-05-07T19:56:50.2562922Z 2025-05-07T19:56:50.2564643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2567584Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2568866Z ^ 2025-05-07T19:56:50.2569137Z 2025-05-07T19:56:50.2569605Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:50.2570310Z 2025-05-07T19:56:50.2571975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2574313Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2575475Z ^ 2025-05-07T19:56:50.2575802Z 2025-05-07T19:56:50.2577531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2580403Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2581628Z ^ 2025-05-07T19:56:50.2581998Z 2025-05-07T19:56:50.2582469Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:50.2583110Z 2025-05-07T19:56:50.2584930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2587768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2589041Z ^ 2025-05-07T19:56:50.2589435Z 2025-05-07T19:56:53.9370903Z [253/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o 2025-05-07T19:56:53.9392231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.9395110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.9396429Z ^ 2025-05-07T19:56:53.9396675Z 2025-05-07T19:56:53.9397094Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:53.9397742Z 2025-05-07T19:56:53.9399287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.9401877Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.9402986Z ^ 2025-05-07T19:56:53.9403336Z 2025-05-07T19:56:53.9404883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.9407393Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.9408514Z ^ 2025-05-07T19:56:53.9408750Z 2025-05-07T19:56:53.9409178Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:53.9409808Z 2025-05-07T19:56:53.9411403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.9414001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.9415165Z ^ 2025-05-07T19:56:53.9415512Z 2025-05-07T19:56:53.9417125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.9420012Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.9421154Z ^ 2025-05-07T19:56:53.9421406Z 2025-05-07T19:56:53.9421825Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:53.9422703Z 2025-05-07T19:56:53.9424304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.9426864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.9428023Z ^ 2025-05-07T19:56:53.9428362Z 2025-05-07T19:56:53.9429931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.9432453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.9433488Z ^ 2025-05-07T19:56:53.9433729Z 2025-05-07T19:56:53.9434111Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:53.9434869Z 2025-05-07T19:56:53.9436387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.9438949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.9440065Z ^ 2025-05-07T19:56:53.9440385Z 2025-05-07T19:56:53.9441914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.9444517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.9445593Z ^ 2025-05-07T19:56:53.9445828Z 2025-05-07T19:56:53.9446251Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:53.9446946Z 2025-05-07T19:56:53.9448494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.9450875Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.9451947Z ^ 2025-05-07T19:56:53.9452305Z 2025-05-07T19:56:55.0650138Z [254/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o 2025-05-07T19:56:55.0673258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.0676036Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.0677335Z ^ 2025-05-07T19:56:55.0677577Z 2025-05-07T19:56:55.0678039Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:55.0678734Z 2025-05-07T19:56:55.0680458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.0683039Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.0684126Z ^ 2025-05-07T19:56:55.0684475Z 2025-05-07T19:56:55.0686042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.0688651Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.0689788Z ^ 2025-05-07T19:56:55.0689995Z 2025-05-07T19:56:55.0690374Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:55.0690998Z 2025-05-07T19:56:55.0692769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.0695435Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.0696444Z ^ 2025-05-07T19:56:55.0696761Z 2025-05-07T19:56:55.0698274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.0701166Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.0702234Z ^ 2025-05-07T19:56:55.0702460Z 2025-05-07T19:56:55.0702890Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:55.0703506Z 2025-05-07T19:56:55.0705252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.0707854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.0708962Z ^ 2025-05-07T19:56:55.0709303Z 2025-05-07T19:56:55.0710838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.0713468Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.0714952Z ^ 2025-05-07T19:56:55.0715211Z 2025-05-07T19:56:55.0715666Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:55.0716342Z 2025-05-07T19:56:55.0718239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.0721255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.0722867Z ^ 2025-05-07T19:56:55.0723248Z 2025-05-07T19:56:55.0725001Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.0727544Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.0728667Z ^ 2025-05-07T19:56:55.0728915Z 2025-05-07T19:56:55.0729356Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:55.0729997Z 2025-05-07T19:56:55.0731619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.0734226Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.0735405Z ^ 2025-05-07T19:56:55.0735768Z 2025-05-07T19:56:56.8575009Z [255/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:56.8597908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.8600653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.8601766Z ^ 2025-05-07T19:56:56.8602021Z 2025-05-07T19:56:56.8602433Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:56.8603096Z 2025-05-07T19:56:56.8604652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.8606941Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.8607983Z ^ 2025-05-07T19:56:56.8608272Z 2025-05-07T19:56:56.8609854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.8612618Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.8613760Z ^ 2025-05-07T19:56:56.8614023Z 2025-05-07T19:56:56.8614430Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:56.8615042Z 2025-05-07T19:56:56.8616652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.8619294Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.8620632Z ^ 2025-05-07T19:56:56.8620994Z 2025-05-07T19:56:56.8622914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.8625595Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.8626787Z ^ 2025-05-07T19:56:56.8627039Z 2025-05-07T19:56:56.8627494Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:56.8628187Z 2025-05-07T19:56:56.8629902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.8632499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.8633554Z ^ 2025-05-07T19:56:56.8634129Z 2025-05-07T19:56:56.8635791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.8638627Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.8639733Z ^ 2025-05-07T19:56:56.8639970Z 2025-05-07T19:56:56.8640388Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:56.8641037Z 2025-05-07T19:56:56.8642703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.8645342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.8646431Z ^ 2025-05-07T19:56:56.8646783Z 2025-05-07T19:56:56.8648486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.8651236Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.8652391Z ^ 2025-05-07T19:56:56.8652665Z 2025-05-07T19:56:56.8653080Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:56.8653749Z 2025-05-07T19:56:56.8655410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.8658079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.8659281Z ^ 2025-05-07T19:56:56.8659648Z 2025-05-07T19:56:56.9743574Z [256/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:56:56.9766683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.9769396Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.9770547Z ^ 2025-05-07T19:56:56.9770788Z 2025-05-07T19:56:56.9771253Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:56.9771839Z 2025-05-07T19:56:56.9773462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.9776052Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.9777310Z ^ 2025-05-07T19:56:56.9777699Z 2025-05-07T19:56:56.9779407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.9782343Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.9783604Z ^ 2025-05-07T19:56:56.9783869Z 2025-05-07T19:56:56.9784316Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:56.9785045Z 2025-05-07T19:56:56.9786834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.9789670Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.9790894Z ^ 2025-05-07T19:56:56.9791282Z 2025-05-07T19:56:56.9793024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.9795640Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.9796766Z ^ 2025-05-07T19:56:56.9797022Z 2025-05-07T19:56:56.9797463Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:56.9798147Z 2025-05-07T19:56:56.9799859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.9802857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.9804126Z ^ 2025-05-07T19:56:56.9804507Z 2025-05-07T19:56:56.9806419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.9809235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.9810547Z ^ 2025-05-07T19:56:56.9810810Z 2025-05-07T19:56:56.9811269Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:56.9811972Z 2025-05-07T19:56:56.9813770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.9816461Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.9817655Z ^ 2025-05-07T19:56:56.9817980Z 2025-05-07T19:56:56.9819932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.9822683Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.9823676Z ^ 2025-05-07T19:56:56.9823890Z 2025-05-07T19:56:56.9824332Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:56.9824978Z 2025-05-07T19:56:56.9826662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.9829440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.9830694Z ^ 2025-05-07T19:56:56.9831049Z 2025-05-07T19:57:03.6024068Z [257/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:57:03.6048432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6051028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6052230Z ^ 2025-05-07T19:57:03.6052468Z 2025-05-07T19:57:03.6052883Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.6053505Z 2025-05-07T19:57:03.6055130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6057805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6058962Z ^ 2025-05-07T19:57:03.6059321Z 2025-05-07T19:57:03.6061129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6063764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6064956Z ^ 2025-05-07T19:57:03.6065237Z 2025-05-07T19:57:03.6065700Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.6066346Z 2025-05-07T19:57:03.6068041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6070733Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6071956Z ^ 2025-05-07T19:57:03.6072314Z 2025-05-07T19:57:03.6073979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6076576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6077777Z ^ 2025-05-07T19:57:03.6078023Z 2025-05-07T19:57:03.6078464Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.6079307Z 2025-05-07T19:57:03.6080884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6083758Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6084977Z ^ 2025-05-07T19:57:03.6085353Z 2025-05-07T19:57:03.6087023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6089856Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6091016Z ^ 2025-05-07T19:57:03.6091296Z 2025-05-07T19:57:03.6091735Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.6092409Z 2025-05-07T19:57:03.6094124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6096894Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6098144Z ^ 2025-05-07T19:57:03.6098528Z 2025-05-07T19:57:03.6100335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6102640Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6103745Z ^ 2025-05-07T19:57:03.6103979Z 2025-05-07T19:57:03.6104404Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.6105069Z 2025-05-07T19:57:03.6106676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6109289Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6110395Z ^ 2025-05-07T19:57:03.6110773Z 2025-05-07T19:57:10.4902213Z [258/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:10.4924278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4926908Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.4928047Z ^ 2025-05-07T19:57:10.4928295Z 2025-05-07T19:57:10.4928757Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:10.4929351Z 2025-05-07T19:57:10.4930801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4933238Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.4934320Z ^ 2025-05-07T19:57:10.4934647Z 2025-05-07T19:57:10.4936079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4938598Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.4939834Z ^ 2025-05-07T19:57:10.4940105Z 2025-05-07T19:57:10.4940512Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:10.4941139Z 2025-05-07T19:57:10.4942708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4945214Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.4946388Z ^ 2025-05-07T19:57:10.4946747Z 2025-05-07T19:57:10.4948354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4951250Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.4952364Z ^ 2025-05-07T19:57:10.4952607Z 2025-05-07T19:57:10.4953024Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:10.4953682Z 2025-05-07T19:57:10.4955411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4957854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.4959085Z ^ 2025-05-07T19:57:10.4959459Z 2025-05-07T19:57:10.4961002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4963596Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.4964740Z ^ 2025-05-07T19:57:10.4965034Z 2025-05-07T19:57:10.4965483Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:10.4966120Z 2025-05-07T19:57:10.4967746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4970303Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.4971391Z ^ 2025-05-07T19:57:10.4971738Z 2025-05-07T19:57:10.4973208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4975618Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.4976728Z ^ 2025-05-07T19:57:10.4976966Z 2025-05-07T19:57:10.4977387Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:10.4978038Z 2025-05-07T19:57:10.4979556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4982292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.4983350Z ^ 2025-05-07T19:57:10.4983645Z 2025-05-07T19:57:12.3232997Z [259/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:57:12.3256265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3258935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3260213Z ^ 2025-05-07T19:57:12.3260455Z 2025-05-07T19:57:12.3260888Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.3261571Z 2025-05-07T19:57:12.3263212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3265911Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3267127Z ^ 2025-05-07T19:57:12.3267490Z 2025-05-07T19:57:12.3268907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3271446Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3272612Z ^ 2025-05-07T19:57:12.3272865Z 2025-05-07T19:57:12.3273289Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.3273930Z 2025-05-07T19:57:12.3275391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3278200Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3279597Z ^ 2025-05-07T19:57:12.3279957Z 2025-05-07T19:57:12.3281609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3284508Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3285709Z ^ 2025-05-07T19:57:12.3285989Z 2025-05-07T19:57:12.3286456Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.3287227Z 2025-05-07T19:57:12.3288843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3291426Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3292545Z ^ 2025-05-07T19:57:12.3292896Z 2025-05-07T19:57:12.3294531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3297235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3298410Z ^ 2025-05-07T19:57:12.3298657Z 2025-05-07T19:57:12.3299086Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.3299843Z 2025-05-07T19:57:12.3301496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3304252Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3305459Z ^ 2025-05-07T19:57:12.3305844Z 2025-05-07T19:57:12.3307419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3310065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3311154Z ^ 2025-05-07T19:57:12.3311402Z 2025-05-07T19:57:12.3311808Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.3312473Z 2025-05-07T19:57:12.3314113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3316743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3317964Z ^ 2025-05-07T19:57:12.3318322Z 2025-05-07T19:57:13.8608977Z [260/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:13.8633913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.8636423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.8637560Z ^ 2025-05-07T19:57:13.8637779Z 2025-05-07T19:57:13.8638199Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:13.8638860Z 2025-05-07T19:57:13.8640351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.8642785Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.8643909Z ^ 2025-05-07T19:57:13.8644282Z 2025-05-07T19:57:13.8645842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.8648232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.8649379Z ^ 2025-05-07T19:57:13.8649635Z 2025-05-07T19:57:13.8650077Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:13.8650739Z 2025-05-07T19:57:13.8652429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.8655447Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.8656626Z ^ 2025-05-07T19:57:13.8657209Z 2025-05-07T19:57:13.8658892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.8661587Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.8662522Z ^ 2025-05-07T19:57:13.8662724Z 2025-05-07T19:57:13.8663074Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:13.8663621Z 2025-05-07T19:57:13.8664928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.8667081Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.8668025Z ^ 2025-05-07T19:57:13.8668326Z 2025-05-07T19:57:13.8669623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.8671767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.8672706Z ^ 2025-05-07T19:57:13.8672931Z 2025-05-07T19:57:13.8673289Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:13.8673826Z 2025-05-07T19:57:13.8675209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.8677482Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.8678531Z ^ 2025-05-07T19:57:13.8678876Z 2025-05-07T19:57:13.8680413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.8682959Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.8684028Z ^ 2025-05-07T19:57:13.8684244Z 2025-05-07T19:57:13.8684640Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:13.8685248Z 2025-05-07T19:57:13.8686765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.8689121Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.8690189Z ^ 2025-05-07T19:57:13.8690521Z 2025-05-07T19:57:15.7390080Z [261/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:15.7413307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7415970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7417161Z ^ 2025-05-07T19:57:15.7417392Z 2025-05-07T19:57:15.7417815Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:15.7418399Z 2025-05-07T19:57:15.7420144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7422949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7423994Z ^ 2025-05-07T19:57:15.7424323Z 2025-05-07T19:57:15.7425806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7428429Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7429833Z ^ 2025-05-07T19:57:15.7430044Z 2025-05-07T19:57:15.7430465Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:15.7431081Z 2025-05-07T19:57:15.7433076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7435793Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7437007Z ^ 2025-05-07T19:57:15.7437502Z 2025-05-07T19:57:15.7439190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7441702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7442872Z ^ 2025-05-07T19:57:15.7443110Z 2025-05-07T19:57:15.7443521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:15.7444147Z 2025-05-07T19:57:15.7445713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7448325Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7449539Z ^ 2025-05-07T19:57:15.7449903Z 2025-05-07T19:57:15.7451615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7454291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7455476Z ^ 2025-05-07T19:57:15.7455740Z 2025-05-07T19:57:15.7456228Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:15.7456828Z 2025-05-07T19:57:15.7458404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7461211Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7462430Z ^ 2025-05-07T19:57:15.7462811Z 2025-05-07T19:57:15.7464415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7467120Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7468273Z ^ 2025-05-07T19:57:15.7468534Z 2025-05-07T19:57:15.7468986Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:15.7469638Z 2025-05-07T19:57:15.7471372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7474072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7475452Z ^ 2025-05-07T19:57:15.7475810Z 2025-05-07T19:57:19.7180878Z [262/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o 2025-05-07T19:57:19.7204709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.7207351Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.7208436Z ^ 2025-05-07T19:57:19.7208676Z 2025-05-07T19:57:19.7209092Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:19.7209749Z 2025-05-07T19:57:19.7211298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.7213908Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.7215097Z ^ 2025-05-07T19:57:19.7215462Z 2025-05-07T19:57:19.7217120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.7220246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.7221421Z ^ 2025-05-07T19:57:19.7221874Z 2025-05-07T19:57:19.7222543Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:19.7223210Z 2025-05-07T19:57:19.7224923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.7227778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.7228991Z ^ 2025-05-07T19:57:19.7229363Z 2025-05-07T19:57:19.7231008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.7233759Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.7234972Z ^ 2025-05-07T19:57:19.7235197Z 2025-05-07T19:57:19.7235591Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:19.7236235Z 2025-05-07T19:57:19.7237846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.7240378Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.7241566Z ^ 2025-05-07T19:57:19.7241935Z 2025-05-07T19:57:19.7243585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.7246265Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.7247462Z ^ 2025-05-07T19:57:19.7247730Z 2025-05-07T19:57:19.7248200Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:19.7248888Z 2025-05-07T19:57:19.7250635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.7253411Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.7254545Z ^ 2025-05-07T19:57:19.7254913Z 2025-05-07T19:57:19.7256526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.7259325Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.7260677Z ^ 2025-05-07T19:57:19.7260928Z 2025-05-07T19:57:19.7261375Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:19.7262048Z 2025-05-07T19:57:19.7264085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.7267096Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.7268343Z ^ 2025-05-07T19:57:19.7268715Z 2025-05-07T19:57:22.6091013Z [263/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o 2025-05-07T19:57:22.6114811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.6117474Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.6118542Z ^ 2025-05-07T19:57:22.6118772Z 2025-05-07T19:57:22.6119197Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:22.6119870Z 2025-05-07T19:57:22.6121536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.6124742Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.6125822Z ^ 2025-05-07T19:57:22.6126178Z 2025-05-07T19:57:22.6128109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.6130749Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.6131957Z ^ 2025-05-07T19:57:22.6132197Z 2025-05-07T19:57:22.6132636Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:22.6133293Z 2025-05-07T19:57:22.6134969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.6137656Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.6138830Z ^ 2025-05-07T19:57:22.6139213Z 2025-05-07T19:57:22.6140996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.6143682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.6144793Z ^ 2025-05-07T19:57:22.6145049Z 2025-05-07T19:57:22.6145476Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:22.6146111Z 2025-05-07T19:57:22.6147753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.6150308Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.6151378Z ^ 2025-05-07T19:57:22.6151704Z 2025-05-07T19:57:22.6153175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.6155554Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.6156718Z ^ 2025-05-07T19:57:22.6156961Z 2025-05-07T19:57:22.6157372Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:22.6158015Z 2025-05-07T19:57:22.6159553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.6162170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.6163358Z ^ 2025-05-07T19:57:22.6163737Z 2025-05-07T19:57:22.6165287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.6167774Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.6169225Z ^ 2025-05-07T19:57:22.6169488Z 2025-05-07T19:57:22.6169932Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:22.6170592Z 2025-05-07T19:57:22.6172384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.6175016Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.6176308Z ^ 2025-05-07T19:57:22.6176655Z 2025-05-07T19:57:24.8299020Z [264/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:24.8323229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8325986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:24.8327193Z ^ 2025-05-07T19:57:24.8327438Z 2025-05-07T19:57:24.8328135Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:24.8328758Z 2025-05-07T19:57:24.8330137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8332557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:24.8333634Z ^ 2025-05-07T19:57:24.8333976Z 2025-05-07T19:57:24.8335624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8338503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:24.8339819Z ^ 2025-05-07T19:57:24.8340086Z 2025-05-07T19:57:24.8340546Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:24.8341236Z 2025-05-07T19:57:24.8342995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8345817Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:24.8347009Z ^ 2025-05-07T19:57:24.8347378Z 2025-05-07T19:57:24.8349113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8351912Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:24.8353138Z ^ 2025-05-07T19:57:24.8353396Z 2025-05-07T19:57:24.8353850Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:24.8354471Z 2025-05-07T19:57:24.8356116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8358703Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:24.8359859Z ^ 2025-05-07T19:57:24.8360214Z 2025-05-07T19:57:24.8361779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8364368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:24.8365530Z ^ 2025-05-07T19:57:24.8365785Z 2025-05-07T19:57:24.8366223Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:24.8366830Z 2025-05-07T19:57:24.8368443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8370977Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:24.8372349Z ^ 2025-05-07T19:57:24.8372697Z 2025-05-07T19:57:24.8374274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8376887Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:24.8377968Z ^ 2025-05-07T19:57:24.8378209Z 2025-05-07T19:57:24.8378620Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:24.8379327Z 2025-05-07T19:57:24.8381095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8383775Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:24.8384955Z ^ 2025-05-07T19:57:24.8385307Z 2025-05-07T19:57:36.6671895Z [265/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:36.6694476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.6697565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:36.6698706Z ^ 2025-05-07T19:57:36.6698963Z 2025-05-07T19:57:36.6699824Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:36.6700489Z 2025-05-07T19:57:36.6702134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.6704902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:36.6705959Z ^ 2025-05-07T19:57:36.6706315Z 2025-05-07T19:57:36.6707951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.6710539Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:36.6711724Z ^ 2025-05-07T19:57:36.6711979Z 2025-05-07T19:57:36.6726401Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:36.6727189Z 2025-05-07T19:57:36.6728860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.6731557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:36.6732693Z ^ 2025-05-07T19:57:36.6733028Z 2025-05-07T19:57:36.6734700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.6737373Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:36.6738572Z ^ 2025-05-07T19:57:36.6738832Z 2025-05-07T19:57:36.6739253Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:36.6739997Z 2025-05-07T19:57:36.6741604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.6744305Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:36.6745538Z ^ 2025-05-07T19:57:36.6745898Z 2025-05-07T19:57:36.6747642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.6750295Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:36.6751362Z ^ 2025-05-07T19:57:36.6751574Z 2025-05-07T19:57:36.6751952Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:36.6752500Z 2025-05-07T19:57:36.6753925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.6756897Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:36.6758095Z ^ 2025-05-07T19:57:36.6758658Z 2025-05-07T19:57:36.6760383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.6763238Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:36.6764411Z ^ 2025-05-07T19:57:36.6764683Z 2025-05-07T19:57:36.6765143Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:36.6765833Z 2025-05-07T19:57:36.6767602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.6770409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:36.6771589Z ^ 2025-05-07T19:57:36.6771935Z 2025-05-07T19:57:39.3154372Z [266/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:39.3177771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.3180818Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.3182028Z ^ 2025-05-07T19:57:39.3182278Z 2025-05-07T19:57:39.3182739Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.3183522Z 2025-05-07T19:57:39.3185259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.3188050Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.3189254Z ^ 2025-05-07T19:57:39.3189625Z 2025-05-07T19:57:39.3191352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.3194075Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.3195299Z ^ 2025-05-07T19:57:39.3195554Z 2025-05-07T19:57:39.3196017Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.3196707Z 2025-05-07T19:57:39.3198418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.3201136Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.3202364Z ^ 2025-05-07T19:57:39.3202740Z 2025-05-07T19:57:39.3204449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.3207188Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.3208358Z ^ 2025-05-07T19:57:39.3208625Z 2025-05-07T19:57:39.3209070Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.3209750Z 2025-05-07T19:57:39.3211462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.3214164Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.3215371Z ^ 2025-05-07T19:57:39.3215735Z 2025-05-07T19:57:39.3217444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.3220266Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.3221448Z ^ 2025-05-07T19:57:39.3221662Z 2025-05-07T19:57:39.3222263Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.3222852Z 2025-05-07T19:57:39.3224651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.3227285Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.3228191Z ^ 2025-05-07T19:57:39.3228622Z 2025-05-07T19:57:39.3230174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.3232899Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.3234086Z ^ 2025-05-07T19:57:39.3234350Z 2025-05-07T19:57:39.3234790Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.3235457Z 2025-05-07T19:57:39.3237173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.3239903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.3241120Z ^ 2025-05-07T19:57:39.3241491Z 2025-05-07T19:57:42.0269442Z [267/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:42.0292013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0294860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:42.0295928Z ^ 2025-05-07T19:57:42.0296160Z 2025-05-07T19:57:42.0296572Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:42.0297209Z 2025-05-07T19:57:42.0298735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0301401Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:42.0302506Z ^ 2025-05-07T19:57:42.0302821Z 2025-05-07T19:57:42.0304383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0306849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:42.0307976Z ^ 2025-05-07T19:57:42.0308227Z 2025-05-07T19:57:42.0308649Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:42.0309256Z 2025-05-07T19:57:42.0310807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0313361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:42.0314475Z ^ 2025-05-07T19:57:42.0314811Z 2025-05-07T19:57:42.0316387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0318867Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:42.0319989Z ^ 2025-05-07T19:57:42.0320247Z 2025-05-07T19:57:42.0320660Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:42.0321274Z 2025-05-07T19:57:42.0323074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0325561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:42.0326691Z ^ 2025-05-07T19:57:42.0327039Z 2025-05-07T19:57:42.0328547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0331294Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:42.0332391Z ^ 2025-05-07T19:57:42.0332622Z 2025-05-07T19:57:42.0333285Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:42.0333938Z 2025-05-07T19:57:42.0335443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0338104Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:42.0339218Z ^ 2025-05-07T19:57:42.0339732Z 2025-05-07T19:57:42.0341227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0343746Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:42.0344825Z ^ 2025-05-07T19:57:42.0345087Z 2025-05-07T19:57:42.0345535Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:42.0346173Z 2025-05-07T19:57:42.0347759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0350214Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:42.0351422Z ^ 2025-05-07T19:57:42.0351772Z 2025-05-07T19:57:44.4534165Z [268/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:44.4555404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.4557899Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.4558969Z ^ 2025-05-07T19:57:44.4559210Z 2025-05-07T19:57:44.4559614Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:44.4560228Z 2025-05-07T19:57:44.4561727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.4564162Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.4565229Z ^ 2025-05-07T19:57:44.4565543Z 2025-05-07T19:57:44.4566990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.4569399Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.4570441Z ^ 2025-05-07T19:57:44.4570679Z 2025-05-07T19:57:44.4571077Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:44.4571701Z 2025-05-07T19:57:44.4573149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.4575554Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.4576643Z ^ 2025-05-07T19:57:44.4576984Z 2025-05-07T19:57:44.4578633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.4581163Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.4582222Z ^ 2025-05-07T19:57:44.4582448Z 2025-05-07T19:57:44.4582868Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:44.4583425Z 2025-05-07T19:57:44.4584805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.4587155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.4588433Z ^ 2025-05-07T19:57:44.4588770Z 2025-05-07T19:57:44.4590409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.4592765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.4593829Z ^ 2025-05-07T19:57:44.4594178Z 2025-05-07T19:57:44.4594555Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:44.4595110Z 2025-05-07T19:57:44.4596569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.4598903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.4599777Z ^ 2025-05-07T19:57:44.4600043Z 2025-05-07T19:57:44.4601315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.4603623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.4604721Z ^ 2025-05-07T19:57:44.4604939Z 2025-05-07T19:57:44.4605363Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:44.4605962Z 2025-05-07T19:57:44.4607405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.4609812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.4610881Z ^ 2025-05-07T19:57:44.4611214Z 2025-05-07T19:57:44.9493917Z [269/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:44.9517965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9520472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.9521636Z ^ 2025-05-07T19:57:44.9521927Z 2025-05-07T19:57:44.9522625Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:44.9523323Z 2025-05-07T19:57:44.9525070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9527826Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.9529065Z ^ 2025-05-07T19:57:44.9529437Z 2025-05-07T19:57:44.9531143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9534003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.9535260Z ^ 2025-05-07T19:57:44.9535526Z 2025-05-07T19:57:44.9536005Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:44.9536689Z 2025-05-07T19:57:44.9538453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9541410Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.9542633Z ^ 2025-05-07T19:57:44.9543012Z 2025-05-07T19:57:44.9544508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9546397Z int error_code = 0; 2025-05-07T19:57:44.9546845Z ^ 2025-05-07T19:57:44.9547083Z 2025-05-07T19:57:44.9548547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9550401Z int64_t error_value; 2025-05-07T19:57:44.9551091Z ^ 2025-05-07T19:57:44.9551309Z 2025-05-07T19:57:44.9552770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9554598Z int error_code = 0; 2025-05-07T19:57:44.9555237Z ^ 2025-05-07T19:57:44.9555459Z 2025-05-07T19:57:44.9556912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9558870Z int64_t error_value; 2025-05-07T19:57:44.9559336Z ^ 2025-05-07T19:57:44.9559575Z 2025-05-07T19:57:44.9561039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9562920Z int error_code = 0; 2025-05-07T19:57:44.9563364Z ^ 2025-05-07T19:57:44.9563591Z 2025-05-07T19:57:44.9565051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9566938Z int64_t error_value; 2025-05-07T19:57:44.9567399Z ^ 2025-05-07T19:57:44.9567640Z 2025-05-07T19:57:44.9569077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9570920Z int error_code = 0; 2025-05-07T19:57:44.9571348Z ^ 2025-05-07T19:57:44.9571569Z 2025-05-07T19:57:44.9573155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9575092Z int64_t error_value; 2025-05-07T19:57:44.9575562Z ^ 2025-05-07T19:57:44.9575794Z 2025-05-07T19:57:44.9577641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9580673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.9581958Z ^ 2025-05-07T19:57:44.9582238Z 2025-05-07T19:57:44.9582731Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:44.9583459Z 2025-05-07T19:57:44.9585298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9588239Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.9589536Z ^ 2025-05-07T19:57:44.9589918Z 2025-05-07T19:57:44.9591436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9593367Z int error_code = 0; 2025-05-07T19:57:44.9593809Z ^ 2025-05-07T19:57:44.9594049Z 2025-05-07T19:57:44.9595571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9597498Z int64_t error_value; 2025-05-07T19:57:44.9598109Z ^ 2025-05-07T19:57:44.9598348Z 2025-05-07T19:57:44.9599846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9601758Z int error_code = 0; 2025-05-07T19:57:44.9602357Z ^ 2025-05-07T19:57:44.9602578Z 2025-05-07T19:57:44.9604136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9606051Z int64_t error_value; 2025-05-07T19:57:44.9606636Z ^ 2025-05-07T19:57:44.9606875Z 2025-05-07T19:57:44.9608390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9610289Z int error_code = 0; 2025-05-07T19:57:44.9610752Z ^ 2025-05-07T19:57:44.9610967Z 2025-05-07T19:57:44.9612466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9614500Z int64_t error_value; 2025-05-07T19:57:44.9614965Z ^ 2025-05-07T19:57:44.9615209Z 2025-05-07T19:57:44.9616756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9618723Z int error_code = 0; 2025-05-07T19:57:44.9619179Z ^ 2025-05-07T19:57:44.9619405Z 2025-05-07T19:57:44.9621074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9623280Z int64_t error_value; 2025-05-07T19:57:44.9623725Z ^ 2025-05-07T19:57:44.9623961Z 2025-05-07T19:57:44.9625786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9628826Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.9630097Z ^ 2025-05-07T19:57:44.9630377Z 2025-05-07T19:57:44.9630865Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:44.9631628Z 2025-05-07T19:57:44.9633442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9636445Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.9637773Z ^ 2025-05-07T19:57:44.9638170Z 2025-05-07T19:57:44.9639740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9641724Z int error_code = 0; 2025-05-07T19:57:44.9642203Z ^ 2025-05-07T19:57:44.9642423Z 2025-05-07T19:57:44.9644004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9645964Z int64_t error_value; 2025-05-07T19:57:44.9646643Z ^ 2025-05-07T19:57:44.9646894Z 2025-05-07T19:57:44.9648445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9650431Z int error_code = 0; 2025-05-07T19:57:44.9650910Z ^ 2025-05-07T19:57:44.9651339Z 2025-05-07T19:57:44.9652883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9654853Z int64_t error_value; 2025-05-07T19:57:44.9655466Z ^ 2025-05-07T19:57:44.9655706Z 2025-05-07T19:57:44.9657245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9659345Z int error_code = 0; 2025-05-07T19:57:44.9659916Z ^ 2025-05-07T19:57:44.9660139Z 2025-05-07T19:57:44.9661661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9663627Z int64_t error_value; 2025-05-07T19:57:44.9664100Z ^ 2025-05-07T19:57:44.9664343Z 2025-05-07T19:57:44.9665928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9667901Z int error_code = 0; 2025-05-07T19:57:44.9668391Z ^ 2025-05-07T19:57:44.9668606Z 2025-05-07T19:57:44.9670186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9672175Z int64_t error_value; 2025-05-07T19:57:44.9672644Z ^ 2025-05-07T19:57:44.9672894Z 2025-05-07T19:57:44.9674761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9677746Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.9679065Z ^ 2025-05-07T19:57:44.9679326Z 2025-05-07T19:57:44.9679818Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:44.9680673Z 2025-05-07T19:57:44.9682607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9685684Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:44.9687072Z ^ 2025-05-07T19:57:44.9687478Z 2025-05-07T19:57:44.9689124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9691174Z int error_code = 0; 2025-05-07T19:57:44.9691676Z ^ 2025-05-07T19:57:44.9691904Z 2025-05-07T19:57:44.9693572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9695586Z int64_t error_value; 2025-05-07T19:57:44.9696077Z ^ 2025-05-07T19:57:44.9696484Z 2025-05-07T19:57:44.9697995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9700089Z int error_code = 0; 2025-05-07T19:57:44.9700556Z ^ 2025-05-07T19:57:44.9700771Z 2025-05-07T19:57:44.9702494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9704501Z int64_t error_value; 2025-05-07T19:57:44.9705028Z ^ 2025-05-07T19:57:44.9705294Z 2025-05-07T19:57:44.9706834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9708793Z int error_code = 0; 2025-05-07T19:57:44.9709279Z ^ 2025-05-07T19:57:44.9709504Z 2025-05-07T19:57:44.9711072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9713057Z int64_t error_value; 2025-05-07T19:57:44.9713538Z ^ 2025-05-07T19:57:44.9713791Z 2025-05-07T19:57:44.9715312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:44.9717261Z int error_code = 0; 2025-05-07T19:57:44.9717747Z ^ 2025-05-07T19:57:44.9717967Z 2025-05-07T19:57:44.9719517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:44.9721458Z int64_t error_value; 2025-05-07T19:57:44.9722168Z ^ 2025-05-07T19:57:44.9722393Z 2025-05-07T19:57:49.2667407Z [270/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:49.2694153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.2697109Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.2698411Z ^ 2025-05-07T19:57:49.2698689Z 2025-05-07T19:57:49.2699189Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:49.2700072Z 2025-05-07T19:57:49.2701896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.2705028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.2706366Z ^ 2025-05-07T19:57:49.2706768Z 2025-05-07T19:57:49.2708655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.2711679Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.2712898Z ^ 2025-05-07T19:57:49.2713177Z 2025-05-07T19:57:49.2713633Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:49.2714363Z 2025-05-07T19:57:49.2716317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.2719442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.2720800Z ^ 2025-05-07T19:57:49.2721204Z 2025-05-07T19:57:49.2723355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.2726384Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.2727702Z ^ 2025-05-07T19:57:49.2727975Z 2025-05-07T19:57:49.2728473Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:49.2729241Z 2025-05-07T19:57:49.2731164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.2734225Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.2735827Z ^ 2025-05-07T19:57:49.2736236Z 2025-05-07T19:57:49.2738262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.2741550Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.2742918Z ^ 2025-05-07T19:57:49.2745385Z 2025-05-07T19:57:49.2745891Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:49.2746664Z 2025-05-07T19:57:49.2748614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.2751709Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.2753108Z ^ 2025-05-07T19:57:49.2753527Z 2025-05-07T19:57:49.2755466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.2758565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.2759932Z ^ 2025-05-07T19:57:49.2760213Z 2025-05-07T19:57:49.2760730Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:49.2761517Z 2025-05-07T19:57:49.2763489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.2766558Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.2767735Z ^ 2025-05-07T19:57:49.2768146Z 2025-05-07T19:57:50.7880909Z [271/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:50.7897067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.7898920Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.7899901Z ^ 2025-05-07T19:57:50.7900109Z 2025-05-07T19:57:50.7900431Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.7900919Z 2025-05-07T19:57:50.7902144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.7904021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.7904863Z ^ 2025-05-07T19:57:50.7905116Z 2025-05-07T19:57:50.7906260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.7908294Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.7909259Z ^ 2025-05-07T19:57:50.7909474Z 2025-05-07T19:57:50.7909825Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.7910350Z 2025-05-07T19:57:50.7911670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.7913774Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.7914778Z ^ 2025-05-07T19:57:50.7915072Z 2025-05-07T19:57:50.7916176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.7917963Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.7918757Z ^ 2025-05-07T19:57:50.7918933Z 2025-05-07T19:57:50.7919253Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.7919916Z 2025-05-07T19:57:50.7921043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.7923375Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.7924245Z ^ 2025-05-07T19:57:50.7924502Z 2025-05-07T19:57:50.7925665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.7927749Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.7928598Z ^ 2025-05-07T19:57:50.7928782Z 2025-05-07T19:57:50.7929080Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.7929517Z 2025-05-07T19:57:50.7930656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.7932491Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.7933293Z ^ 2025-05-07T19:57:50.7933558Z 2025-05-07T19:57:50.7934645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.7936438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.7937238Z ^ 2025-05-07T19:57:50.7937427Z 2025-05-07T19:57:50.7937738Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.7938176Z 2025-05-07T19:57:50.7939345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.7941418Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.7942226Z ^ 2025-05-07T19:57:50.7942490Z 2025-05-07T19:57:51.6958498Z [272/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o 2025-05-07T19:57:51.6990964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.6994244Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.6995712Z ^ 2025-05-07T19:57:51.6996022Z 2025-05-07T19:57:51.6996551Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.6997338Z 2025-05-07T19:57:51.6999454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.7002994Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.7004591Z ^ 2025-05-07T19:57:51.7005028Z 2025-05-07T19:57:51.7007199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.7010470Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.7011901Z ^ 2025-05-07T19:57:51.7012204Z 2025-05-07T19:57:51.7012772Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.7013580Z 2025-05-07T19:57:51.7015622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.7019063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.7020673Z ^ 2025-05-07T19:57:51.7021127Z 2025-05-07T19:57:51.7023192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:51.7025495Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:51.7026182Z ^ 2025-05-07T19:57:51.7026741Z 2025-05-07T19:57:51.7028858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.7032289Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.7033738Z ^ 2025-05-07T19:57:51.7034060Z 2025-05-07T19:57:51.7034585Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.7035365Z 2025-05-07T19:57:51.7037365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.7040738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.7042218Z ^ 2025-05-07T19:57:51.7042652Z 2025-05-07T19:57:51.7044287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:51.7046430Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:51.7047100Z ^ 2025-05-07T19:57:51.7047412Z 2025-05-07T19:57:51.7048917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.7051473Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.7052572Z ^ 2025-05-07T19:57:51.7052811Z 2025-05-07T19:57:51.7053232Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.7054110Z 2025-05-07T19:57:51.7056404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.7060103Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.7061514Z ^ 2025-05-07T19:57:51.7061946Z 2025-05-07T19:57:51.7063443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:51.7065596Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:51.7066287Z ^ 2025-05-07T19:57:51.7066620Z 2025-05-07T19:57:51.7068825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.7072329Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.7073861Z ^ 2025-05-07T19:57:51.7074171Z 2025-05-07T19:57:51.7074715Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.7075554Z 2025-05-07T19:57:51.7077588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.7080857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.7082532Z ^ 2025-05-07T19:57:51.7082997Z 2025-05-07T19:57:51.7084686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:51.7086833Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:51.7087502Z ^ 2025-05-07T19:57:51.7087814Z 2025-05-07T19:57:52.4342501Z [273/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:57:52.4367619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.4370487Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:52.4371722Z ^ 2025-05-07T19:57:52.4371995Z 2025-05-07T19:57:52.4372465Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:52.4373191Z 2025-05-07T19:57:52.4374838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.4377912Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:52.4379149Z ^ 2025-05-07T19:57:52.4379668Z 2025-05-07T19:57:52.4381670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.4384503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:52.4385813Z ^ 2025-05-07T19:57:52.4386080Z 2025-05-07T19:57:52.4386557Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:52.4387251Z 2025-05-07T19:57:52.4388985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.4391852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:52.4393158Z ^ 2025-05-07T19:57:52.4393534Z 2025-05-07T19:57:52.4395272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.4398135Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:52.4399631Z ^ 2025-05-07T19:57:52.4399901Z 2025-05-07T19:57:52.4400368Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:52.4401090Z 2025-05-07T19:57:52.4402870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.4405730Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:52.4407021Z ^ 2025-05-07T19:57:52.4407403Z 2025-05-07T19:57:52.4409162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.4412014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:52.4413303Z ^ 2025-05-07T19:57:52.4413571Z 2025-05-07T19:57:52.4414056Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:52.4414757Z 2025-05-07T19:57:52.4416560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.4419448Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:52.4420804Z ^ 2025-05-07T19:57:52.4421214Z 2025-05-07T19:57:52.4423199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.4426208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:52.4427435Z ^ 2025-05-07T19:57:52.4427725Z 2025-05-07T19:57:52.4428187Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:52.4428882Z 2025-05-07T19:57:52.4430865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.4433652Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:52.4434932Z ^ 2025-05-07T19:57:52.4435308Z 2025-05-07T19:57:53.3885544Z [274/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o 2025-05-07T19:57:53.3909332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.3912355Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.3913587Z ^ 2025-05-07T19:57:53.3913794Z 2025-05-07T19:57:53.3914538Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.3915332Z 2025-05-07T19:57:53.3916990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.3919632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.3920789Z ^ 2025-05-07T19:57:53.3921145Z 2025-05-07T19:57:53.3923020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.3925762Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.3927009Z ^ 2025-05-07T19:57:53.3927282Z 2025-05-07T19:57:53.3927742Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.3928415Z 2025-05-07T19:57:53.3930255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.3933442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.3934856Z ^ 2025-05-07T19:57:53.3935289Z 2025-05-07T19:57:53.3937292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.3940340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.3941423Z ^ 2025-05-07T19:57:53.3941682Z 2025-05-07T19:57:53.3942118Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.3942750Z 2025-05-07T19:57:53.3944476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.3947172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.3948304Z ^ 2025-05-07T19:57:53.3948587Z 2025-05-07T19:57:53.3950182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.3952689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.3953838Z ^ 2025-05-07T19:57:53.3954085Z 2025-05-07T19:57:53.3954520Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.3955243Z 2025-05-07T19:57:53.3956849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.3959487Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.3960878Z ^ 2025-05-07T19:57:53.3961238Z 2025-05-07T19:57:53.3962749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.3965227Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.3966298Z ^ 2025-05-07T19:57:53.3966545Z 2025-05-07T19:57:53.3966960Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.3967686Z 2025-05-07T19:57:53.3969312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.3971910Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.3973105Z ^ 2025-05-07T19:57:53.3973471Z 2025-05-07T19:57:53.5392504Z [275/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o 2025-05-07T19:57:53.5417015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.5420342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.5421561Z ^ 2025-05-07T19:57:53.5421839Z 2025-05-07T19:57:53.5422771Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.5423429Z 2025-05-07T19:57:53.5425155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.5428044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.5429268Z ^ 2025-05-07T19:57:53.5429626Z 2025-05-07T19:57:53.5431462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.5434157Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.5435369Z ^ 2025-05-07T19:57:53.5435611Z 2025-05-07T19:57:53.5435958Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.5436493Z 2025-05-07T19:57:53.5437807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.5440311Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.5441436Z ^ 2025-05-07T19:57:53.5441787Z 2025-05-07T19:57:53.5443364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.5445927Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.5447077Z ^ 2025-05-07T19:57:53.5447332Z 2025-05-07T19:57:53.5447764Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.5448422Z 2025-05-07T19:57:53.5450230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.5453063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.5454331Z ^ 2025-05-07T19:57:53.5454702Z 2025-05-07T19:57:53.5456438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.5459231Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.5460708Z ^ 2025-05-07T19:57:53.5460961Z 2025-05-07T19:57:53.5461420Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.5462121Z 2025-05-07T19:57:53.5463875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.5466760Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.5468065Z ^ 2025-05-07T19:57:53.5468449Z 2025-05-07T19:57:53.5470144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.5472965Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.5474161Z ^ 2025-05-07T19:57:53.5474406Z 2025-05-07T19:57:53.5474870Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.5475551Z 2025-05-07T19:57:53.5477253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.5480110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.5481376Z ^ 2025-05-07T19:57:53.5481749Z 2025-05-07T19:57:59.0023939Z [276/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:59.0046968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0049600Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0050711Z ^ 2025-05-07T19:57:59.0050960Z 2025-05-07T19:57:59.0051541Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:59.0052156Z 2025-05-07T19:57:59.0053687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0056238Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0057406Z ^ 2025-05-07T19:57:59.0057748Z 2025-05-07T19:57:59.0059377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0062132Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0063311Z ^ 2025-05-07T19:57:59.0063572Z 2025-05-07T19:57:59.0064051Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:59.0064698Z 2025-05-07T19:57:59.0066331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0069015Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0070223Z ^ 2025-05-07T19:57:59.0070581Z 2025-05-07T19:57:59.0072159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0074738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0075919Z ^ 2025-05-07T19:57:59.0076180Z 2025-05-07T19:57:59.0076635Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:59.0077262Z 2025-05-07T19:57:59.0078705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0081340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0082457Z ^ 2025-05-07T19:57:59.0082796Z 2025-05-07T19:57:59.0084395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0087003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0088348Z ^ 2025-05-07T19:57:59.0088595Z 2025-05-07T19:57:59.0089055Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:59.0089682Z 2025-05-07T19:57:59.0091454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0093982Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0096533Z ^ 2025-05-07T19:57:59.0096902Z 2025-05-07T19:57:59.0098493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0101215Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0102274Z ^ 2025-05-07T19:57:59.0102552Z 2025-05-07T19:57:59.0102984Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:59.0103672Z 2025-05-07T19:57:59.0105259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0107810Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0108925Z ^ 2025-05-07T19:57:59.0109278Z 2025-05-07T19:58:05.9777121Z [277/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o 2025-05-07T19:58:05.9798917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:05.9802001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:05.9803341Z ^ 2025-05-07T19:58:05.9803687Z 2025-05-07T19:58:05.9804155Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:05.9804829Z 2025-05-07T19:58:05.9806525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:05.9809317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:05.9810473Z ^ 2025-05-07T19:58:05.9810821Z 2025-05-07T19:58:05.9812373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:05.9814952Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:05.9816103Z ^ 2025-05-07T19:58:05.9816350Z 2025-05-07T19:58:05.9816767Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:05.9817384Z 2025-05-07T19:58:05.9818990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:05.9821831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:05.9823179Z ^ 2025-05-07T19:58:05.9823536Z 2025-05-07T19:58:05.9825084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:05.9827646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:05.9828742Z ^ 2025-05-07T19:58:05.9828978Z 2025-05-07T19:58:05.9829540Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:05.9830149Z 2025-05-07T19:58:05.9831834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:05.9834559Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:05.9835700Z ^ 2025-05-07T19:58:05.9836074Z 2025-05-07T19:58:05.9837856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:05.9840729Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:05.9841690Z ^ 2025-05-07T19:58:05.9842189Z 2025-05-07T19:58:05.9842636Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:05.9843365Z 2025-05-07T19:58:05.9844727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:05.9847149Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:05.9848259Z ^ 2025-05-07T19:58:05.9848588Z 2025-05-07T19:58:05.9850072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:05.9852586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:05.9853761Z ^ 2025-05-07T19:58:05.9853993Z 2025-05-07T19:58:05.9854396Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:05.9854975Z 2025-05-07T19:58:05.9856518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:05.9859352Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:05.9860556Z ^ 2025-05-07T19:58:05.9860880Z 2025-05-07T19:58:18.9832313Z [278/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:18.9856915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9859793Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9860831Z ^ 2025-05-07T19:58:18.9861161Z 2025-05-07T19:58:18.9861645Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.9862321Z 2025-05-07T19:58:18.9864045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9866857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9868087Z ^ 2025-05-07T19:58:18.9868455Z 2025-05-07T19:58:18.9870131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9872757Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9873957Z ^ 2025-05-07T19:58:18.9874220Z 2025-05-07T19:58:18.9874652Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.9875342Z 2025-05-07T19:58:18.9877048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9879831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9881088Z ^ 2025-05-07T19:58:18.9881463Z 2025-05-07T19:58:18.9883181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9885927Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9887127Z ^ 2025-05-07T19:58:18.9887378Z 2025-05-07T19:58:18.9887827Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.9888518Z 2025-05-07T19:58:18.9890260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9893306Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9894501Z ^ 2025-05-07T19:58:18.9894868Z 2025-05-07T19:58:18.9896673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9899091Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9900370Z ^ 2025-05-07T19:58:18.9900771Z 2025-05-07T19:58:18.9901177Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.9901763Z 2025-05-07T19:58:18.9903410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9906071Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9907265Z ^ 2025-05-07T19:58:18.9907639Z 2025-05-07T19:58:18.9909288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9912009Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9913212Z ^ 2025-05-07T19:58:18.9913466Z 2025-05-07T19:58:18.9913911Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.9914590Z 2025-05-07T19:58:18.9916219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9918976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9920175Z ^ 2025-05-07T19:58:18.9920556Z 2025-05-07T19:58:21.1007011Z [279/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:21.1032328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.1035213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.1036405Z ^ 2025-05-07T19:58:21.1036644Z 2025-05-07T19:58:21.1037131Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:21.1037824Z 2025-05-07T19:58:21.1039595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.1042217Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.1043456Z ^ 2025-05-07T19:58:21.1043832Z 2025-05-07T19:58:21.1045583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.1048349Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.1049578Z ^ 2025-05-07T19:58:21.1049850Z 2025-05-07T19:58:21.1050228Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:21.1050924Z 2025-05-07T19:58:21.1052708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.1055517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.1056769Z ^ 2025-05-07T19:58:21.1057144Z 2025-05-07T19:58:21.1058779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.1061702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.1062925Z ^ 2025-05-07T19:58:21.1063152Z 2025-05-07T19:58:21.1063604Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:21.1064549Z 2025-05-07T19:58:21.1066304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.1069169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.1070349Z ^ 2025-05-07T19:58:21.1070732Z 2025-05-07T19:58:21.1072464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.1075358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.1076479Z ^ 2025-05-07T19:58:21.1076750Z 2025-05-07T19:58:21.1077213Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:21.1077909Z 2025-05-07T19:58:21.1079672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.1082453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.1083708Z ^ 2025-05-07T19:58:21.1084073Z 2025-05-07T19:58:21.1085633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.1088457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.1089692Z ^ 2025-05-07T19:58:21.1089946Z 2025-05-07T19:58:21.1090407Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:21.1091047Z 2025-05-07T19:58:21.1092795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.1095510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.1096755Z ^ 2025-05-07T19:58:21.1097130Z 2025-05-07T19:58:22.1973740Z [280/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o 2025-05-07T19:58:22.1995415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:22.1998246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:22.1999384Z ^ 2025-05-07T19:58:22.1999635Z 2025-05-07T19:58:22.2000063Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:22.2000702Z 2025-05-07T19:58:22.2002291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:22.2004768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:22.2005881Z ^ 2025-05-07T19:58:22.2006231Z 2025-05-07T19:58:22.2007743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:22.2010276Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:22.2011460Z ^ 2025-05-07T19:58:22.2011717Z 2025-05-07T19:58:22.2012169Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:22.2012824Z 2025-05-07T19:58:22.2014540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:22.2017335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:22.2018577Z ^ 2025-05-07T19:58:22.2018952Z 2025-05-07T19:58:22.2020361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:22.2022521Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:22.2023047Z ^ 2025-05-07T19:58:22.2023280Z 2025-05-07T19:58:22.2024687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:22.2027452Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:22.2028555Z ^ 2025-05-07T19:58:22.2028800Z 2025-05-07T19:58:22.2029203Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:22.2029886Z 2025-05-07T19:58:22.2031485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:22.2034161Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:22.2035401Z ^ 2025-05-07T19:58:22.2035773Z 2025-05-07T19:58:22.2037184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:22.2039007Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:22.2039532Z ^ 2025-05-07T19:58:22.2039761Z 2025-05-07T19:58:22.2041275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:22.2043561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:22.2044666Z ^ 2025-05-07T19:58:22.2044898Z 2025-05-07T19:58:22.2045353Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:22.2046021Z 2025-05-07T19:58:22.2047736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:22.2050411Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:22.2051587Z ^ 2025-05-07T19:58:22.2051935Z 2025-05-07T19:58:22.2053312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:22.2055099Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:22.2055651Z ^ 2025-05-07T19:58:22.2055888Z 2025-05-07T19:58:22.2057549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:22.2060425Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:22.2061637Z ^ 2025-05-07T19:58:22.2061900Z 2025-05-07T19:58:22.2062347Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:22.2063031Z 2025-05-07T19:58:22.2064667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:22.2067752Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:22.2068973Z ^ 2025-05-07T19:58:22.2069342Z 2025-05-07T19:58:22.2070897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:22.2072562Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:22.2073060Z ^ 2025-05-07T19:58:22.2073351Z 2025-05-07T19:58:23.8685807Z [281/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:23.8710371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8713119Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8714260Z ^ 2025-05-07T19:58:23.8714507Z 2025-05-07T19:58:23.8714948Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.8715627Z 2025-05-07T19:58:23.8717310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8720401Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8721840Z ^ 2025-05-07T19:58:23.8722502Z 2025-05-07T19:58:23.8724199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8727056Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8728162Z ^ 2025-05-07T19:58:23.8728419Z 2025-05-07T19:58:23.8728847Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.8729528Z 2025-05-07T19:58:23.8731258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8733942Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8735117Z ^ 2025-05-07T19:58:23.8735462Z 2025-05-07T19:58:23.8737040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8739837Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8741013Z ^ 2025-05-07T19:58:23.8741270Z 2025-05-07T19:58:23.8741730Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.8742425Z 2025-05-07T19:58:23.8744181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8746880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8748081Z ^ 2025-05-07T19:58:23.8748450Z 2025-05-07T19:58:23.8750148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8752907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8754149Z ^ 2025-05-07T19:58:23.8754406Z 2025-05-07T19:58:23.8754878Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.8755561Z 2025-05-07T19:58:23.8757210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8760004Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8761125Z ^ 2025-05-07T19:58:23.8761492Z 2025-05-07T19:58:23.8763196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8766237Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8767478Z ^ 2025-05-07T19:58:23.8770146Z 2025-05-07T19:58:23.8770676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.8771378Z 2025-05-07T19:58:23.8773130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8776031Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8777288Z ^ 2025-05-07T19:58:23.8777664Z 2025-05-07T19:58:25.4013647Z [282/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:25.4036803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.4039505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.4041041Z ^ 2025-05-07T19:58:25.4041318Z 2025-05-07T19:58:25.4041780Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:25.4042441Z 2025-05-07T19:58:25.4044413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.4047098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.4048309Z ^ 2025-05-07T19:58:25.4048819Z 2025-05-07T19:58:25.4050477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.4052937Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.4053973Z ^ 2025-05-07T19:58:25.4054216Z 2025-05-07T19:58:25.4054670Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:25.4055233Z 2025-05-07T19:58:25.4056777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.4059231Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.4060475Z ^ 2025-05-07T19:58:25.4060840Z 2025-05-07T19:58:25.4062289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.4064678Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.4065744Z ^ 2025-05-07T19:58:25.4065990Z 2025-05-07T19:58:25.4066382Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:25.4066976Z 2025-05-07T19:58:25.4068531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.4071027Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.4072150Z ^ 2025-05-07T19:58:25.4072494Z 2025-05-07T19:58:25.4073994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.4076450Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.4077525Z ^ 2025-05-07T19:58:25.4077759Z 2025-05-07T19:58:25.4078199Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:25.4078805Z 2025-05-07T19:58:25.4080415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.4083041Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.4084441Z ^ 2025-05-07T19:58:25.4084803Z 2025-05-07T19:58:25.4086537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.4089197Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.4090357Z ^ 2025-05-07T19:58:25.4090755Z 2025-05-07T19:58:25.4091184Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:25.4091839Z 2025-05-07T19:58:25.4093372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.4095788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.4096888Z ^ 2025-05-07T19:58:25.4097237Z 2025-05-07T19:58:26.0613168Z [283/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o 2025-05-07T19:58:26.0636260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.0653240Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.0654415Z ^ 2025-05-07T19:58:26.0654678Z 2025-05-07T19:58:26.0655396Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.0656070Z 2025-05-07T19:58:26.0657760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.0660671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.0661872Z ^ 2025-05-07T19:58:26.0662252Z 2025-05-07T19:58:26.0663898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.0666569Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.0667762Z ^ 2025-05-07T19:58:26.0668001Z 2025-05-07T19:58:26.0668431Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.0669121Z 2025-05-07T19:58:26.0670796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.0673492Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.0674677Z ^ 2025-05-07T19:58:26.0675054Z 2025-05-07T19:58:26.0676724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.0679401Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.0680554Z ^ 2025-05-07T19:58:26.0680822Z 2025-05-07T19:58:26.0681250Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.0681913Z 2025-05-07T19:58:26.0683517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.0686172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.0687360Z ^ 2025-05-07T19:58:26.0687719Z 2025-05-07T19:58:26.0689365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.0692022Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.0693200Z ^ 2025-05-07T19:58:26.0693435Z 2025-05-07T19:58:26.0693884Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.0694528Z 2025-05-07T19:58:26.0696200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.0699034Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.0700479Z ^ 2025-05-07T19:58:26.0700855Z 2025-05-07T19:58:26.0702495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.0705210Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.0706361Z ^ 2025-05-07T19:58:26.0706621Z 2025-05-07T19:58:26.0707083Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.0707716Z 2025-05-07T19:58:26.0709345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.0712028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.0713185Z ^ 2025-05-07T19:58:26.0713547Z 2025-05-07T19:58:26.6165946Z [284/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:26.6190141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6192836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6194011Z ^ 2025-05-07T19:58:26.6194267Z 2025-05-07T19:58:26.6194816Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.6195468Z 2025-05-07T19:58:26.6197132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6199865Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6201012Z ^ 2025-05-07T19:58:26.6201383Z 2025-05-07T19:58:26.6203056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6205704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6206879Z ^ 2025-05-07T19:58:26.6207147Z 2025-05-07T19:58:26.6207576Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.6208264Z 2025-05-07T19:58:26.6209935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6212618Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6213804Z ^ 2025-05-07T19:58:26.6214172Z 2025-05-07T19:58:26.6215829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6218631Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6219957Z ^ 2025-05-07T19:58:26.6220201Z 2025-05-07T19:58:26.6220627Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.6221325Z 2025-05-07T19:58:26.6223301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6225974Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6226971Z ^ 2025-05-07T19:58:26.6227288Z 2025-05-07T19:58:26.6228764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6231495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6232789Z ^ 2025-05-07T19:58:26.6233052Z 2025-05-07T19:58:26.6233518Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.6234216Z 2025-05-07T19:58:26.6236044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6238820Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6240127Z ^ 2025-05-07T19:58:26.6240498Z 2025-05-07T19:58:26.6242142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6244845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6246046Z ^ 2025-05-07T19:58:26.6246282Z 2025-05-07T19:58:26.6246730Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.6247399Z 2025-05-07T19:58:26.6249095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6251752Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6252935Z ^ 2025-05-07T19:58:26.6253284Z 2025-05-07T19:58:26.6873961Z [285/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 2025-05-07T19:58:26.6897646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6900654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6901797Z ^ 2025-05-07T19:58:26.6902056Z 2025-05-07T19:58:26.6902533Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.6903181Z 2025-05-07T19:58:26.6904879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6907579Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6908791Z ^ 2025-05-07T19:58:26.6909129Z 2025-05-07T19:58:26.6910768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6913429Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6914605Z ^ 2025-05-07T19:58:26.6914861Z 2025-05-07T19:58:26.6915320Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.6915971Z 2025-05-07T19:58:26.6917626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6920328Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6921515Z ^ 2025-05-07T19:58:26.6921895Z 2025-05-07T19:58:26.6923767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6926440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6927602Z ^ 2025-05-07T19:58:26.6927854Z 2025-05-07T19:58:26.6928291Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.6928966Z 2025-05-07T19:58:26.6930653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6933294Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6934474Z ^ 2025-05-07T19:58:26.6934819Z 2025-05-07T19:58:26.6936474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6939339Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6940798Z ^ 2025-05-07T19:58:26.6941044Z 2025-05-07T19:58:26.6941460Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.6942129Z 2025-05-07T19:58:26.6943739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6946553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6947765Z ^ 2025-05-07T19:58:26.6948137Z 2025-05-07T19:58:26.6949756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6952390Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6953548Z ^ 2025-05-07T19:58:26.6953791Z 2025-05-07T19:58:26.6954232Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.6954892Z 2025-05-07T19:58:26.6956564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.6959296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.6960483Z ^ 2025-05-07T19:58:26.6960821Z 2025-05-07T19:58:26.7763993Z [286/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:26.7787608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.7790307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.7791476Z ^ 2025-05-07T19:58:26.7791734Z 2025-05-07T19:58:26.7792174Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.7792870Z 2025-05-07T19:58:26.7794543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.7797265Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.7798432Z ^ 2025-05-07T19:58:26.7798815Z 2025-05-07T19:58:26.7800477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.7802922Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.7804091Z ^ 2025-05-07T19:58:26.7804346Z 2025-05-07T19:58:26.7804785Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.7805436Z 2025-05-07T19:58:26.7807114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.7809812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.7811009Z ^ 2025-05-07T19:58:26.7811367Z 2025-05-07T19:58:26.7813014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.7815703Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.7816867Z ^ 2025-05-07T19:58:26.7817119Z 2025-05-07T19:58:26.7817569Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.7818233Z 2025-05-07T19:58:26.7820055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.7822926Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.7824353Z ^ 2025-05-07T19:58:26.7824714Z 2025-05-07T19:58:26.7826547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.7829235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.7830382Z ^ 2025-05-07T19:58:26.7830747Z 2025-05-07T19:58:26.7831169Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.7831805Z 2025-05-07T19:58:26.7833503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.7836169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.7837365Z ^ 2025-05-07T19:58:26.7837723Z 2025-05-07T19:58:26.7839369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.7842026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.7843232Z ^ 2025-05-07T19:58:26.7843488Z 2025-05-07T19:58:26.7843944Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.7844602Z 2025-05-07T19:58:26.7846340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.7849041Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.7850202Z ^ 2025-05-07T19:58:26.7850566Z 2025-05-07T19:58:31.0486182Z [287/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:31.0508741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.0511406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.0512485Z ^ 2025-05-07T19:58:31.0512738Z 2025-05-07T19:58:31.0513152Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:31.0513791Z 2025-05-07T19:58:31.0515422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.0517967Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.0519150Z ^ 2025-05-07T19:58:31.0519479Z 2025-05-07T19:58:31.0521052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.0523882Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.0525051Z ^ 2025-05-07T19:58:31.0525298Z 2025-05-07T19:58:31.0525717Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:31.0526383Z 2025-05-07T19:58:31.0528001Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.0530576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.0531699Z ^ 2025-05-07T19:58:31.0532087Z 2025-05-07T19:58:31.0533657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.0536266Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.0537398Z ^ 2025-05-07T19:58:31.0537646Z 2025-05-07T19:58:31.0538062Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:31.0539032Z 2025-05-07T19:58:31.0540680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.0543556Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.0544680Z ^ 2025-05-07T19:58:31.0545032Z 2025-05-07T19:58:31.0546604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.0549353Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.0550535Z ^ 2025-05-07T19:58:31.0550769Z 2025-05-07T19:58:31.0551202Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:31.0551851Z 2025-05-07T19:58:31.0553426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.0555968Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.0557100Z ^ 2025-05-07T19:58:31.0557475Z 2025-05-07T19:58:31.0559073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.0561664Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.0562734Z ^ 2025-05-07T19:58:31.0562984Z 2025-05-07T19:58:31.0563397Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:31.0564017Z 2025-05-07T19:58:31.0565624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.0568147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.0569311Z ^ 2025-05-07T19:58:31.0569663Z 2025-05-07T19:58:31.6834440Z [288/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:31.6855999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.6858565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.6859861Z ^ 2025-05-07T19:58:31.6860110Z 2025-05-07T19:58:31.6860508Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:31.6861139Z 2025-05-07T19:58:31.6862619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.6865070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.6866163Z ^ 2025-05-07T19:58:31.6866515Z 2025-05-07T19:58:31.6867984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.6870313Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.6871444Z ^ 2025-05-07T19:58:31.6871684Z 2025-05-07T19:58:31.6872108Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:31.6872755Z 2025-05-07T19:58:31.6874377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.6876893Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.6878002Z ^ 2025-05-07T19:58:31.6878353Z 2025-05-07T19:58:31.6879872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.6882318Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.6883662Z ^ 2025-05-07T19:58:31.6883889Z 2025-05-07T19:58:31.6884308Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:31.6884900Z 2025-05-07T19:58:31.6886677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.6889234Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.6890467Z ^ 2025-05-07T19:58:31.6890817Z 2025-05-07T19:58:31.6892375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.6894886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.6895985Z ^ 2025-05-07T19:58:31.6896238Z 2025-05-07T19:58:31.6896674Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:31.6897314Z 2025-05-07T19:58:31.6898875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.6901469Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.6902512Z ^ 2025-05-07T19:58:31.6902862Z 2025-05-07T19:58:31.6904313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.6906774Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.6907816Z ^ 2025-05-07T19:58:31.6908050Z 2025-05-07T19:58:31.6908475Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:31.6909090Z 2025-05-07T19:58:31.6910636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.6913229Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.6914358Z ^ 2025-05-07T19:58:31.6914718Z 2025-05-07T19:58:37.5280763Z [289/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:37.5304068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.5306748Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.5307792Z ^ 2025-05-07T19:58:37.5308082Z 2025-05-07T19:58:37.5308543Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.5309231Z 2025-05-07T19:58:37.5310728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.5313314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.5314413Z ^ 2025-05-07T19:58:37.5314731Z 2025-05-07T19:58:37.5316421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.5318766Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.5319834Z ^ 2025-05-07T19:58:37.5320072Z 2025-05-07T19:58:37.5320510Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.5321105Z 2025-05-07T19:58:37.5323035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.5325680Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.5326914Z ^ 2025-05-07T19:58:37.5327616Z 2025-05-07T19:58:37.5329017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.5331888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.5332996Z ^ 2025-05-07T19:58:37.5333264Z 2025-05-07T19:58:37.5333745Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.5334433Z 2025-05-07T19:58:37.5335959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.5338753Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.5340196Z ^ 2025-05-07T19:58:37.5340504Z 2025-05-07T19:58:37.5342069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.5344627Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.5345847Z ^ 2025-05-07T19:58:37.5346111Z 2025-05-07T19:58:37.5346580Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.5347139Z 2025-05-07T19:58:37.5348538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.5351139Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.5352388Z ^ 2025-05-07T19:58:37.5352765Z 2025-05-07T19:58:37.5354163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.5356776Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.5357781Z ^ 2025-05-07T19:58:37.5358008Z 2025-05-07T19:58:37.5358437Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.5359126Z 2025-05-07T19:58:37.5360685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.5363270Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.5364320Z ^ 2025-05-07T19:58:37.5364629Z 2025-05-07T19:58:38.0579989Z [290/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:38.0603275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.0606213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.0607439Z ^ 2025-05-07T19:58:38.0607712Z 2025-05-07T19:58:38.0608189Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.0608888Z 2025-05-07T19:58:38.0610520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.0613115Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.0614340Z ^ 2025-05-07T19:58:38.0614707Z 2025-05-07T19:58:38.0616392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.0619168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.0620423Z ^ 2025-05-07T19:58:38.0620661Z 2025-05-07T19:58:38.0621113Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.0621793Z 2025-05-07T19:58:38.0623641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.0626614Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.0627835Z ^ 2025-05-07T19:58:38.0628185Z 2025-05-07T19:58:38.0630108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.0632910Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.0634238Z ^ 2025-05-07T19:58:38.0634531Z 2025-05-07T19:58:38.0634994Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.0635695Z 2025-05-07T19:58:38.0637454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.0640156Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.0641363Z ^ 2025-05-07T19:58:38.0641705Z 2025-05-07T19:58:38.0643308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.0645927Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.0647172Z ^ 2025-05-07T19:58:38.0647437Z 2025-05-07T19:58:38.0647856Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.0648582Z 2025-05-07T19:58:38.0650306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.0652905Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.0654045Z ^ 2025-05-07T19:58:38.0654426Z 2025-05-07T19:58:38.0656054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.0658628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.0660015Z ^ 2025-05-07T19:58:38.0660304Z 2025-05-07T19:58:38.0660779Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.0661486Z 2025-05-07T19:58:38.0663163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.0665971Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.0667254Z ^ 2025-05-07T19:58:38.0667644Z 2025-05-07T19:58:38.3342932Z [291/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:38.3360309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.3362283Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.3363177Z ^ 2025-05-07T19:58:38.3363368Z 2025-05-07T19:58:38.3363692Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.3364202Z 2025-05-07T19:58:38.3365418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.3367346Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.3368216Z ^ 2025-05-07T19:58:38.3368486Z 2025-05-07T19:58:38.3369658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.3371534Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.3372402Z ^ 2025-05-07T19:58:38.3372750Z 2025-05-07T19:58:38.3373093Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.3373573Z 2025-05-07T19:58:38.3374879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.3376836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.3377682Z ^ 2025-05-07T19:58:38.3377952Z 2025-05-07T19:58:38.3379137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.3381278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.3382150Z ^ 2025-05-07T19:58:38.3382364Z 2025-05-07T19:58:38.3382707Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.3383196Z 2025-05-07T19:58:38.3384474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.3386458Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.3387435Z ^ 2025-05-07T19:58:38.3387735Z 2025-05-07T19:58:38.3389047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.3391218Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.3392317Z ^ 2025-05-07T19:58:38.3392569Z 2025-05-07T19:58:38.3393033Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.3393638Z 2025-05-07T19:58:38.3395127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.3397794Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.3398889Z ^ 2025-05-07T19:58:38.3399188Z 2025-05-07T19:58:38.3400396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.3402342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.3403250Z ^ 2025-05-07T19:58:38.3403452Z 2025-05-07T19:58:38.3403771Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.3404252Z 2025-05-07T19:58:38.3405508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.3407476Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.3408508Z ^ 2025-05-07T19:58:38.3408767Z 2025-05-07T19:58:38.4863919Z [292/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:58:38.4881510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.4883574Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.4884474Z ^ 2025-05-07T19:58:38.4884673Z 2025-05-07T19:58:38.4885012Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.4885505Z 2025-05-07T19:58:38.4886728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.4888675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.4889605Z ^ 2025-05-07T19:58:38.4889866Z 2025-05-07T19:58:38.4891084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.4893275Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.4894178Z ^ 2025-05-07T19:58:38.4894403Z 2025-05-07T19:58:38.4894756Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.4895417Z 2025-05-07T19:58:38.4896650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.4898670Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.4899699Z ^ 2025-05-07T19:58:38.4899983Z 2025-05-07T19:58:38.4901169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.4903063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.4903916Z ^ 2025-05-07T19:58:38.4904101Z 2025-05-07T19:58:38.4904432Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.4904926Z 2025-05-07T19:58:38.4906116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.4907998Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.4908829Z ^ 2025-05-07T19:58:38.4909108Z 2025-05-07T19:58:38.4910258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.4912169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.4913021Z ^ 2025-05-07T19:58:38.4913232Z 2025-05-07T19:58:38.4913552Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.4914037Z 2025-05-07T19:58:38.4915243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.4917167Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.4918030Z ^ 2025-05-07T19:58:38.4918301Z 2025-05-07T19:58:38.4919549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.4921568Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.4922761Z ^ 2025-05-07T19:58:38.4922958Z 2025-05-07T19:58:38.4923284Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.4923833Z 2025-05-07T19:58:38.4925129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.4927500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.4928411Z ^ 2025-05-07T19:58:38.4928713Z 2025-05-07T19:58:39.3882095Z [293/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:39.3899074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.3901157Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.3902042Z ^ 2025-05-07T19:58:39.3902229Z 2025-05-07T19:58:39.3902573Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:39.3903063Z 2025-05-07T19:58:39.3904251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.3906171Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.3907273Z ^ 2025-05-07T19:58:39.3907564Z 2025-05-07T19:58:39.3908822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.3910947Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.3911908Z ^ 2025-05-07T19:58:39.3912119Z 2025-05-07T19:58:39.3912467Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:39.3914142Z 2025-05-07T19:58:39.3915512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.3917896Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.3919025Z ^ 2025-05-07T19:58:39.3919333Z 2025-05-07T19:58:39.3920824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.3923765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.3924693Z ^ 2025-05-07T19:58:39.3924887Z 2025-05-07T19:58:39.3925231Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:39.3925736Z 2025-05-07T19:58:39.3926926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.3928869Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.3929702Z ^ 2025-05-07T19:58:39.3929974Z 2025-05-07T19:58:39.3931195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.3933102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.3933938Z ^ 2025-05-07T19:58:39.3934140Z 2025-05-07T19:58:39.3934473Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:39.3934975Z 2025-05-07T19:58:39.3936197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.3938134Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.3939033Z ^ 2025-05-07T19:58:39.3939304Z 2025-05-07T19:58:39.3940606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.3942552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.3943409Z ^ 2025-05-07T19:58:39.3943803Z 2025-05-07T19:58:39.3944125Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:39.3944647Z 2025-05-07T19:58:39.3945964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.3947989Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.3948853Z ^ 2025-05-07T19:58:39.3949160Z 2025-05-07T19:58:39.6763798Z [294/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:39.6780095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.6781965Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.6782872Z ^ 2025-05-07T19:58:39.6783083Z 2025-05-07T19:58:39.6783471Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:39.6784108Z 2025-05-07T19:58:39.6785429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.6788051Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.6789138Z ^ 2025-05-07T19:58:39.6789696Z 2025-05-07T19:58:39.6791248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.6794169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.6795616Z ^ 2025-05-07T19:58:39.6795895Z 2025-05-07T19:58:39.6796393Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:39.6797118Z 2025-05-07T19:58:39.6798865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.6801017Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.6801856Z ^ 2025-05-07T19:58:39.6802131Z 2025-05-07T19:58:39.6803268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.6805143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.6805978Z ^ 2025-05-07T19:58:39.6806215Z 2025-05-07T19:58:39.6806552Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:39.6807088Z 2025-05-07T19:58:39.6808537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.6810961Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.6812031Z ^ 2025-05-07T19:58:39.6812366Z 2025-05-07T19:58:39.6813892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.6816291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.6817427Z ^ 2025-05-07T19:58:39.6817682Z 2025-05-07T19:58:39.6818119Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:39.6818761Z 2025-05-07T19:58:39.6820526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.6823402Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.6824554Z ^ 2025-05-07T19:58:39.6824923Z 2025-05-07T19:58:39.6826499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.6829358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.6830473Z ^ 2025-05-07T19:58:39.6830754Z 2025-05-07T19:58:39.6831411Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:39.6832070Z 2025-05-07T19:58:39.6833526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.6836186Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.6837354Z ^ 2025-05-07T19:58:39.6837708Z 2025-05-07T19:58:44.1792823Z [295/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:44.1814561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.1817107Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.1818567Z ^ 2025-05-07T19:58:44.1818780Z 2025-05-07T19:58:44.1819214Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:44.1819925Z 2025-05-07T19:58:44.1821756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.1824576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.1825896Z ^ 2025-05-07T19:58:44.1826235Z 2025-05-07T19:58:44.1827835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.1830468Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.1831661Z ^ 2025-05-07T19:58:44.1831907Z 2025-05-07T19:58:44.1832324Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:44.1832958Z 2025-05-07T19:58:44.1834539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.1837367Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.1838427Z ^ 2025-05-07T19:58:44.1838732Z 2025-05-07T19:58:44.1840156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.1842344Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.1843370Z ^ 2025-05-07T19:58:44.1843604Z 2025-05-07T19:58:44.1844058Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:44.1844691Z 2025-05-07T19:58:44.1846206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.1848698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.1849871Z ^ 2025-05-07T19:58:44.1850288Z 2025-05-07T19:58:44.1851769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.1854384Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.1855479Z ^ 2025-05-07T19:58:44.1855761Z 2025-05-07T19:58:44.1856198Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:44.1856862Z 2025-05-07T19:58:44.1858408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.1860902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.1862264Z ^ 2025-05-07T19:58:44.1862606Z 2025-05-07T19:58:44.1864438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.1866904Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.1868087Z ^ 2025-05-07T19:58:44.1868466Z 2025-05-07T19:58:44.1868870Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:44.1869522Z 2025-05-07T19:58:44.1870923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.1873613Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.1874723Z ^ 2025-05-07T19:58:44.1875087Z 2025-05-07T19:58:47.2012612Z [296/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:47.2034147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2037098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2038230Z ^ 2025-05-07T19:58:47.2038734Z 2025-05-07T19:58:47.2039158Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:47.2039774Z 2025-05-07T19:58:47.2041220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2043722Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2044686Z ^ 2025-05-07T19:58:47.2045000Z 2025-05-07T19:58:47.2046556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2049059Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2050123Z ^ 2025-05-07T19:58:47.2050328Z 2025-05-07T19:58:47.2050810Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:47.2051387Z 2025-05-07T19:58:47.2052825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2055231Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2056358Z ^ 2025-05-07T19:58:47.2056704Z 2025-05-07T19:58:47.2058200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2060935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2062010Z ^ 2025-05-07T19:58:47.2062265Z 2025-05-07T19:58:47.2062671Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:47.2063333Z 2025-05-07T19:58:47.2064899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2067409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2068472Z ^ 2025-05-07T19:58:47.2068839Z 2025-05-07T19:58:47.2070376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2072693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2073788Z ^ 2025-05-07T19:58:47.2074018Z 2025-05-07T19:58:47.2074448Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:47.2075080Z 2025-05-07T19:58:47.2076870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2079463Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2080593Z ^ 2025-05-07T19:58:47.2080925Z 2025-05-07T19:58:47.2082435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2084995Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2085937Z ^ 2025-05-07T19:58:47.2086173Z 2025-05-07T19:58:47.2086588Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:47.2087254Z 2025-05-07T19:58:47.2088718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2091152Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2092185Z ^ 2025-05-07T19:58:47.2092502Z 2025-05-07T19:58:48.8833877Z [297/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o 2025-05-07T19:58:48.8856609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.8859363Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:48.8860651Z ^ 2025-05-07T19:58:48.8861102Z 2025-05-07T19:58:48.8861552Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:48.8862181Z 2025-05-07T19:58:48.8863733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.8866356Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:48.8867546Z ^ 2025-05-07T19:58:48.8867882Z 2025-05-07T19:58:48.8869489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.8872043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:48.8873127Z ^ 2025-05-07T19:58:48.8873395Z 2025-05-07T19:58:48.8873810Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:48.8874451Z 2025-05-07T19:58:48.8875976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.8878575Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:48.8879805Z ^ 2025-05-07T19:58:48.8880135Z 2025-05-07T19:58:48.8881687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.8884274Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:48.8885415Z ^ 2025-05-07T19:58:48.8885662Z 2025-05-07T19:58:48.8886134Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:48.8886827Z 2025-05-07T19:58:48.8888418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.8890960Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:48.8892049Z ^ 2025-05-07T19:58:48.8892387Z 2025-05-07T19:58:48.8893880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.8896515Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:48.8898846Z ^ 2025-05-07T19:58:48.8899107Z 2025-05-07T19:58:48.8899699Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:48.8900349Z 2025-05-07T19:58:48.8902008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.8904482Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:48.8905690Z ^ 2025-05-07T19:58:48.8906031Z 2025-05-07T19:58:48.8907572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.8910140Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:48.8911269Z ^ 2025-05-07T19:58:48.8911513Z 2025-05-07T19:58:48.8911933Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:48.8912638Z 2025-05-07T19:58:48.8914233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.8916839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:48.8917979Z ^ 2025-05-07T19:58:48.8918326Z 2025-05-07T19:58:54.7253297Z [298/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:54.7275320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7278149Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.7279283Z ^ 2025-05-07T19:58:54.7279536Z 2025-05-07T19:58:54.7279971Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.7280617Z 2025-05-07T19:58:54.7282179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7284722Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.7285825Z ^ 2025-05-07T19:58:54.7286185Z 2025-05-07T19:58:54.7287714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7290151Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.7291238Z ^ 2025-05-07T19:58:54.7291473Z 2025-05-07T19:58:54.7291897Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.7292520Z 2025-05-07T19:58:54.7294285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7296821Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.7297943Z ^ 2025-05-07T19:58:54.7298272Z 2025-05-07T19:58:54.7299913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7302415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.7303522Z ^ 2025-05-07T19:58:54.7303745Z 2025-05-07T19:58:54.7304157Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.7304767Z 2025-05-07T19:58:54.7306371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7308868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.7309992Z ^ 2025-05-07T19:58:54.7310327Z 2025-05-07T19:58:54.7311889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7314533Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.7315747Z ^ 2025-05-07T19:58:54.7315972Z 2025-05-07T19:58:54.7316397Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.7317024Z 2025-05-07T19:58:54.7318585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7321245Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.7322588Z ^ 2025-05-07T19:58:54.7322915Z 2025-05-07T19:58:54.7324448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7326950Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.7328024Z ^ 2025-05-07T19:58:54.7328261Z 2025-05-07T19:58:54.7328676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.7329299Z 2025-05-07T19:58:54.7330896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7333420Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.7334564Z ^ 2025-05-07T19:58:54.7334904Z 2025-05-07T19:58:55.6509623Z [299/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:55.6533501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.6536262Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.6537434Z ^ 2025-05-07T19:58:55.6537674Z 2025-05-07T19:58:55.6538124Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.6538746Z 2025-05-07T19:58:55.6540544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.6543213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.6544401Z ^ 2025-05-07T19:58:55.6544754Z 2025-05-07T19:58:55.6546413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.6549007Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.6550184Z ^ 2025-05-07T19:58:55.6550432Z 2025-05-07T19:58:55.6550888Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.6551539Z 2025-05-07T19:58:55.6553183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.6555881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.6557057Z ^ 2025-05-07T19:58:55.6557367Z 2025-05-07T19:58:55.6559025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.6561722Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.6563074Z ^ 2025-05-07T19:58:55.6563342Z 2025-05-07T19:58:55.6563785Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.6564453Z 2025-05-07T19:58:55.6565962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.6568573Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.6569592Z ^ 2025-05-07T19:58:55.6569894Z 2025-05-07T19:58:55.6571638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.6574122Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.6575450Z ^ 2025-05-07T19:58:55.6575698Z 2025-05-07T19:58:55.6576163Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.6576854Z 2025-05-07T19:58:55.6578412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.6581368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.6582682Z ^ 2025-05-07T19:58:55.6583088Z 2025-05-07T19:58:55.6585045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.6587601Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.6588678Z ^ 2025-05-07T19:58:55.6588912Z 2025-05-07T19:58:55.6589315Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.6589902Z 2025-05-07T19:58:55.6591598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.6594132Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.6595244Z ^ 2025-05-07T19:58:55.6595534Z 2025-05-07T19:58:55.8013446Z [300/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o 2025-05-07T19:58:55.8035848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.8038389Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.8039596Z ^ 2025-05-07T19:58:55.8039844Z 2025-05-07T19:58:55.8040274Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.8040877Z 2025-05-07T19:58:55.8042277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.8044726Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.8045865Z ^ 2025-05-07T19:58:55.8046230Z 2025-05-07T19:58:55.8047746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:55.8049426Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:55.8049899Z ^ 2025-05-07T19:58:55.8050156Z 2025-05-07T19:58:55.8051699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:55.8053625Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:55.8054168Z ^ 2025-05-07T19:58:55.8054453Z 2025-05-07T19:58:55.8055862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:55.8057542Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:55.8058025Z ^ 2025-05-07T19:58:55.8058295Z 2025-05-07T19:58:55.8059968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.8062543Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.8063766Z ^ 2025-05-07T19:58:55.8063980Z 2025-05-07T19:58:55.8064358Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.8064937Z 2025-05-07T19:58:55.8068006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.8070597Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.8071721Z ^ 2025-05-07T19:58:55.8072028Z 2025-05-07T19:58:55.8073372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:55.8075113Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:55.8075647Z ^ 2025-05-07T19:58:55.8075922Z 2025-05-07T19:58:55.8077443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:55.8079295Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:55.8079767Z ^ 2025-05-07T19:58:55.8080009Z 2025-05-07T19:58:55.8081318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:55.8082994Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:55.8083518Z ^ 2025-05-07T19:58:55.8083813Z 2025-05-07T19:58:55.8085283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.8087775Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.8088920Z ^ 2025-05-07T19:58:55.8089138Z 2025-05-07T19:58:55.8089527Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.8090090Z 2025-05-07T19:58:55.8091461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.8093693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.8094824Z ^ 2025-05-07T19:58:55.8095128Z 2025-05-07T19:58:55.8096536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:55.8098312Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:55.8098773Z ^ 2025-05-07T19:58:55.8099025Z 2025-05-07T19:58:55.8100490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:55.8102124Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:55.8102596Z ^ 2025-05-07T19:58:55.8102983Z 2025-05-07T19:58:55.8104322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:55.8106103Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:55.8106559Z ^ 2025-05-07T19:58:55.8106919Z 2025-05-07T19:58:55.8108343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.8110740Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.8111735Z ^ 2025-05-07T19:58:55.8111971Z 2025-05-07T19:58:55.8112700Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.8113250Z 2025-05-07T19:58:55.8114576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.8116675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.8117676Z ^ 2025-05-07T19:58:55.8118032Z 2025-05-07T19:58:55.8119415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:55.8121132Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:55.8121575Z ^ 2025-05-07T19:58:55.8121800Z 2025-05-07T19:58:55.8123382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:55.8124939Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:55.8125391Z ^ 2025-05-07T19:58:55.8125627Z 2025-05-07T19:58:55.8126885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:55.8128532Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:55.8128996Z ^ 2025-05-07T19:58:55.8129222Z 2025-05-07T19:58:55.8130635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.8132924Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.8133895Z ^ 2025-05-07T19:58:55.8134121Z 2025-05-07T19:58:55.8134496Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.8135015Z 2025-05-07T19:58:55.8136376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.8138474Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.8139417Z ^ 2025-05-07T19:58:55.8139855Z 2025-05-07T19:58:55.8141174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:55.8143073Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:55.8143498Z ^ 2025-05-07T19:58:55.8143724Z 2025-05-07T19:58:55.8145252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:55.8147041Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:55.8147577Z ^ 2025-05-07T19:58:55.8147810Z 2025-05-07T19:58:55.8149081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:55.8150721Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:55.8151220Z ^ 2025-05-07T19:58:55.8151459Z 2025-05-07T19:58:59.5255899Z [301/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:59.5279213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.5282164Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.5283333Z ^ 2025-05-07T19:58:59.5283572Z 2025-05-07T19:58:59.5284016Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:59.5284948Z 2025-05-07T19:58:59.5286635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.5289324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.5290623Z ^ 2025-05-07T19:58:59.5290992Z 2025-05-07T19:58:59.5292665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.5295361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.5296504Z ^ 2025-05-07T19:58:59.5296765Z 2025-05-07T19:58:59.5297236Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:59.5297882Z 2025-05-07T19:58:59.5299682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.5302418Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.5303525Z ^ 2025-05-07T19:58:59.5303892Z 2025-05-07T19:58:59.5305366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.5308036Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.5309143Z ^ 2025-05-07T19:58:59.5309368Z 2025-05-07T19:58:59.5309813Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:59.5310495Z 2025-05-07T19:58:59.5312149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.5314805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.5315939Z ^ 2025-05-07T19:58:59.5316301Z 2025-05-07T19:58:59.5317838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.5320446Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.5321622Z ^ 2025-05-07T19:58:59.5321849Z 2025-05-07T19:58:59.5322544Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:59.5323206Z 2025-05-07T19:58:59.5324840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.5327738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.5328938Z ^ 2025-05-07T19:58:59.5329311Z 2025-05-07T19:58:59.5331199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.5333843Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.5335138Z ^ 2025-05-07T19:58:59.5335393Z 2025-05-07T19:58:59.5335834Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:59.5336506Z 2025-05-07T19:58:59.5338191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.5340989Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.5342183Z ^ 2025-05-07T19:58:59.5342530Z 2025-05-07T19:59:01.8245589Z [302/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:01.8270344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.8273438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.8274667Z ^ 2025-05-07T19:59:01.8274929Z 2025-05-07T19:59:01.8275402Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:01.8276169Z 2025-05-07T19:59:01.8277927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.8280739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.8281949Z ^ 2025-05-07T19:59:01.8282331Z 2025-05-07T19:59:01.8284051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.8286809Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.8288007Z ^ 2025-05-07T19:59:01.8288276Z 2025-05-07T19:59:01.8288728Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:01.8289422Z 2025-05-07T19:59:01.8291168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.8293934Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.8295114Z ^ 2025-05-07T19:59:01.8295451Z 2025-05-07T19:59:01.8296998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.8299945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.8301176Z ^ 2025-05-07T19:59:01.8301431Z 2025-05-07T19:59:01.8301883Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:01.8302598Z 2025-05-07T19:59:01.8304335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.8307147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.8308374Z ^ 2025-05-07T19:59:01.8308756Z 2025-05-07T19:59:01.8310487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.8313230Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.8314440Z ^ 2025-05-07T19:59:01.8314865Z 2025-05-07T19:59:01.8315325Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:01.8316019Z 2025-05-07T19:59:01.8317863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.8320636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.8321880Z ^ 2025-05-07T19:59:01.8322513Z 2025-05-07T19:59:01.8324381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.8327136Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.8328353Z ^ 2025-05-07T19:59:01.8328608Z 2025-05-07T19:59:01.8329063Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:01.8329754Z 2025-05-07T19:59:01.8331497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.8347631Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.8349019Z ^ 2025-05-07T19:59:01.8349396Z 2025-05-07T19:59:04.3630285Z [303/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:04.3653580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.3656299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.3657709Z ^ 2025-05-07T19:59:04.3657941Z 2025-05-07T19:59:04.3658368Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.3659034Z 2025-05-07T19:59:04.3660878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.3663284Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.3664365Z ^ 2025-05-07T19:59:04.3664711Z 2025-05-07T19:59:04.3666434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.3669044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.3670223Z ^ 2025-05-07T19:59:04.3670483Z 2025-05-07T19:59:04.3670916Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.3671556Z 2025-05-07T19:59:04.3673237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.3675950Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.3677194Z ^ 2025-05-07T19:59:04.3677569Z 2025-05-07T19:59:04.3679280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.3682068Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.3683307Z ^ 2025-05-07T19:59:04.3683560Z 2025-05-07T19:59:04.3684029Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.3684661Z 2025-05-07T19:59:04.3686295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.3688865Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.3690059Z ^ 2025-05-07T19:59:04.3690423Z 2025-05-07T19:59:04.3692049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.3694877Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.3695946Z ^ 2025-05-07T19:59:04.3696198Z 2025-05-07T19:59:04.3696794Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.3697427Z 2025-05-07T19:59:04.3698922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.3701956Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.3703207Z ^ 2025-05-07T19:59:04.3703578Z 2025-05-07T19:59:04.3705149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.3707711Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.3708907Z ^ 2025-05-07T19:59:04.3709158Z 2025-05-07T19:59:04.3709575Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.3710148Z 2025-05-07T19:59:04.3711484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.3713874Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.3714885Z ^ 2025-05-07T19:59:04.3715244Z 2025-05-07T19:59:06.0625245Z [304/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o 2025-05-07T19:59:06.0649262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0652221Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.0653384Z ^ 2025-05-07T19:59:06.0653614Z 2025-05-07T19:59:06.0654029Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.0654719Z 2025-05-07T19:59:06.0656480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0659209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.0660553Z ^ 2025-05-07T19:59:06.0660921Z 2025-05-07T19:59:06.0662525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0665297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.0666510Z ^ 2025-05-07T19:59:06.0666767Z 2025-05-07T19:59:06.0667233Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.0667938Z 2025-05-07T19:59:06.0669660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0672072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.0673234Z ^ 2025-05-07T19:59:06.0673554Z 2025-05-07T19:59:06.0675026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0677853Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.0679099Z ^ 2025-05-07T19:59:06.0679358Z 2025-05-07T19:59:06.0679836Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.0680477Z 2025-05-07T19:59:06.0681976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0684792Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.0686210Z ^ 2025-05-07T19:59:06.0686544Z 2025-05-07T19:59:06.0687953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0690390Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.0691434Z ^ 2025-05-07T19:59:06.0691689Z 2025-05-07T19:59:06.0692131Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.0692745Z 2025-05-07T19:59:06.0694550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0696978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.0698180Z ^ 2025-05-07T19:59:06.0698552Z 2025-05-07T19:59:06.0700309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0702869Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.0703994Z ^ 2025-05-07T19:59:06.0704233Z 2025-05-07T19:59:06.0704673Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.0705333Z 2025-05-07T19:59:06.0706978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0709618Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.0710831Z ^ 2025-05-07T19:59:06.0711209Z 2025-05-07T19:59:06.9686444Z [305/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:06.9709513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.9712052Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.9713062Z ^ 2025-05-07T19:59:06.9713318Z 2025-05-07T19:59:06.9713741Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.9714371Z 2025-05-07T19:59:06.9715838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.9718526Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.9719701Z ^ 2025-05-07T19:59:06.9720060Z 2025-05-07T19:59:06.9721765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.9724801Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.9725933Z ^ 2025-05-07T19:59:06.9726170Z 2025-05-07T19:59:06.9726641Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.9727274Z 2025-05-07T19:59:06.9728881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.9731640Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.9732843Z ^ 2025-05-07T19:59:06.9733216Z 2025-05-07T19:59:06.9734921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.9737450Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.9738616Z ^ 2025-05-07T19:59:06.9738858Z 2025-05-07T19:59:06.9739292Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.9740098Z 2025-05-07T19:59:06.9741724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.9744730Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.9745917Z ^ 2025-05-07T19:59:06.9746498Z 2025-05-07T19:59:06.9748183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.9750880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.9752229Z ^ 2025-05-07T19:59:06.9752473Z 2025-05-07T19:59:06.9752911Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.9753579Z 2025-05-07T19:59:06.9755260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.9757980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.9759154Z ^ 2025-05-07T19:59:06.9759525Z 2025-05-07T19:59:06.9761167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.9763810Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.9765003Z ^ 2025-05-07T19:59:06.9765256Z 2025-05-07T19:59:06.9765666Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.9766320Z 2025-05-07T19:59:06.9767989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.9770618Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.9771715Z ^ 2025-05-07T19:59:06.9772071Z 2025-05-07T19:59:07.1648978Z [306/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:07.1670643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.1673119Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:07.1674128Z ^ 2025-05-07T19:59:07.1674362Z 2025-05-07T19:59:07.1674739Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:07.1675357Z 2025-05-07T19:59:07.1676881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.1679320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:07.1680474Z ^ 2025-05-07T19:59:07.1680836Z 2025-05-07T19:59:07.1682297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.1684758Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:07.1685799Z ^ 2025-05-07T19:59:07.1686041Z 2025-05-07T19:59:07.1686464Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:07.1687097Z 2025-05-07T19:59:07.1688631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.1691111Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:07.1692197Z ^ 2025-05-07T19:59:07.1692534Z 2025-05-07T19:59:07.1693945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.1696259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:07.1697450Z ^ 2025-05-07T19:59:07.1697682Z 2025-05-07T19:59:07.1698089Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:07.1698684Z 2025-05-07T19:59:07.1700564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.1703044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:07.1703995Z ^ 2025-05-07T19:59:07.1704475Z 2025-05-07T19:59:07.1705638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.1707826Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:07.1708739Z ^ 2025-05-07T19:59:07.1708953Z 2025-05-07T19:59:07.1709333Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:07.1709889Z 2025-05-07T19:59:07.1711363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.1713642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:07.1714769Z ^ 2025-05-07T19:59:07.1715042Z 2025-05-07T19:59:07.1716575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.1719014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:07.1719926Z ^ 2025-05-07T19:59:07.1720137Z 2025-05-07T19:59:07.1720545Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:07.1721122Z 2025-05-07T19:59:07.1722934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.1725442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:07.1726543Z ^ 2025-05-07T19:59:07.1726872Z 2025-05-07T19:59:09.6803558Z [307/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:09.6824703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6827094Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.6828163Z ^ 2025-05-07T19:59:09.6828383Z 2025-05-07T19:59:09.6828790Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.6829378Z 2025-05-07T19:59:09.6830930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6833360Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.6834371Z ^ 2025-05-07T19:59:09.6834687Z 2025-05-07T19:59:09.6836087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6838442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.6839440Z ^ 2025-05-07T19:59:09.6839666Z 2025-05-07T19:59:09.6840059Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.6840630Z 2025-05-07T19:59:09.6842155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6844591Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.6845671Z ^ 2025-05-07T19:59:09.6846002Z 2025-05-07T19:59:09.6847335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6849978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.6851041Z ^ 2025-05-07T19:59:09.6851257Z 2025-05-07T19:59:09.6851885Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.6852453Z 2025-05-07T19:59:09.6853952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6856336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.6857336Z ^ 2025-05-07T19:59:09.6857673Z 2025-05-07T19:59:09.6859083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6861626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.6862625Z ^ 2025-05-07T19:59:09.6862861Z 2025-05-07T19:59:09.6863266Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.6863840Z 2025-05-07T19:59:09.6865299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6867631Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.6868714Z ^ 2025-05-07T19:59:09.6869026Z 2025-05-07T19:59:09.6870493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6872899Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.6873834Z ^ 2025-05-07T19:59:09.6874030Z 2025-05-07T19:59:09.6874398Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.6874956Z 2025-05-07T19:59:09.6876478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6879003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.6880033Z ^ 2025-05-07T19:59:09.6880387Z 2025-05-07T19:59:10.1184519Z [308/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:10.1208493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.1211324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.1212535Z ^ 2025-05-07T19:59:10.1212800Z 2025-05-07T19:59:10.1213262Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:10.1213943Z 2025-05-07T19:59:10.1215676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.1218362Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.1219718Z ^ 2025-05-07T19:59:10.1220093Z 2025-05-07T19:59:10.1221789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.1224750Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.1225935Z ^ 2025-05-07T19:59:10.1226186Z 2025-05-07T19:59:10.1226651Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:10.1227288Z 2025-05-07T19:59:10.1228882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.1231605Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.1233051Z ^ 2025-05-07T19:59:10.1233414Z 2025-05-07T19:59:10.1235190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.1237867Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.1239052Z ^ 2025-05-07T19:59:10.1239322Z 2025-05-07T19:59:10.1239850Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:10.1240536Z 2025-05-07T19:59:10.1242274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.1245037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.1246238Z ^ 2025-05-07T19:59:10.1246551Z 2025-05-07T19:59:10.1247785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.1249889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.1250998Z ^ 2025-05-07T19:59:10.1251234Z 2025-05-07T19:59:10.1251635Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:10.1252268Z 2025-05-07T19:59:10.1253904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.1256480Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.1257585Z ^ 2025-05-07T19:59:10.1257929Z 2025-05-07T19:59:10.1259709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.1262431Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.1263639Z ^ 2025-05-07T19:59:10.1263913Z 2025-05-07T19:59:10.1264377Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:10.1265056Z 2025-05-07T19:59:10.1266741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.1269423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.1270641Z ^ 2025-05-07T19:59:10.1271005Z 2025-05-07T19:59:10.3837603Z [309/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:10.3860276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.3863011Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.3864202Z ^ 2025-05-07T19:59:10.3864466Z 2025-05-07T19:59:10.3864925Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:10.3865610Z 2025-05-07T19:59:10.3867321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.3870049Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.3871257Z ^ 2025-05-07T19:59:10.3871646Z 2025-05-07T19:59:10.3873421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.3876190Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.3877400Z ^ 2025-05-07T19:59:10.3877661Z 2025-05-07T19:59:10.3878117Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:10.3878801Z 2025-05-07T19:59:10.3880752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.3883635Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.3884749Z ^ 2025-05-07T19:59:10.3885117Z 2025-05-07T19:59:10.3886829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.3889510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.3890715Z ^ 2025-05-07T19:59:10.3890970Z 2025-05-07T19:59:10.3891434Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:10.3892102Z 2025-05-07T19:59:10.3893832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.3896596Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.3897807Z ^ 2025-05-07T19:59:10.3898173Z 2025-05-07T19:59:10.3899933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.3902640Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.3903838Z ^ 2025-05-07T19:59:10.3904104Z 2025-05-07T19:59:10.3904557Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:10.3905233Z 2025-05-07T19:59:10.3906969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.3909633Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.3910840Z ^ 2025-05-07T19:59:10.3911203Z 2025-05-07T19:59:10.3912874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.3915558Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.3916763Z ^ 2025-05-07T19:59:10.3917018Z 2025-05-07T19:59:10.3917481Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:10.3918148Z 2025-05-07T19:59:10.3919778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.3922667Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.3923866Z ^ 2025-05-07T19:59:10.3924246Z 2025-05-07T19:59:10.4756436Z [310/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:10.4780461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.4783269Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.4784459Z ^ 2025-05-07T19:59:10.4784729Z 2025-05-07T19:59:10.4785252Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:10.4785927Z 2025-05-07T19:59:10.4787652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.4790366Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.4791563Z ^ 2025-05-07T19:59:10.4791929Z 2025-05-07T19:59:10.4793588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.4796353Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.4797712Z ^ 2025-05-07T19:59:10.4797968Z 2025-05-07T19:59:10.4798434Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:10.4799121Z 2025-05-07T19:59:10.4800948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.4803739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.4805024Z ^ 2025-05-07T19:59:10.4805389Z 2025-05-07T19:59:10.4807101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.4809789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.4810997Z ^ 2025-05-07T19:59:10.4811252Z 2025-05-07T19:59:10.4811718Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:10.4812399Z 2025-05-07T19:59:10.4814057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.4816065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.4817025Z ^ 2025-05-07T19:59:10.4817355Z 2025-05-07T19:59:10.4818919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.4821623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.4823039Z ^ 2025-05-07T19:59:10.4823282Z 2025-05-07T19:59:10.4823732Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:10.4824361Z 2025-05-07T19:59:10.4825960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.4828565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.4829760Z ^ 2025-05-07T19:59:10.4830143Z 2025-05-07T19:59:10.4831848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.4834526Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.4835692Z ^ 2025-05-07T19:59:10.4835949Z 2025-05-07T19:59:10.4836397Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:10.4837086Z 2025-05-07T19:59:10.4838780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.4841674Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:10.4842877Z ^ 2025-05-07T19:59:10.4843239Z 2025-05-07T19:59:11.9287467Z [311/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:11.9307544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.9309786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.9310774Z ^ 2025-05-07T19:59:11.9310994Z 2025-05-07T19:59:11.9311387Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:11.9311953Z 2025-05-07T19:59:11.9313365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.9315633Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.9316635Z ^ 2025-05-07T19:59:11.9317185Z 2025-05-07T19:59:11.9318575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.9320959Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.9322193Z ^ 2025-05-07T19:59:11.9322425Z 2025-05-07T19:59:11.9322806Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:11.9323434Z 2025-05-07T19:59:11.9331297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.9333737Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.9334750Z ^ 2025-05-07T19:59:11.9335062Z 2025-05-07T19:59:11.9336433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.9338690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.9339858Z ^ 2025-05-07T19:59:11.9340079Z 2025-05-07T19:59:11.9340461Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:11.9341060Z 2025-05-07T19:59:11.9342498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.9344871Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.9345928Z ^ 2025-05-07T19:59:11.9346232Z 2025-05-07T19:59:11.9347640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.9349933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.9350928Z ^ 2025-05-07T19:59:11.9351141Z 2025-05-07T19:59:11.9351524Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:11.9352142Z 2025-05-07T19:59:11.9353585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.9355914Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.9356924Z ^ 2025-05-07T19:59:11.9357229Z 2025-05-07T19:59:11.9358580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.9360882Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.9361932Z ^ 2025-05-07T19:59:11.9362158Z 2025-05-07T19:59:11.9362557Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:11.9363368Z 2025-05-07T19:59:11.9364876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.9368692Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.9369765Z ^ 2025-05-07T19:59:11.9370069Z 2025-05-07T19:59:17.3400502Z [312/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:17.3423444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3426250Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3427382Z ^ 2025-05-07T19:59:17.3427633Z 2025-05-07T19:59:17.3428122Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:17.3428772Z 2025-05-07T19:59:17.3430369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3433248Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3434399Z ^ 2025-05-07T19:59:17.3434788Z 2025-05-07T19:59:17.3436480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3438916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3440267Z ^ 2025-05-07T19:59:17.3440503Z 2025-05-07T19:59:17.3441143Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:17.3441802Z 2025-05-07T19:59:17.3443499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3446143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3447274Z ^ 2025-05-07T19:59:17.3447660Z 2025-05-07T19:59:17.3449181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3451847Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3452912Z ^ 2025-05-07T19:59:17.3453172Z 2025-05-07T19:59:17.3453599Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:17.3454297Z 2025-05-07T19:59:17.3455875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3458345Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3459463Z ^ 2025-05-07T19:59:17.3459978Z 2025-05-07T19:59:17.3461528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3464191Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3465284Z ^ 2025-05-07T19:59:17.3465521Z 2025-05-07T19:59:17.3465971Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:17.3466655Z 2025-05-07T19:59:17.3468247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3470860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3472055Z ^ 2025-05-07T19:59:17.3472421Z 2025-05-07T19:59:17.3474027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3476721Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3477825Z ^ 2025-05-07T19:59:17.3478079Z 2025-05-07T19:59:17.3478461Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:17.3479074Z 2025-05-07T19:59:17.3480817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3483507Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3484795Z ^ 2025-05-07T19:59:17.3485235Z 2025-05-07T19:59:18.3778450Z [313/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:59:18.3802402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3805173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.3806396Z ^ 2025-05-07T19:59:18.3806651Z 2025-05-07T19:59:18.3807072Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.3808135Z 2025-05-07T19:59:18.3809861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3812855Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.3814106Z ^ 2025-05-07T19:59:18.3814498Z 2025-05-07T19:59:18.3816334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3819296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.3820637Z ^ 2025-05-07T19:59:18.3820907Z 2025-05-07T19:59:18.3821374Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.3822299Z 2025-05-07T19:59:18.3824090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3826800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.3828000Z ^ 2025-05-07T19:59:18.3828379Z 2025-05-07T19:59:18.3830071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3832824Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.3834045Z ^ 2025-05-07T19:59:18.3834287Z 2025-05-07T19:59:18.3834698Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.3835387Z 2025-05-07T19:59:18.3837083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3839814Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.3841061Z ^ 2025-05-07T19:59:18.3841450Z 2025-05-07T19:59:18.3843170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3845867Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.3847057Z ^ 2025-05-07T19:59:18.3847329Z 2025-05-07T19:59:18.3847778Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.3848461Z 2025-05-07T19:59:18.3850182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3852851Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.3854060Z ^ 2025-05-07T19:59:18.3854731Z 2025-05-07T19:59:18.3856453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3859267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.3860637Z ^ 2025-05-07T19:59:18.3860897Z 2025-05-07T19:59:18.3861356Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.3862036Z 2025-05-07T19:59:18.3863759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3866610Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.3867804Z ^ 2025-05-07T19:59:18.3868187Z 2025-05-07T19:59:20.5730925Z [314/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o 2025-05-07T19:59:20.5754717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.5757896Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.5759102Z ^ 2025-05-07T19:59:20.5759354Z 2025-05-07T19:59:20.5759803Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:20.5760494Z 2025-05-07T19:59:20.5762286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.5765093Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.5766347Z ^ 2025-05-07T19:59:20.5766698Z 2025-05-07T19:59:20.5768518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.5771268Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.5772460Z ^ 2025-05-07T19:59:20.5772728Z 2025-05-07T19:59:20.5773191Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:20.5773854Z 2025-05-07T19:59:20.5775581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.5778327Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.5779701Z ^ 2025-05-07T19:59:20.5780037Z 2025-05-07T19:59:20.5781614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.5784278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.5785462Z ^ 2025-05-07T19:59:20.5785709Z 2025-05-07T19:59:20.5786141Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:20.5786830Z 2025-05-07T19:59:20.5788533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.5791333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.5792552Z ^ 2025-05-07T19:59:20.5792935Z 2025-05-07T19:59:20.5794628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.5797409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.5798619Z ^ 2025-05-07T19:59:20.5798869Z 2025-05-07T19:59:20.5799345Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:20.5799905Z 2025-05-07T19:59:20.5801525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.5804409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.5805669Z ^ 2025-05-07T19:59:20.5806045Z 2025-05-07T19:59:20.5807861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.5810582Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.5811884Z ^ 2025-05-07T19:59:20.5812149Z 2025-05-07T19:59:20.5812692Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:20.5813382Z 2025-05-07T19:59:20.5815158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.5817920Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.5819086Z ^ 2025-05-07T19:59:20.5819438Z 2025-05-07T19:59:30.6264827Z [315/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:30.6287647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6290531Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6291645Z ^ 2025-05-07T19:59:30.6291893Z 2025-05-07T19:59:30.6292321Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:30.6292942Z 2025-05-07T19:59:30.6294889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6297381Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6298451Z ^ 2025-05-07T19:59:30.6298773Z 2025-05-07T19:59:30.6300363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6302781Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6303843Z ^ 2025-05-07T19:59:30.6304064Z 2025-05-07T19:59:30.6304490Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:30.6305097Z 2025-05-07T19:59:30.6306515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6308914Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6310055Z ^ 2025-05-07T19:59:30.6310424Z 2025-05-07T19:59:30.6311854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6314167Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6315207Z ^ 2025-05-07T19:59:30.6315450Z 2025-05-07T19:59:30.6315902Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:30.6316547Z 2025-05-07T19:59:30.6318090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6320564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6321720Z ^ 2025-05-07T19:59:30.6322373Z 2025-05-07T19:59:30.6324039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6326651Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6327874Z ^ 2025-05-07T19:59:30.6328134Z 2025-05-07T19:59:30.6328873Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:30.6329508Z 2025-05-07T19:59:30.6331158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6334060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6335260Z ^ 2025-05-07T19:59:30.6335627Z 2025-05-07T19:59:30.6337268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6340179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6341394Z ^ 2025-05-07T19:59:30.6341653Z 2025-05-07T19:59:30.6342130Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:30.6342813Z 2025-05-07T19:59:30.6344469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6347051Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6348260Z ^ 2025-05-07T19:59:30.6348601Z 2025-05-07T19:59:33.5258615Z [316/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:33.5282045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.5284567Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.5285932Z ^ 2025-05-07T19:59:33.5286174Z 2025-05-07T19:59:33.5286784Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:33.5287445Z 2025-05-07T19:59:33.5289057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.5291685Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.5292889Z ^ 2025-05-07T19:59:33.5293267Z 2025-05-07T19:59:33.5294856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.5297500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.5298649Z ^ 2025-05-07T19:59:33.5298944Z 2025-05-07T19:59:33.5299371Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:33.5300154Z 2025-05-07T19:59:33.5301772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.5304287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.5305464Z ^ 2025-05-07T19:59:33.5305798Z 2025-05-07T19:59:33.5307411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.5309903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.5311011Z ^ 2025-05-07T19:59:33.5311267Z 2025-05-07T19:59:33.5311693Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:33.5312285Z 2025-05-07T19:59:33.5313735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.5316228Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.5317198Z ^ 2025-05-07T19:59:33.5317492Z 2025-05-07T19:59:33.5318928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.5321370Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.5322733Z ^ 2025-05-07T19:59:33.5322987Z 2025-05-07T19:59:33.5323617Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:33.5324250Z 2025-05-07T19:59:33.5325773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.5328226Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.5329525Z ^ 2025-05-07T19:59:33.5329866Z 2025-05-07T19:59:33.5331427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.5333977Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.5335108Z ^ 2025-05-07T19:59:33.5335346Z 2025-05-07T19:59:33.5335761Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:33.5336413Z 2025-05-07T19:59:33.5337903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.5340349Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.5341429Z ^ 2025-05-07T19:59:33.5341778Z 2025-05-07T19:59:36.4068101Z [317/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:36.4090998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.4093597Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.4094748Z ^ 2025-05-07T19:59:36.4095001Z 2025-05-07T19:59:36.4095444Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:36.4096105Z 2025-05-07T19:59:36.4097636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.4100196Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.4101348Z ^ 2025-05-07T19:59:36.4101708Z 2025-05-07T19:59:36.4103304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.4105879Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.4106998Z ^ 2025-05-07T19:59:36.4107243Z 2025-05-07T19:59:36.4107679Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:36.4108302Z 2025-05-07T19:59:36.4109998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.4112539Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.4113665Z ^ 2025-05-07T19:59:36.4114015Z 2025-05-07T19:59:36.4115470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.4118071Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.4119232Z ^ 2025-05-07T19:59:36.4119473Z 2025-05-07T19:59:36.4119907Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:36.4120574Z 2025-05-07T19:59:36.4122330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.4125012Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.4126376Z ^ 2025-05-07T19:59:36.4126737Z 2025-05-07T19:59:36.4128420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.4130988Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.4132087Z ^ 2025-05-07T19:59:36.4132327Z 2025-05-07T19:59:36.4132760Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:36.4133448Z 2025-05-07T19:59:36.4135051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.4137593Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.4138669Z ^ 2025-05-07T19:59:36.4139034Z 2025-05-07T19:59:36.4140883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.4143392Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.4144517Z ^ 2025-05-07T19:59:36.4144743Z 2025-05-07T19:59:36.4145147Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:36.4145786Z 2025-05-07T19:59:36.4147420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.4149951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.4151038Z ^ 2025-05-07T19:59:36.4151363Z 2025-05-07T19:59:37.6294994Z [318/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:37.6320283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.6323461Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:37.6324657Z ^ 2025-05-07T19:59:37.6324913Z 2025-05-07T19:59:37.6325379Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:37.6326073Z 2025-05-07T19:59:37.6327623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.6330059Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:37.6331230Z ^ 2025-05-07T19:59:37.6331586Z 2025-05-07T19:59:37.6333210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.6335656Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:37.6336855Z ^ 2025-05-07T19:59:37.6337101Z 2025-05-07T19:59:37.6337572Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:37.6338222Z 2025-05-07T19:59:37.6340023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.6342831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:37.6344071Z ^ 2025-05-07T19:59:37.6344439Z 2025-05-07T19:59:37.6346019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.6348658Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:37.6349861Z ^ 2025-05-07T19:59:37.6350115Z 2025-05-07T19:59:37.6350569Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:37.6351251Z 2025-05-07T19:59:37.6353223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.6356095Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:37.6357268Z ^ 2025-05-07T19:59:37.6357620Z 2025-05-07T19:59:37.6359174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.6362096Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:37.6363278Z ^ 2025-05-07T19:59:37.6363534Z 2025-05-07T19:59:37.6364001Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:37.6364676Z 2025-05-07T19:59:37.6366396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.6369176Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:37.6370364Z ^ 2025-05-07T19:59:37.6370749Z 2025-05-07T19:59:37.6372419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.6374893Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:37.6376015Z ^ 2025-05-07T19:59:37.6376279Z 2025-05-07T19:59:37.6376718Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:37.6377367Z 2025-05-07T19:59:37.6379085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.6382026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:37.6383080Z ^ 2025-05-07T19:59:37.6383422Z 2025-05-07T19:59:45.6443668Z [319/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:45.6467227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6469891Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.6471021Z ^ 2025-05-07T19:59:45.6471292Z 2025-05-07T19:59:45.6471744Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:45.6472412Z 2025-05-07T19:59:45.6474162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6477003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.6478185Z ^ 2025-05-07T19:59:45.6478551Z 2025-05-07T19:59:45.6480219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6482959Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.6484125Z ^ 2025-05-07T19:59:45.6484369Z 2025-05-07T19:59:45.6484839Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:45.6485483Z 2025-05-07T19:59:45.6487181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6489967Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.6491219Z ^ 2025-05-07T19:59:45.6491596Z 2025-05-07T19:59:45.6493266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6496019Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.6497403Z ^ 2025-05-07T19:59:45.6497659Z 2025-05-07T19:59:45.6498101Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:45.6498761Z 2025-05-07T19:59:45.6500733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6503489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.6504784Z ^ 2025-05-07T19:59:45.6505139Z 2025-05-07T19:59:45.6506975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6509766Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.6511003Z ^ 2025-05-07T19:59:45.6511255Z 2025-05-07T19:59:45.6511702Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:45.6512376Z 2025-05-07T19:59:45.6514094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6516884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.6518097Z ^ 2025-05-07T19:59:45.6518477Z 2025-05-07T19:59:45.6520193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6523014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.6524184Z ^ 2025-05-07T19:59:45.6524435Z 2025-05-07T19:59:45.6524895Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:45.6525549Z 2025-05-07T19:59:45.6527102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6529767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.6530864Z ^ 2025-05-07T19:59:45.6531129Z 2025-05-07T19:59:49.7678978Z [320/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:59:49.7701969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:49.7704886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:49.7706101Z ^ 2025-05-07T19:59:49.7706365Z 2025-05-07T19:59:49.7706781Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:49.7707445Z 2025-05-07T19:59:49.7708980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:49.7711709Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:49.7712945Z ^ 2025-05-07T19:59:49.7713308Z 2025-05-07T19:59:49.7715011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:49.7717789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:49.7718919Z ^ 2025-05-07T19:59:49.7719153Z 2025-05-07T19:59:49.7719582Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:49.7720241Z 2025-05-07T19:59:49.7721912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:49.7724785Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:49.7725923Z ^ 2025-05-07T19:59:49.7726257Z 2025-05-07T19:59:49.7727794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:49.7730694Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:49.7731992Z ^ 2025-05-07T19:59:49.7732274Z 2025-05-07T19:59:49.7732716Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:49.7733332Z 2025-05-07T19:59:49.7734903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:49.7737876Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:49.7739045Z ^ 2025-05-07T19:59:49.7739376Z 2025-05-07T19:59:49.7741107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:49.7743776Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:49.7744944Z ^ 2025-05-07T19:59:49.7745196Z 2025-05-07T19:59:49.7745644Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:49.7746323Z 2025-05-07T19:59:49.7748061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:49.7750767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:49.7751938Z ^ 2025-05-07T19:59:49.7752286Z 2025-05-07T19:59:49.7753834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:49.7756486Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:49.7757580Z ^ 2025-05-07T19:59:49.7757827Z 2025-05-07T19:59:49.7758264Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:49.7758881Z 2025-05-07T19:59:49.7760555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:49.7763220Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:49.7764455Z ^ 2025-05-07T19:59:49.7764805Z 2025-05-07T20:00:05.3328782Z [321/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o 2025-05-07T20:00:05.3352404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.3355141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.3356361Z ^ 2025-05-07T20:00:05.3356636Z 2025-05-07T20:00:05.3357097Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:05.3357714Z 2025-05-07T20:00:05.3359447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.3362097Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.3363349Z ^ 2025-05-07T20:00:05.3363707Z 2025-05-07T20:00:05.3365126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3367254Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:05.3368031Z ^ 2025-05-07T20:00:05.3368322Z 2025-05-07T20:00:05.3369933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3371832Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.3372347Z ^ 2025-05-07T20:00:05.3372846Z 2025-05-07T20:00:05.3374334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3376225Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.3376726Z ^ 2025-05-07T20:00:05.3377103Z 2025-05-07T20:00:05.3378552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3380595Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.3381180Z ^ 2025-05-07T20:00:05.3381439Z 2025-05-07T20:00:05.3383152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.3385659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.3386857Z ^ 2025-05-07T20:00:05.3387112Z 2025-05-07T20:00:05.3387595Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:05.3388270Z 2025-05-07T20:00:05.3389966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.3392576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.3393786Z ^ 2025-05-07T20:00:05.3394144Z 2025-05-07T20:00:05.3395663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3397806Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:05.3398542Z ^ 2025-05-07T20:00:05.3398824Z 2025-05-07T20:00:05.3400351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3402300Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.3402814Z ^ 2025-05-07T20:00:05.3403083Z 2025-05-07T20:00:05.3404609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3406527Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.3407077Z ^ 2025-05-07T20:00:05.3407359Z 2025-05-07T20:00:05.3408872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3410697Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.3411208Z ^ 2025-05-07T20:00:05.3411455Z 2025-05-07T20:00:05.3413038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.3415594Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.3416899Z ^ 2025-05-07T20:00:05.3417131Z 2025-05-07T20:00:05.3417532Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:05.3418212Z 2025-05-07T20:00:05.3420127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.3423048Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.3424432Z ^ 2025-05-07T20:00:05.3424806Z 2025-05-07T20:00:05.3426461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3428596Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:05.3429252Z ^ 2025-05-07T20:00:05.3429550Z 2025-05-07T20:00:05.3431070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3432939Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.3433451Z ^ 2025-05-07T20:00:05.3433703Z 2025-05-07T20:00:05.3435247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3437199Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.3437753Z ^ 2025-05-07T20:00:05.3438030Z 2025-05-07T20:00:05.3439547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3441434Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.3441933Z ^ 2025-05-07T20:00:05.3442208Z 2025-05-07T20:00:05.3443877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.3446585Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.3447773Z ^ 2025-05-07T20:00:05.3448013Z 2025-05-07T20:00:05.3448434Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:05.3449024Z 2025-05-07T20:00:05.3450639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.3453454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.3454669Z ^ 2025-05-07T20:00:05.3455035Z 2025-05-07T20:00:05.3456449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3458512Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:05.3459238Z ^ 2025-05-07T20:00:05.3459928Z 2025-05-07T20:00:05.3461367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3463207Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.3463942Z ^ 2025-05-07T20:00:05.3464198Z 2025-05-07T20:00:05.3465590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3467567Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.3468107Z ^ 2025-05-07T20:00:05.3468368Z 2025-05-07T20:00:05.3469907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3493722Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.3494366Z ^ 2025-05-07T20:00:05.3494617Z 2025-05-07T20:00:05.3496204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.3498759Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.3499994Z ^ 2025-05-07T20:00:05.3500237Z 2025-05-07T20:00:05.3500631Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:05.3501326Z 2025-05-07T20:00:05.3502931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.3505637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.3506789Z ^ 2025-05-07T20:00:05.3507157Z 2025-05-07T20:00:05.3508717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3510837Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:05.3511572Z ^ 2025-05-07T20:00:05.3511857Z 2025-05-07T20:00:05.3513265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3515148Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.3515689Z ^ 2025-05-07T20:00:05.3515961Z 2025-05-07T20:00:05.3517456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3519361Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.3519900Z ^ 2025-05-07T20:00:05.3520181Z 2025-05-07T20:00:05.3521708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.3523908Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.3524443Z ^ 2025-05-07T20:00:05.3525007Z 2025-05-07T20:00:05.5746265Z [322/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o 2025-05-07T20:00:05.5770392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.5773015Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.5774231Z ^ 2025-05-07T20:00:05.5774479Z 2025-05-07T20:00:05.5774915Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:05.5775599Z 2025-05-07T20:00:05.5777240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.5779805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.5781037Z ^ 2025-05-07T20:00:05.5781405Z 2025-05-07T20:00:05.5783126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5785704Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:05.5786452Z ^ 2025-05-07T20:00:05.5786717Z 2025-05-07T20:00:05.5788324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5790229Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.5790743Z ^ 2025-05-07T20:00:05.5791028Z 2025-05-07T20:00:05.5792685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5794636Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.5795181Z ^ 2025-05-07T20:00:05.5795435Z 2025-05-07T20:00:05.5796972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5798637Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.5799177Z ^ 2025-05-07T20:00:05.5799416Z 2025-05-07T20:00:05.5800922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.5803472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.5804560Z ^ 2025-05-07T20:00:05.5804799Z 2025-05-07T20:00:05.5805213Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:05.5805841Z 2025-05-07T20:00:05.5807402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.5809866Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.5810952Z ^ 2025-05-07T20:00:05.5811302Z 2025-05-07T20:00:05.5812791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5814926Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:05.5815648Z ^ 2025-05-07T20:00:05.5815918Z 2025-05-07T20:00:05.5817396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5819244Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.5819902Z ^ 2025-05-07T20:00:05.5820123Z 2025-05-07T20:00:05.5821563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5823786Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.5824360Z ^ 2025-05-07T20:00:05.5824639Z 2025-05-07T20:00:05.5826258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5828622Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.5829185Z ^ 2025-05-07T20:00:05.5829462Z 2025-05-07T20:00:05.5831293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.5834092Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.5835374Z ^ 2025-05-07T20:00:05.5835645Z 2025-05-07T20:00:05.5836109Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:05.5836864Z 2025-05-07T20:00:05.5838620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.5841259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.5842430Z ^ 2025-05-07T20:00:05.5842770Z 2025-05-07T20:00:05.5844256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5846336Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:05.5847107Z ^ 2025-05-07T20:00:05.5847379Z 2025-05-07T20:00:05.5848942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5850849Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.5851388Z ^ 2025-05-07T20:00:05.5851661Z 2025-05-07T20:00:05.5853201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5855156Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.5855675Z ^ 2025-05-07T20:00:05.5855944Z 2025-05-07T20:00:05.5857462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5859357Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.5860062Z ^ 2025-05-07T20:00:05.5860348Z 2025-05-07T20:00:05.5861979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.5864631Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.5865744Z ^ 2025-05-07T20:00:05.5865962Z 2025-05-07T20:00:05.5866374Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:05.5867036Z 2025-05-07T20:00:05.5868718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.5871352Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.5872743Z ^ 2025-05-07T20:00:05.5873088Z 2025-05-07T20:00:05.5874793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5876953Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:05.5877641Z ^ 2025-05-07T20:00:05.5877912Z 2025-05-07T20:00:05.5879654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5881497Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.5882010Z ^ 2025-05-07T20:00:05.5882283Z 2025-05-07T20:00:05.5883749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5885542Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.5886042Z ^ 2025-05-07T20:00:05.5886298Z 2025-05-07T20:00:05.5887898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5889824Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.5890339Z ^ 2025-05-07T20:00:05.5890608Z 2025-05-07T20:00:05.5892233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.5894931Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.5896112Z ^ 2025-05-07T20:00:05.5896359Z 2025-05-07T20:00:05.5896821Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:05.5897483Z 2025-05-07T20:00:05.5899128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:05.5901945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:05.5903101Z ^ 2025-05-07T20:00:05.5903449Z 2025-05-07T20:00:05.5905021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5907038Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:05.5907733Z ^ 2025-05-07T20:00:05.5908014Z 2025-05-07T20:00:05.5909543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5911333Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.5911860Z ^ 2025-05-07T20:00:05.5912136Z 2025-05-07T20:00:05.5913703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5915771Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.5916308Z ^ 2025-05-07T20:00:05.5916549Z 2025-05-07T20:00:05.5918159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:05.5920091Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:05.5920608Z ^ 2025-05-07T20:00:05.5920955Z 2025-05-07T20:00:06.6649013Z [323/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_inference.so -o fbgemm_gpu_tbe_inference.so CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed asmjit.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:00:11.3442127Z [324/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o 2025-05-07T20:00:11.3465858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3468562Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:11.3469778Z ^ 2025-05-07T20:00:11.3470170Z 2025-05-07T20:00:11.3470605Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:11.3471282Z 2025-05-07T20:00:11.3473010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3475782Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:11.3476918Z ^ 2025-05-07T20:00:11.3477285Z 2025-05-07T20:00:11.3478952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3481565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:11.3482712Z ^ 2025-05-07T20:00:11.3482961Z 2025-05-07T20:00:11.3483388Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:11.3483996Z 2025-05-07T20:00:11.3485712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3488465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:11.3489652Z ^ 2025-05-07T20:00:11.3490022Z 2025-05-07T20:00:11.3491673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3494340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:11.3495473Z ^ 2025-05-07T20:00:11.3495730Z 2025-05-07T20:00:11.3496179Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:11.3496815Z 2025-05-07T20:00:11.3498483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3501351Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:11.3502522Z ^ 2025-05-07T20:00:11.3502868Z 2025-05-07T20:00:11.3504554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3507180Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:11.3508630Z ^ 2025-05-07T20:00:11.3508895Z 2025-05-07T20:00:11.3509375Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:11.3510021Z 2025-05-07T20:00:11.3511744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3514474Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:11.3515805Z ^ 2025-05-07T20:00:11.3516165Z 2025-05-07T20:00:11.3517848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3520222Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:11.3521034Z ^ 2025-05-07T20:00:11.3521288Z 2025-05-07T20:00:11.3521689Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:11.3522569Z 2025-05-07T20:00:11.3524029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3526659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:11.3527679Z ^ 2025-05-07T20:00:11.3527967Z 2025-05-07T20:00:21.4084313Z [325/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o 2025-05-07T20:00:21.4107924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.4110902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.4112133Z ^ 2025-05-07T20:00:21.4112411Z 2025-05-07T20:00:21.4112862Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:21.4113542Z 2025-05-07T20:00:21.4115204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.4117889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.4119017Z ^ 2025-05-07T20:00:21.4119366Z 2025-05-07T20:00:21.4120974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.4124019Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.4125219Z ^ 2025-05-07T20:00:21.4125458Z 2025-05-07T20:00:21.4125877Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:21.4126517Z 2025-05-07T20:00:21.4128139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.4130777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.4131911Z ^ 2025-05-07T20:00:21.4132300Z 2025-05-07T20:00:21.4133938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.4136625Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.4137792Z ^ 2025-05-07T20:00:21.4138045Z 2025-05-07T20:00:21.4138511Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:21.4139163Z 2025-05-07T20:00:21.4140895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.4143595Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.4144859Z ^ 2025-05-07T20:00:21.4145218Z 2025-05-07T20:00:21.4146883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.4149332Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.4150162Z ^ 2025-05-07T20:00:21.4150531Z 2025-05-07T20:00:21.4150931Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:21.4151588Z 2025-05-07T20:00:21.4153054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.4155778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.4156973Z ^ 2025-05-07T20:00:21.4157252Z 2025-05-07T20:00:21.4158859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.4161499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.4162701Z ^ 2025-05-07T20:00:21.4162950Z 2025-05-07T20:00:21.4163401Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:21.4164006Z 2025-05-07T20:00:21.4165710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.4168521Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.4169793Z ^ 2025-05-07T20:00:21.4170169Z 2025-05-07T20:00:29.0631862Z [326/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o 2025-05-07T20:00:29.0644112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0645514Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.0646141Z ^ 2025-05-07T20:00:29.0646283Z 2025-05-07T20:00:29.0646530Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:29.0646900Z 2025-05-07T20:00:29.0647754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0649155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.0649770Z ^ 2025-05-07T20:00:29.0649976Z 2025-05-07T20:00:29.0650818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0652209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.0652815Z ^ 2025-05-07T20:00:29.0652952Z 2025-05-07T20:00:29.0653201Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:29.0653546Z 2025-05-07T20:00:29.0654400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0655801Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.0656430Z ^ 2025-05-07T20:00:29.0656621Z 2025-05-07T20:00:29.0657458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0658845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.0659459Z ^ 2025-05-07T20:00:29.0659594Z 2025-05-07T20:00:29.0659998Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:29.0660369Z 2025-05-07T20:00:29.0661235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0662715Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.0663331Z ^ 2025-05-07T20:00:29.0663520Z 2025-05-07T20:00:29.0664410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0665775Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.0666387Z ^ 2025-05-07T20:00:29.0668855Z 2025-05-07T20:00:29.0669112Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:29.0669459Z 2025-05-07T20:00:29.0670394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0671794Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.0672427Z ^ 2025-05-07T20:00:29.0672621Z 2025-05-07T20:00:29.0673469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0674836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.0675445Z ^ 2025-05-07T20:00:29.0675594Z 2025-05-07T20:00:29.0675830Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:29.0676182Z 2025-05-07T20:00:29.0677056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0678442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.0679074Z ^ 2025-05-07T20:00:29.0679265Z 2025-05-07T20:00:32.4658611Z [327/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o 2025-05-07T20:00:32.4683001Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.4685491Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.4686528Z ^ 2025-05-07T20:00:32.4686783Z 2025-05-07T20:00:32.4687194Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.4687811Z 2025-05-07T20:00:32.4689433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.4692092Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.4693222Z ^ 2025-05-07T20:00:32.4693562Z 2025-05-07T20:00:32.4695168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.4697797Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.4698948Z ^ 2025-05-07T20:00:32.4699196Z 2025-05-07T20:00:32.4699784Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.4700441Z 2025-05-07T20:00:32.4702031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.4704687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.4705913Z ^ 2025-05-07T20:00:32.4706283Z 2025-05-07T20:00:32.4708005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.4710727Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.4711878Z ^ 2025-05-07T20:00:32.4712112Z 2025-05-07T20:00:32.4712559Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.4713475Z 2025-05-07T20:00:32.4715098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.4717721Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.4718891Z ^ 2025-05-07T20:00:32.4719233Z 2025-05-07T20:00:32.4720932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.4724285Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.4725504Z ^ 2025-05-07T20:00:32.4725780Z 2025-05-07T20:00:32.4726208Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.4726820Z 2025-05-07T20:00:32.4728553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.4731256Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.4732483Z ^ 2025-05-07T20:00:32.4732853Z 2025-05-07T20:00:32.4734582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.4737369Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.4738523Z ^ 2025-05-07T20:00:32.4738775Z 2025-05-07T20:00:32.4739235Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.4739978Z 2025-05-07T20:00:32.4741719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.4744556Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.4745801Z ^ 2025-05-07T20:00:32.4746186Z 2025-05-07T20:00:32.9051799Z [328/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:00:32.9074007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9076610Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9077676Z ^ 2025-05-07T20:00:32.9077939Z 2025-05-07T20:00:32.9078349Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.9078978Z 2025-05-07T20:00:32.9080569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9083072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9084183Z ^ 2025-05-07T20:00:32.9084543Z 2025-05-07T20:00:32.9086174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9088519Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9089607Z ^ 2025-05-07T20:00:32.9089858Z 2025-05-07T20:00:32.9090269Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.9090872Z 2025-05-07T20:00:32.9092338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9094775Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9095912Z ^ 2025-05-07T20:00:32.9096251Z 2025-05-07T20:00:32.9097758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9100691Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9101795Z ^ 2025-05-07T20:00:32.9102051Z 2025-05-07T20:00:32.9102455Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.9103048Z 2025-05-07T20:00:32.9106315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9108835Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9110092Z ^ 2025-05-07T20:00:32.9110543Z 2025-05-07T20:00:32.9112063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9114561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9115620Z ^ 2025-05-07T20:00:32.9115869Z 2025-05-07T20:00:32.9116245Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.9116853Z 2025-05-07T20:00:32.9118638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9121122Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9122533Z ^ 2025-05-07T20:00:32.9122885Z 2025-05-07T20:00:32.9124370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9126771Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9127857Z ^ 2025-05-07T20:00:32.9128124Z 2025-05-07T20:00:32.9128517Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.9129126Z 2025-05-07T20:00:32.9130654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9133106Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9134384Z ^ 2025-05-07T20:00:32.9134698Z 2025-05-07T20:00:33.5042538Z [329/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:00:33.5062815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.5065373Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.5066409Z ^ 2025-05-07T20:00:33.5066637Z 2025-05-07T20:00:33.5067035Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:33.5067673Z 2025-05-07T20:00:33.5069139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.5071233Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.5072341Z ^ 2025-05-07T20:00:33.5072701Z 2025-05-07T20:00:33.5074148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.5076390Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.5077481Z ^ 2025-05-07T20:00:33.5077736Z 2025-05-07T20:00:33.5078126Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:33.5078692Z 2025-05-07T20:00:33.5080071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.5082103Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.5083373Z ^ 2025-05-07T20:00:33.5083640Z 2025-05-07T20:00:33.5084808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.5087367Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.5088393Z ^ 2025-05-07T20:00:33.5088611Z 2025-05-07T20:00:33.5088935Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:33.5089600Z 2025-05-07T20:00:33.5091115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.5093530Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.5094635Z ^ 2025-05-07T20:00:33.5094958Z 2025-05-07T20:00:33.5096555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.5098937Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.5100205Z ^ 2025-05-07T20:00:33.5100444Z 2025-05-07T20:00:33.5100804Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:33.5101275Z 2025-05-07T20:00:33.5102790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.5105300Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.5106398Z ^ 2025-05-07T20:00:33.5106685Z 2025-05-07T20:00:33.5108085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.5110503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.5111467Z ^ 2025-05-07T20:00:33.5111647Z 2025-05-07T20:00:33.5111982Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:33.5112502Z 2025-05-07T20:00:33.5113807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.5116314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.5117383Z ^ 2025-05-07T20:00:33.5117687Z 2025-05-07T20:00:36.2359362Z [330/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o 2025-05-07T20:00:36.2383688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.2386478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.2387597Z ^ 2025-05-07T20:00:36.2387840Z 2025-05-07T20:00:36.2388282Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.2388978Z 2025-05-07T20:00:36.2390671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.2393521Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.2394768Z ^ 2025-05-07T20:00:36.2395158Z 2025-05-07T20:00:36.2396908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.2399639Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.2400857Z ^ 2025-05-07T20:00:36.2401121Z 2025-05-07T20:00:36.2401578Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.2402245Z 2025-05-07T20:00:36.2404007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.2406942Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.2408142Z ^ 2025-05-07T20:00:36.2408504Z 2025-05-07T20:00:36.2410292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.2413036Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.2414342Z ^ 2025-05-07T20:00:36.2414590Z 2025-05-07T20:00:36.2415085Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.2415795Z 2025-05-07T20:00:36.2417552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.2420503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.2421733Z ^ 2025-05-07T20:00:36.2422377Z 2025-05-07T20:00:36.2424115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.2426939Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.2428159Z ^ 2025-05-07T20:00:36.2428419Z 2025-05-07T20:00:36.2428892Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.2429569Z 2025-05-07T20:00:36.2431281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.2434098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.2435348Z ^ 2025-05-07T20:00:36.2435718Z 2025-05-07T20:00:36.2437459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.2440266Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.2441491Z ^ 2025-05-07T20:00:36.2441745Z 2025-05-07T20:00:36.2442201Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.2442908Z 2025-05-07T20:00:36.2444644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.2447474Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.2448710Z ^ 2025-05-07T20:00:36.2449082Z 2025-05-07T20:00:41.1092505Z [331/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o 2025-05-07T20:00:41.1116517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1119220Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:41.1120448Z ^ 2025-05-07T20:00:41.1120703Z 2025-05-07T20:00:41.1121149Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:41.1121735Z 2025-05-07T20:00:41.1123581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1126442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:41.1127689Z ^ 2025-05-07T20:00:41.1128060Z 2025-05-07T20:00:41.1129766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1132517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:41.1133642Z ^ 2025-05-07T20:00:41.1134410Z 2025-05-07T20:00:41.1134867Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:41.1135501Z 2025-05-07T20:00:41.1137358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1140205Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:41.1141452Z ^ 2025-05-07T20:00:41.1141825Z 2025-05-07T20:00:41.1143598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1146407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:41.1147645Z ^ 2025-05-07T20:00:41.1147899Z 2025-05-07T20:00:41.1148353Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:41.1149050Z 2025-05-07T20:00:41.1150792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1153570Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:41.1154799Z ^ 2025-05-07T20:00:41.1155185Z 2025-05-07T20:00:41.1156925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1159634Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:41.1160755Z ^ 2025-05-07T20:00:41.1161019Z 2025-05-07T20:00:41.1161476Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:41.1162111Z 2025-05-07T20:00:41.1163751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1166158Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:41.1167358Z ^ 2025-05-07T20:00:41.1167703Z 2025-05-07T20:00:41.1169261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1171978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:41.1173226Z ^ 2025-05-07T20:00:41.1173474Z 2025-05-07T20:00:41.1173909Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:41.1174570Z 2025-05-07T20:00:41.1176230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1178960Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:41.1180473Z ^ 2025-05-07T20:00:41.1180860Z 2025-05-07T20:00:43.4687856Z [332/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o 2025-05-07T20:00:43.4706966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.4709114Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.4710016Z ^ 2025-05-07T20:00:43.4710215Z 2025-05-07T20:00:43.4710536Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:43.4711076Z 2025-05-07T20:00:43.4712619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.4715254Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.4716392Z ^ 2025-05-07T20:00:43.4716725Z 2025-05-07T20:00:43.4718156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.4720802Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.4721847Z ^ 2025-05-07T20:00:43.4722288Z 2025-05-07T20:00:43.4722823Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:43.4723417Z 2025-05-07T20:00:43.4724920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.4727654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.4728727Z ^ 2025-05-07T20:00:43.4729085Z 2025-05-07T20:00:43.4730560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.4732747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.4733711Z ^ 2025-05-07T20:00:43.4733951Z 2025-05-07T20:00:43.4734337Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:43.4734887Z 2025-05-07T20:00:43.4736254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.4738496Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.4739521Z ^ 2025-05-07T20:00:43.4739958Z 2025-05-07T20:00:43.4741311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.4743536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.4744535Z ^ 2025-05-07T20:00:43.4744748Z 2025-05-07T20:00:43.4745126Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:43.4745711Z 2025-05-07T20:00:43.4747093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.4749306Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.4750269Z ^ 2025-05-07T20:00:43.4750572Z 2025-05-07T20:00:43.4751970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.4754203Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.4755237Z ^ 2025-05-07T20:00:43.4755470Z 2025-05-07T20:00:43.4755907Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:43.4756487Z 2025-05-07T20:00:43.4758059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.4760817Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.4761995Z ^ 2025-05-07T20:00:43.4762320Z 2025-05-07T20:00:45.4965771Z [333/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:00:45.4989601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:45.4992278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:45.4993483Z ^ 2025-05-07T20:00:45.4993753Z 2025-05-07T20:00:45.4994218Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:45.4994859Z 2025-05-07T20:00:45.4996576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:45.4999184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:45.5000735Z ^ 2025-05-07T20:00:45.5001118Z 2025-05-07T20:00:45.5002947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:45.5005842Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:45.5006987Z ^ 2025-05-07T20:00:45.5007197Z 2025-05-07T20:00:45.5007693Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:45.5008468Z 2025-05-07T20:00:45.5010291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:45.5012989Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:45.5014202Z ^ 2025-05-07T20:00:45.5014580Z 2025-05-07T20:00:45.5016219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:45.5019066Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:45.5020526Z ^ 2025-05-07T20:00:45.5020790Z 2025-05-07T20:00:45.5021261Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:45.5022177Z 2025-05-07T20:00:45.5023947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:45.5026727Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:45.5027982Z ^ 2025-05-07T20:00:45.5028360Z 2025-05-07T20:00:45.5030037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:45.5032772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:45.5034310Z ^ 2025-05-07T20:00:45.5034573Z 2025-05-07T20:00:45.5035062Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:45.5035765Z 2025-05-07T20:00:45.5037497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:45.5040292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:45.5041497Z ^ 2025-05-07T20:00:45.5041873Z 2025-05-07T20:00:45.5043571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:45.5046329Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:45.5047903Z ^ 2025-05-07T20:00:45.5048177Z 2025-05-07T20:00:45.5048643Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:45.5049308Z 2025-05-07T20:00:45.5051045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:45.5053704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:45.5054980Z ^ 2025-05-07T20:00:45.5055488Z 2025-05-07T20:00:49.9927389Z [334/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o 2025-05-07T20:00:49.9951659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.9954469Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.9955710Z ^ 2025-05-07T20:00:49.9955972Z 2025-05-07T20:00:49.9956529Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:49.9957198Z 2025-05-07T20:00:49.9958904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.9962133Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.9963472Z ^ 2025-05-07T20:00:49.9963851Z 2025-05-07T20:00:49.9965569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.9968404Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.9969692Z ^ 2025-05-07T20:00:49.9969984Z 2025-05-07T20:00:49.9970441Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:49.9971124Z 2025-05-07T20:00:49.9972889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.9975660Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.9976951Z ^ 2025-05-07T20:00:49.9977339Z 2025-05-07T20:00:49.9979091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.9982048Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.9983288Z ^ 2025-05-07T20:00:49.9983557Z 2025-05-07T20:00:49.9984029Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:49.9984739Z 2025-05-07T20:00:49.9986437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.9989242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.9990484Z ^ 2025-05-07T20:00:49.9990898Z 2025-05-07T20:00:49.9992640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.9995378Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.9996609Z ^ 2025-05-07T20:00:49.9996896Z 2025-05-07T20:00:49.9997372Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:49.9998066Z 2025-05-07T20:00:49.9999860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:50.0002569Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:50.0003780Z ^ 2025-05-07T20:00:50.0004145Z 2025-05-07T20:00:50.0005876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:50.0008940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:50.0010158Z ^ 2025-05-07T20:00:50.0010417Z 2025-05-07T20:00:50.0010950Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:50.0011657Z 2025-05-07T20:00:50.0013342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:50.0016369Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:50.0017631Z ^ 2025-05-07T20:00:50.0018048Z 2025-05-07T20:00:53.7874166Z [335/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T20:00:53.7895006Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:55.0512106Z [336/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T20:00:55.0532419Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:58.9819702Z [337/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 2025-05-07T20:00:58.9844706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:58.9847561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:58.9848942Z ^ 2025-05-07T20:00:58.9849201Z 2025-05-07T20:00:58.9849793Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:58.9850489Z 2025-05-07T20:00:58.9852226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:58.9855062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:58.9856249Z ^ 2025-05-07T20:00:58.9856601Z 2025-05-07T20:00:58.9858311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:58.9861248Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:58.9862433Z ^ 2025-05-07T20:00:58.9862694Z 2025-05-07T20:00:58.9863147Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:58.9863762Z 2025-05-07T20:00:58.9865465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:58.9868280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:58.9869478Z ^ 2025-05-07T20:00:58.9869861Z 2025-05-07T20:00:58.9871438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:58.9874064Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:58.9875271Z ^ 2025-05-07T20:00:58.9875538Z 2025-05-07T20:00:58.9875987Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:58.9876650Z 2025-05-07T20:00:58.9878291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:58.9881007Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:58.9882182Z ^ 2025-05-07T20:00:58.9882540Z 2025-05-07T20:00:58.9884248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:58.9887195Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:58.9888381Z ^ 2025-05-07T20:00:58.9888634Z 2025-05-07T20:00:58.9889200Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:58.9889908Z 2025-05-07T20:00:58.9891669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:58.9894537Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:58.9895819Z ^ 2025-05-07T20:00:58.9896194Z 2025-05-07T20:00:58.9897838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:58.9900652Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:58.9901763Z ^ 2025-05-07T20:00:58.9902023Z 2025-05-07T20:00:58.9902399Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:58.9903080Z 2025-05-07T20:00:58.9904695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:58.9907402Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:58.9908499Z ^ 2025-05-07T20:00:58.9908856Z 2025-05-07T20:00:59.7966326Z [338/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T20:00:59.7987017Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:03.2406222Z [339/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T20:01:03.2427620Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:03.8459730Z [340/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:01:03.8483448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.8486259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.8487451Z ^ 2025-05-07T20:01:03.8487717Z 2025-05-07T20:01:03.8488181Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:03.8488855Z 2025-05-07T20:01:03.8490576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.8493204Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.8494460Z ^ 2025-05-07T20:01:03.8494832Z 2025-05-07T20:01:03.8496420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.8499125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.8500492Z ^ 2025-05-07T20:01:03.8500723Z 2025-05-07T20:01:03.8501177Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:03.8501876Z 2025-05-07T20:01:03.8503620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.8506407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.8507631Z ^ 2025-05-07T20:01:03.8507987Z 2025-05-07T20:01:03.8509614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.8512637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.8513815Z ^ 2025-05-07T20:01:03.8514054Z 2025-05-07T20:01:03.8514501Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:03.8515302Z 2025-05-07T20:01:03.8517028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.8519906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.8521170Z ^ 2025-05-07T20:01:03.8521501Z 2025-05-07T20:01:03.8523395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.8526178Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.8527317Z ^ 2025-05-07T20:01:03.8527574Z 2025-05-07T20:01:03.8528024Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:03.8528567Z 2025-05-07T20:01:03.8530200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.8532851Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.8534013Z ^ 2025-05-07T20:01:03.8534367Z 2025-05-07T20:01:03.8535922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.8538513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.8539833Z ^ 2025-05-07T20:01:03.8540079Z 2025-05-07T20:01:03.8540527Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:03.8541158Z 2025-05-07T20:01:03.8542802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.8545379Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.8546510Z ^ 2025-05-07T20:01:03.8546859Z 2025-05-07T20:01:04.5847992Z [341/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o 2025-05-07T20:01:04.5872255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.5875061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.5876293Z ^ 2025-05-07T20:01:04.5876548Z 2025-05-07T20:01:04.5877020Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:04.5877646Z 2025-05-07T20:01:04.5879294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.5882084Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.5883311Z ^ 2025-05-07T20:01:04.5883689Z 2025-05-07T20:01:04.5885347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.5888093Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.5889244Z ^ 2025-05-07T20:01:04.5889498Z 2025-05-07T20:01:04.5889964Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:04.5890538Z 2025-05-07T20:01:04.5892141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.5894845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.5896246Z ^ 2025-05-07T20:01:04.5896615Z 2025-05-07T20:01:04.5898229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.5901076Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.5902215Z ^ 2025-05-07T20:01:04.5902466Z 2025-05-07T20:01:04.5902897Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:04.5903622Z 2025-05-07T20:01:04.5905303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.5907860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.5909017Z ^ 2025-05-07T20:01:04.5909381Z 2025-05-07T20:01:04.5910877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.5913568Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.5914742Z ^ 2025-05-07T20:01:04.5915007Z 2025-05-07T20:01:04.5915458Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:04.5916147Z 2025-05-07T20:01:04.5917862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.5920622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.5921824Z ^ 2025-05-07T20:01:04.5922443Z 2025-05-07T20:01:04.5924153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.5926884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.5928001Z ^ 2025-05-07T20:01:04.5928266Z 2025-05-07T20:01:04.5928713Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:04.5929344Z 2025-05-07T20:01:04.5930910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.5933584Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.5934726Z ^ 2025-05-07T20:01:04.5935066Z 2025-05-07T20:01:04.6527243Z [342/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T20:01:04.6548466Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:05.2120902Z [343/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o 2025-05-07T20:01:05.2144958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.2147543Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.2148511Z ^ 2025-05-07T20:01:05.2148713Z 2025-05-07T20:01:05.2149107Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:05.2149689Z 2025-05-07T20:01:05.2151141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.2153609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.2154664Z ^ 2025-05-07T20:01:05.2154991Z 2025-05-07T20:01:05.2156610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.2159126Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.2160208Z ^ 2025-05-07T20:01:05.2160449Z 2025-05-07T20:01:05.2160858Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:05.2161452Z 2025-05-07T20:01:05.2163025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.2165851Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.2167118Z ^ 2025-05-07T20:01:05.2167491Z 2025-05-07T20:01:05.2169241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.2172040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.2173269Z ^ 2025-05-07T20:01:05.2173533Z 2025-05-07T20:01:05.2173994Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:05.2174712Z 2025-05-07T20:01:05.2176465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.2179302Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.2180761Z ^ 2025-05-07T20:01:05.2181485Z 2025-05-07T20:01:05.2183211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.2186110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.2187304Z ^ 2025-05-07T20:01:05.2187566Z 2025-05-07T20:01:05.2187984Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:05.2188650Z 2025-05-07T20:01:05.2190490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.2193228Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.2194437Z ^ 2025-05-07T20:01:05.2194797Z 2025-05-07T20:01:05.2196474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.2199184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.2200374Z ^ 2025-05-07T20:01:05.2200622Z 2025-05-07T20:01:05.2201072Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:05.2201751Z 2025-05-07T20:01:05.2203446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.2205954Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.2207034Z ^ 2025-05-07T20:01:05.2207378Z 2025-05-07T20:01:08.3012970Z [344/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:08.3033100Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:09.5116118Z [345/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T20:01:09.5133860Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:10.8617981Z [346/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:10.8639956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.8642428Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.8643481Z ^ 2025-05-07T20:01:10.8643749Z 2025-05-07T20:01:10.8644205Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:10.8644800Z 2025-05-07T20:01:10.8646320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.8648836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.8649940Z ^ 2025-05-07T20:01:10.8650233Z 2025-05-07T20:01:10.8651549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.8653838Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.8654868Z ^ 2025-05-07T20:01:10.8655096Z 2025-05-07T20:01:10.8655434Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:10.8656034Z 2025-05-07T20:01:10.8657556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.8660113Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.8661476Z ^ 2025-05-07T20:01:10.8661806Z 2025-05-07T20:01:10.8663251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.8665815Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.8666937Z ^ 2025-05-07T20:01:10.8667179Z 2025-05-07T20:01:10.8667570Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:10.8668285Z 2025-05-07T20:01:10.8671029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.8673513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.8674626Z ^ 2025-05-07T20:01:10.8674986Z 2025-05-07T20:01:10.8676474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.8678990Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.8680039Z ^ 2025-05-07T20:01:10.8680267Z 2025-05-07T20:01:10.8680680Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:10.8681319Z 2025-05-07T20:01:10.8682852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.8685383Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.8686447Z ^ 2025-05-07T20:01:10.8686760Z 2025-05-07T20:01:10.8688149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.8690555Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.8691653Z ^ 2025-05-07T20:01:10.8691888Z 2025-05-07T20:01:10.8692277Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:10.8692875Z 2025-05-07T20:01:10.8694356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.8696705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.8697725Z ^ 2025-05-07T20:01:10.8698044Z 2025-05-07T20:01:12.4564727Z [347/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:12.4584220Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:12.5942723Z [348/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T20:01:12.5965088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.5968189Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.5969323Z ^ 2025-05-07T20:01:12.5969579Z 2025-05-07T20:01:12.5969972Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:12.5970583Z 2025-05-07T20:01:12.5972146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.5974987Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.5976210Z ^ 2025-05-07T20:01:12.5976583Z 2025-05-07T20:01:12.5978425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.5981363Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.5982568Z ^ 2025-05-07T20:01:12.5982822Z 2025-05-07T20:01:12.5983294Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:12.5983984Z 2025-05-07T20:01:12.5985748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.5988589Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.5989837Z ^ 2025-05-07T20:01:12.5990213Z 2025-05-07T20:01:12.5991908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.5994619Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.5995829Z ^ 2025-05-07T20:01:12.5996087Z 2025-05-07T20:01:12.5996545Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:12.6011431Z 2025-05-07T20:01:12.6013052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.6015503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.6016562Z ^ 2025-05-07T20:01:12.6016930Z 2025-05-07T20:01:12.6018669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.6021786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.6023362Z ^ 2025-05-07T20:01:12.6023642Z 2025-05-07T20:01:12.6024102Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:12.6024796Z 2025-05-07T20:01:12.6026565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.6029566Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.6030826Z ^ 2025-05-07T20:01:12.6031200Z 2025-05-07T20:01:12.6032937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.6035595Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.6036723Z ^ 2025-05-07T20:01:12.6036964Z 2025-05-07T20:01:12.6037405Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:12.6038101Z 2025-05-07T20:01:12.6039784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.6042564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.6043784Z ^ 2025-05-07T20:01:12.6044169Z 2025-05-07T20:01:12.8316560Z [349/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T20:01:12.8337532Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:14.0379867Z [350/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T20:01:14.0399338Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:14.3279693Z [351/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:14.3300485Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:14.3426910Z [352/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:14.3446591Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:19.3315911Z [353/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:19.3335074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.3337319Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.3338276Z ^ 2025-05-07T20:01:19.3338483Z 2025-05-07T20:01:19.3338846Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:19.3339388Z 2025-05-07T20:01:19.3340818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.3342963Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.3343917Z ^ 2025-05-07T20:01:19.3344224Z 2025-05-07T20:01:19.3345509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.3347610Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.3348533Z ^ 2025-05-07T20:01:19.3348735Z 2025-05-07T20:01:19.3349099Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:19.3349929Z 2025-05-07T20:01:19.3351234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.3353462Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.3354410Z ^ 2025-05-07T20:01:19.3354693Z 2025-05-07T20:01:19.3355979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.3358219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.3359160Z ^ 2025-05-07T20:01:19.3359357Z 2025-05-07T20:01:19.3359710Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:19.3360242Z 2025-05-07T20:01:19.3361535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.3363519Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.3364366Z ^ 2025-05-07T20:01:19.3364653Z 2025-05-07T20:01:19.3365902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.3367929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.3368832Z ^ 2025-05-07T20:01:19.3369020Z 2025-05-07T20:01:19.3369383Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:19.3369885Z 2025-05-07T20:01:19.3371154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.3373182Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.3374113Z ^ 2025-05-07T20:01:19.3374399Z 2025-05-07T20:01:19.3375628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.3377624Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.3378495Z ^ 2025-05-07T20:01:19.3378705Z 2025-05-07T20:01:19.3379059Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:19.3379701Z 2025-05-07T20:01:19.3380959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.3382959Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.3383882Z ^ 2025-05-07T20:01:19.3384326Z 2025-05-07T20:01:23.1086530Z [354/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o 2025-05-07T20:01:23.1108438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.1111017Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.1112168Z ^ 2025-05-07T20:01:23.1112434Z 2025-05-07T20:01:23.1112829Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:23.1113463Z 2025-05-07T20:01:23.1114920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.1117375Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.1118562Z ^ 2025-05-07T20:01:23.1118918Z 2025-05-07T20:01:23.1120532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.1123695Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.1124800Z ^ 2025-05-07T20:01:23.1125028Z 2025-05-07T20:01:23.1125444Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:23.1126042Z 2025-05-07T20:01:23.1127802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.1130466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.1131654Z ^ 2025-05-07T20:01:23.1132021Z 2025-05-07T20:01:23.1133781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.1136354Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.1137505Z ^ 2025-05-07T20:01:23.1137767Z 2025-05-07T20:01:23.1138218Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:23.1138884Z 2025-05-07T20:01:23.1140530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.1142973Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.1144126Z ^ 2025-05-07T20:01:23.1144489Z 2025-05-07T20:01:23.1146061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.1148588Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.1149715Z ^ 2025-05-07T20:01:23.1149929Z 2025-05-07T20:01:23.1150307Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:23.1150898Z 2025-05-07T20:01:23.1152397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.1154809Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.1155883Z ^ 2025-05-07T20:01:23.1156219Z 2025-05-07T20:01:23.1157718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.1160274Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.1161401Z ^ 2025-05-07T20:01:23.1161638Z 2025-05-07T20:01:23.1162089Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:23.1162731Z 2025-05-07T20:01:23.1164305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.1167151Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.1168310Z ^ 2025-05-07T20:01:23.1168651Z 2025-05-07T20:01:23.7938774Z [355/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o 2025-05-07T20:01:23.7960262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.7962923Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.7964042Z ^ 2025-05-07T20:01:23.7964366Z 2025-05-07T20:01:23.7964779Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:23.7965428Z 2025-05-07T20:01:23.7967074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.7969512Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.7970627Z ^ 2025-05-07T20:01:23.7971273Z 2025-05-07T20:01:23.7972798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.7975537Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.7976610Z ^ 2025-05-07T20:01:23.7976866Z 2025-05-07T20:01:23.7977261Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:23.7977863Z 2025-05-07T20:01:23.7979813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.7982213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.7983259Z ^ 2025-05-07T20:01:23.7983663Z 2025-05-07T20:01:23.7984974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.7987340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.7988440Z ^ 2025-05-07T20:01:23.7988700Z 2025-05-07T20:01:23.7989137Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:23.7989763Z 2025-05-07T20:01:23.7991227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.7993781Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.7994953Z ^ 2025-05-07T20:01:23.7995317Z 2025-05-07T20:01:23.7996705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.7999098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.8000105Z ^ 2025-05-07T20:01:23.8000370Z 2025-05-07T20:01:23.8000797Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:23.8001380Z 2025-05-07T20:01:23.8002894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.8005260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.8006387Z ^ 2025-05-07T20:01:23.8006731Z 2025-05-07T20:01:23.8008226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.8010730Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.8011734Z ^ 2025-05-07T20:01:23.8012128Z 2025-05-07T20:01:23.8012515Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:23.8013096Z 2025-05-07T20:01:23.8014532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.8016977Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.8018022Z ^ 2025-05-07T20:01:23.8018349Z 2025-05-07T20:01:26.2339162Z [356/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o 2025-05-07T20:01:26.2363155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.2366122Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.2367245Z ^ 2025-05-07T20:01:26.2367528Z 2025-05-07T20:01:26.2368025Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:26.2368764Z 2025-05-07T20:01:26.2370382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.2373421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.2374651Z ^ 2025-05-07T20:01:26.2375016Z 2025-05-07T20:01:26.2376987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.2379853Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.2381162Z ^ 2025-05-07T20:01:26.2381440Z 2025-05-07T20:01:26.2382030Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:26.2382762Z 2025-05-07T20:01:26.2384444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.2387297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.2388608Z ^ 2025-05-07T20:01:26.2388998Z 2025-05-07T20:01:26.2390770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.2393683Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.2394970Z ^ 2025-05-07T20:01:26.2395262Z 2025-05-07T20:01:26.2395747Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:26.2396469Z 2025-05-07T20:01:26.2398278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.2401244Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.2402543Z ^ 2025-05-07T20:01:26.2402954Z 2025-05-07T20:01:26.2404717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.2407643Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.2408916Z ^ 2025-05-07T20:01:26.2409205Z 2025-05-07T20:01:26.2409682Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:26.2410398Z 2025-05-07T20:01:26.2412208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.2414968Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.2416252Z ^ 2025-05-07T20:01:26.2416645Z 2025-05-07T20:01:26.2418434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.2421751Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.2423193Z ^ 2025-05-07T20:01:26.2423472Z 2025-05-07T20:01:26.2423948Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:26.2424860Z 2025-05-07T20:01:26.2426671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.2429592Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.2431143Z ^ 2025-05-07T20:01:26.2431553Z 2025-05-07T20:01:26.3703022Z [357/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T20:01:26.3729388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.3732342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.3733976Z ^ 2025-05-07T20:01:26.3734253Z 2025-05-07T20:01:26.3734740Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:26.3735476Z 2025-05-07T20:01:26.3737421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.3740441Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.3741804Z ^ 2025-05-07T20:01:26.3742343Z 2025-05-07T20:01:26.3744220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.3747111Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.3748396Z ^ 2025-05-07T20:01:26.3748673Z 2025-05-07T20:01:26.3749165Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:26.3749876Z 2025-05-07T20:01:26.3751670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.3754364Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.3755670Z ^ 2025-05-07T20:01:26.3756052Z 2025-05-07T20:01:26.3757816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.3760709Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.3762004Z ^ 2025-05-07T20:01:26.3762278Z 2025-05-07T20:01:26.3762768Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:26.3763499Z 2025-05-07T20:01:26.3765286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.3768203Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.3769492Z ^ 2025-05-07T20:01:26.3769899Z 2025-05-07T20:01:26.3771693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.3774578Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.3775888Z ^ 2025-05-07T20:01:26.3776163Z 2025-05-07T20:01:26.3776615Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:26.3777338Z 2025-05-07T20:01:26.3779127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.3782250Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.3783723Z ^ 2025-05-07T20:01:26.3784119Z 2025-05-07T20:01:26.3786039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.3788940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.3790213Z ^ 2025-05-07T20:01:26.3790504Z 2025-05-07T20:01:26.3790982Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:26.3791806Z 2025-05-07T20:01:26.3793700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.3796609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.3797912Z ^ 2025-05-07T20:01:26.3798301Z 2025-05-07T20:01:26.8976357Z [358/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:01:26.9002829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.9006075Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.9007512Z ^ 2025-05-07T20:01:26.9007808Z 2025-05-07T20:01:26.9008293Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:26.9009027Z 2025-05-07T20:01:26.9010815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.9013975Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.9015273Z ^ 2025-05-07T20:01:26.9015682Z 2025-05-07T20:01:26.9017465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.9020552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.9021808Z ^ 2025-05-07T20:01:26.9022356Z 2025-05-07T20:01:26.9022838Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:26.9023560Z 2025-05-07T20:01:26.9025370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.9028248Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.9029539Z ^ 2025-05-07T20:01:26.9029929Z 2025-05-07T20:01:26.9031706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.9034605Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.9035886Z ^ 2025-05-07T20:01:26.9036167Z 2025-05-07T20:01:26.9036641Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:26.9037373Z 2025-05-07T20:01:26.9039169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.9041591Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.9042815Z ^ 2025-05-07T20:01:26.9043234Z 2025-05-07T20:01:26.9044992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.9047757Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.9049019Z ^ 2025-05-07T20:01:26.9049311Z 2025-05-07T20:01:26.9049806Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:26.9050735Z 2025-05-07T20:01:26.9052527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.9055434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.9056736Z ^ 2025-05-07T20:01:26.9057124Z 2025-05-07T20:01:26.9058841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.9062129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.9063432Z ^ 2025-05-07T20:01:26.9063716Z 2025-05-07T20:01:26.9064194Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:26.9064908Z 2025-05-07T20:01:26.9066548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:26.9069368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:26.9070656Z ^ 2025-05-07T20:01:26.9071049Z 2025-05-07T20:01:29.0118444Z [359/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o 2025-05-07T20:01:29.0144495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.0147618Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.0148944Z ^ 2025-05-07T20:01:29.0149223Z 2025-05-07T20:01:29.0149841Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:29.0150580Z 2025-05-07T20:01:29.0152279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.0155184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.0156490Z ^ 2025-05-07T20:01:29.0156898Z 2025-05-07T20:01:29.0158670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.0161566Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.0162846Z ^ 2025-05-07T20:01:29.0163137Z 2025-05-07T20:01:29.0163621Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:29.0164347Z 2025-05-07T20:01:29.0166134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.0169095Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.0170394Z ^ 2025-05-07T20:01:29.0170793Z 2025-05-07T20:01:29.0172573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.0175504Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.0176823Z ^ 2025-05-07T20:01:29.0177098Z 2025-05-07T20:01:29.0177574Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:29.0178325Z 2025-05-07T20:01:29.0180317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.0183237Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.0184539Z ^ 2025-05-07T20:01:29.0184926Z 2025-05-07T20:01:29.0186719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.0189827Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.0191118Z ^ 2025-05-07T20:01:29.0191392Z 2025-05-07T20:01:29.0191885Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:29.0192699Z 2025-05-07T20:01:29.0194503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.0197404Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.0198810Z ^ 2025-05-07T20:01:29.0199283Z 2025-05-07T20:01:29.0201058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.0203927Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.0205205Z ^ 2025-05-07T20:01:29.0205485Z 2025-05-07T20:01:29.0205959Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:29.0206664Z 2025-05-07T20:01:29.0208475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.0211405Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.0212714Z ^ 2025-05-07T20:01:29.0213115Z 2025-05-07T20:01:30.8833556Z [360/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o 2025-05-07T20:01:30.8858844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.8862379Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.8863700Z ^ 2025-05-07T20:01:30.8863978Z 2025-05-07T20:01:30.8864464Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:30.8865198Z 2025-05-07T20:01:30.8867010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.8869934Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.8871227Z ^ 2025-05-07T20:01:30.8871631Z 2025-05-07T20:01:30.8873422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.8876297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.8877597Z ^ 2025-05-07T20:01:30.8877877Z 2025-05-07T20:01:30.8878373Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:30.8879092Z 2025-05-07T20:01:30.8880910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.8883824Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.8885144Z ^ 2025-05-07T20:01:30.8885546Z 2025-05-07T20:01:30.8887328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.8890241Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.8891534Z ^ 2025-05-07T20:01:30.8891837Z 2025-05-07T20:01:30.8892328Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:30.8893055Z 2025-05-07T20:01:30.8894888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.8897858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.8899185Z ^ 2025-05-07T20:01:30.8899953Z 2025-05-07T20:01:30.8901720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.8904688Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.8905975Z ^ 2025-05-07T20:01:30.8906246Z 2025-05-07T20:01:30.8906720Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:30.8907431Z 2025-05-07T20:01:30.8909305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.8912263Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.8913554Z ^ 2025-05-07T20:01:30.8913956Z 2025-05-07T20:01:30.8915737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.8918620Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.8919894Z ^ 2025-05-07T20:01:30.8920189Z 2025-05-07T20:01:30.8920674Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:30.8921393Z 2025-05-07T20:01:30.8923474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.8926125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.8927420Z ^ 2025-05-07T20:01:30.8927803Z 2025-05-07T20:01:36.3088190Z [361/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o 2025-05-07T20:01:36.3114290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.3117113Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.3118395Z ^ 2025-05-07T20:01:36.3118678Z 2025-05-07T20:01:36.3119181Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:36.3119893Z 2025-05-07T20:01:36.3121682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.3124988Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.3126305Z ^ 2025-05-07T20:01:36.3126719Z 2025-05-07T20:01:36.3128497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.3131374Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.3132642Z ^ 2025-05-07T20:01:36.3132917Z 2025-05-07T20:01:36.3133418Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:36.3134142Z 2025-05-07T20:01:36.3135936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.3138908Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.3140413Z ^ 2025-05-07T20:01:36.3140806Z 2025-05-07T20:01:36.3142597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.3145460Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.3146739Z ^ 2025-05-07T20:01:36.3147024Z 2025-05-07T20:01:36.3147471Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:36.3148146Z 2025-05-07T20:01:36.3149950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.3153167Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.3154470Z ^ 2025-05-07T20:01:36.3154866Z 2025-05-07T20:01:36.3156831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.3159727Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.3161148Z ^ 2025-05-07T20:01:36.3161436Z 2025-05-07T20:01:36.3162031Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:36.3162751Z 2025-05-07T20:01:36.3164566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.3167463Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.3168764Z ^ 2025-05-07T20:01:36.3169162Z 2025-05-07T20:01:36.3170946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.3173839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.3175141Z ^ 2025-05-07T20:01:36.3175423Z 2025-05-07T20:01:36.3175903Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:36.3176632Z 2025-05-07T20:01:36.3178451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.3181554Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.3182834Z ^ 2025-05-07T20:01:36.3183238Z 2025-05-07T20:01:39.2425191Z [362/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o 2025-05-07T20:01:39.2451696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.2454654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.2455976Z ^ 2025-05-07T20:01:39.2456270Z 2025-05-07T20:01:39.2456777Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:39.2457519Z 2025-05-07T20:01:39.2459342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.2462459Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.2463767Z ^ 2025-05-07T20:01:39.2464181Z 2025-05-07T20:01:39.2465951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.2468857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.2470142Z ^ 2025-05-07T20:01:39.2470450Z 2025-05-07T20:01:39.2470932Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:39.2471672Z 2025-05-07T20:01:39.2473497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.2476428Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.2477747Z ^ 2025-05-07T20:01:39.2478142Z 2025-05-07T20:01:39.2479954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.2482886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.2484373Z ^ 2025-05-07T20:01:39.2484648Z 2025-05-07T20:01:39.2485120Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:39.2485860Z 2025-05-07T20:01:39.2487773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.2490697Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.2491987Z ^ 2025-05-07T20:01:39.2493399Z 2025-05-07T20:01:39.2495243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.2498147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.2499441Z ^ 2025-05-07T20:01:39.2499916Z 2025-05-07T20:01:39.2500395Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:39.2501113Z 2025-05-07T20:01:39.2502930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.2505844Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.2507160Z ^ 2025-05-07T20:01:39.2507550Z 2025-05-07T20:01:39.2509331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.2512237Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.2513536Z ^ 2025-05-07T20:01:39.2513818Z 2025-05-07T20:01:39.2514318Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:39.2515068Z 2025-05-07T20:01:39.2516886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.2519841Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.2521127Z ^ 2025-05-07T20:01:39.2521545Z 2025-05-07T20:01:51.6821676Z [363/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T20:01:51.6844526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.6846976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.6848050Z ^ 2025-05-07T20:01:51.6848272Z 2025-05-07T20:01:51.6848699Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:51.6849374Z 2025-05-07T20:01:51.6850937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.6853368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.6854404Z ^ 2025-05-07T20:01:51.6854720Z 2025-05-07T20:01:51.6856185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.6858526Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.6859695Z ^ 2025-05-07T20:01:51.6859927Z 2025-05-07T20:01:51.6860332Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:51.6860939Z 2025-05-07T20:01:51.6862451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.6864939Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.6866050Z ^ 2025-05-07T20:01:51.6866340Z 2025-05-07T20:01:51.6867865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.6870638Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.6871803Z ^ 2025-05-07T20:01:51.6872282Z 2025-05-07T20:01:51.6872734Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:51.6873401Z 2025-05-07T20:01:51.6874984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.6877590Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.6878668Z ^ 2025-05-07T20:01:51.6879008Z 2025-05-07T20:01:51.6880473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.6882966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.6884056Z ^ 2025-05-07T20:01:51.6884312Z 2025-05-07T20:01:51.6884692Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:51.6885315Z 2025-05-07T20:01:51.6886888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.6889389Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.6890523Z ^ 2025-05-07T20:01:51.6890880Z 2025-05-07T20:01:51.6892458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.6895142Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.6896376Z ^ 2025-05-07T20:01:51.6896637Z 2025-05-07T20:01:51.6897063Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:51.6897620Z 2025-05-07T20:01:51.6899069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.6901767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.6902799Z ^ 2025-05-07T20:01:51.6903141Z 2025-05-07T20:01:53.7515506Z [364/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o 2025-05-07T20:01:53.7539973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:53.7542361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:53.7543374Z ^ 2025-05-07T20:01:53.7543617Z 2025-05-07T20:01:53.7544006Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:53.7544641Z 2025-05-07T20:01:53.7546141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:53.7548704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:53.7549786Z ^ 2025-05-07T20:01:53.7550130Z 2025-05-07T20:01:53.7551629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:53.7554292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:53.7555374Z ^ 2025-05-07T20:01:53.7555601Z 2025-05-07T20:01:53.7556014Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:53.7556631Z 2025-05-07T20:01:53.7558188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:53.7561113Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:53.7562318Z ^ 2025-05-07T20:01:53.7562681Z 2025-05-07T20:01:53.7564504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:53.7567288Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:53.7568538Z ^ 2025-05-07T20:01:53.7568772Z 2025-05-07T20:01:53.7569212Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:53.7569958Z 2025-05-07T20:01:53.7571513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:53.7574031Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:53.7575190Z ^ 2025-05-07T20:01:53.7575536Z 2025-05-07T20:01:53.7577138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:53.7580034Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:53.7581274Z ^ 2025-05-07T20:01:53.7581530Z 2025-05-07T20:01:53.7582005Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:53.7582694Z 2025-05-07T20:01:53.7584178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:53.7586892Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:53.7588049Z ^ 2025-05-07T20:01:53.7588417Z 2025-05-07T20:01:53.7590113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:53.7592849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:53.7593990Z ^ 2025-05-07T20:01:53.7594244Z 2025-05-07T20:01:53.7594572Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:53.7595116Z 2025-05-07T20:01:53.7596626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:53.7599340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:53.7600526Z ^ 2025-05-07T20:01:53.7600857Z 2025-05-07T20:02:01.5369265Z [365/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:01.5393649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5396268Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5397435Z ^ 2025-05-07T20:02:01.5397772Z 2025-05-07T20:02:01.5398254Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:01.5398911Z 2025-05-07T20:02:01.5400512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5403086Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5404267Z ^ 2025-05-07T20:02:01.5404623Z 2025-05-07T20:02:01.5406280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5409055Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5410205Z ^ 2025-05-07T20:02:01.5410772Z 2025-05-07T20:02:01.5411205Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:01.5411856Z 2025-05-07T20:02:01.5413609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5416267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5417476Z ^ 2025-05-07T20:02:01.5417827Z 2025-05-07T20:02:01.5419872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5422858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5424038Z ^ 2025-05-07T20:02:01.5424288Z 2025-05-07T20:02:01.5424758Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:01.5425431Z 2025-05-07T20:02:01.5427072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5429629Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5430798Z ^ 2025-05-07T20:02:01.5431110Z 2025-05-07T20:02:01.5432488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5434898Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5436001Z ^ 2025-05-07T20:02:01.5436253Z 2025-05-07T20:02:01.5436688Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:01.5437304Z 2025-05-07T20:02:01.5438881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5441358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5442567Z ^ 2025-05-07T20:02:01.5442898Z 2025-05-07T20:02:01.5444392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5446880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5448072Z ^ 2025-05-07T20:02:01.5448322Z 2025-05-07T20:02:01.5448776Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:01.5449464Z 2025-05-07T20:02:01.5451156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5453923Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5455430Z ^ 2025-05-07T20:02:01.5455788Z 2025-05-07T20:02:04.1954361Z [366/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T20:02:04.1978470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.1981389Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.1982599Z ^ 2025-05-07T20:02:04.1982865Z 2025-05-07T20:02:04.1983342Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:04.1984019Z 2025-05-07T20:02:04.1985776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.1988592Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.1989833Z ^ 2025-05-07T20:02:04.1990213Z 2025-05-07T20:02:04.1991927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.1994893Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.1996028Z ^ 2025-05-07T20:02:04.1996329Z 2025-05-07T20:02:04.1996911Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:04.1997580Z 2025-05-07T20:02:04.1999259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.2002144Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.2003398Z ^ 2025-05-07T20:02:04.2003791Z 2025-05-07T20:02:04.2005487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.2008224Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.2009463Z ^ 2025-05-07T20:02:04.2009716Z 2025-05-07T20:02:04.2010192Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:04.2010876Z 2025-05-07T20:02:04.2012620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.2015368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.2016549Z ^ 2025-05-07T20:02:04.2016941Z 2025-05-07T20:02:04.2018533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.2021169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.2022363Z ^ 2025-05-07T20:02:04.2022618Z 2025-05-07T20:02:04.2023052Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:04.2023709Z 2025-05-07T20:02:04.2025412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.2028087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.2029190Z ^ 2025-05-07T20:02:04.2029501Z 2025-05-07T20:02:04.2031074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.2033734Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.2034775Z ^ 2025-05-07T20:02:04.2035002Z 2025-05-07T20:02:04.2035393Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:04.2036006Z 2025-05-07T20:02:04.2037463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.2040266Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.2041571Z ^ 2025-05-07T20:02:04.2041919Z 2025-05-07T20:02:09.6029635Z [367/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o 2025-05-07T20:02:09.6053799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.6056415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.6057630Z ^ 2025-05-07T20:02:09.6057907Z 2025-05-07T20:02:09.6058343Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:09.6058996Z 2025-05-07T20:02:09.6060767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.6063746Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.6064895Z ^ 2025-05-07T20:02:09.6065254Z 2025-05-07T20:02:09.6066907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.6069361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.6070555Z ^ 2025-05-07T20:02:09.6070809Z 2025-05-07T20:02:09.6071266Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:09.6072053Z 2025-05-07T20:02:09.6073683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.6076422Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.6077663Z ^ 2025-05-07T20:02:09.6078059Z 2025-05-07T20:02:09.6079778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.6082496Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.6083665Z ^ 2025-05-07T20:02:09.6083938Z 2025-05-07T20:02:09.6084395Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:09.6085095Z 2025-05-07T20:02:09.6086762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.6089494Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.6090703Z ^ 2025-05-07T20:02:09.6091075Z 2025-05-07T20:02:09.6092757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.6095604Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.6096789Z ^ 2025-05-07T20:02:09.6097047Z 2025-05-07T20:02:09.6097505Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:09.6098207Z 2025-05-07T20:02:09.6100063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.6102697Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.6103691Z ^ 2025-05-07T20:02:09.6104031Z 2025-05-07T20:02:09.6105517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.6108220Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.6109329Z ^ 2025-05-07T20:02:09.6109556Z 2025-05-07T20:02:09.6109980Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:09.6110592Z 2025-05-07T20:02:09.6112326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.6114914Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.6116133Z ^ 2025-05-07T20:02:09.6116473Z 2025-05-07T20:02:13.2590221Z [368/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o 2025-05-07T20:02:13.2613532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:13.2616263Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:13.2617421Z ^ 2025-05-07T20:02:13.2617688Z 2025-05-07T20:02:13.2618140Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:13.2619126Z 2025-05-07T20:02:13.2620978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:13.2623991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:13.2625091Z ^ 2025-05-07T20:02:13.2625412Z 2025-05-07T20:02:13.2627025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:13.2629911Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:13.2631071Z ^ 2025-05-07T20:02:13.2631330Z 2025-05-07T20:02:13.2631779Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:13.2632452Z 2025-05-07T20:02:13.2634066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:13.2636769Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:13.2637957Z ^ 2025-05-07T20:02:13.2638323Z 2025-05-07T20:02:13.2640044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:13.2642644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:13.2643643Z ^ 2025-05-07T20:02:13.2643837Z 2025-05-07T20:02:13.2644247Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:13.2644861Z 2025-05-07T20:02:13.2646539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:13.2649179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:13.2650329Z ^ 2025-05-07T20:02:13.2650695Z 2025-05-07T20:02:13.2652330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:13.2654968Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:13.2656082Z ^ 2025-05-07T20:02:13.2656314Z 2025-05-07T20:02:13.2656713Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:13.2657365Z 2025-05-07T20:02:13.2659003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:13.2661916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:13.2663080Z ^ 2025-05-07T20:02:13.2663419Z 2025-05-07T20:02:13.2665404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:13.2668069Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:13.2669217Z ^ 2025-05-07T20:02:13.2669458Z 2025-05-07T20:02:13.2669880Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:13.2670477Z 2025-05-07T20:02:13.2672049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:13.2674899Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:13.2676079Z ^ 2025-05-07T20:02:13.2676431Z 2025-05-07T20:02:20.1161748Z [369/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o 2025-05-07T20:02:20.1186654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.1189698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:20.1190717Z ^ 2025-05-07T20:02:20.1190963Z 2025-05-07T20:02:20.1191360Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:20.1191972Z 2025-05-07T20:02:20.1193733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.1196319Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:20.1197558Z ^ 2025-05-07T20:02:20.1197966Z 2025-05-07T20:02:20.1199591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.1202197Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:20.1203392Z ^ 2025-05-07T20:02:20.1203638Z 2025-05-07T20:02:20.1204102Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:20.1204712Z 2025-05-07T20:02:20.1206119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.1208811Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:20.1209951Z ^ 2025-05-07T20:02:20.1210299Z 2025-05-07T20:02:20.1211924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.1214667Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:20.1215717Z ^ 2025-05-07T20:02:20.1215961Z 2025-05-07T20:02:20.1216359Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:20.1216949Z 2025-05-07T20:02:20.1218539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.1221248Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:20.1222518Z ^ 2025-05-07T20:02:20.1222823Z 2025-05-07T20:02:20.1224211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.1226428Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:20.1227413Z ^ 2025-05-07T20:02:20.1227634Z 2025-05-07T20:02:20.1228037Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:20.1228704Z 2025-05-07T20:02:20.1230414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.1233257Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:20.1234336Z ^ 2025-05-07T20:02:20.1234694Z 2025-05-07T20:02:20.1236494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.1239131Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:20.1240404Z ^ 2025-05-07T20:02:20.1240668Z 2025-05-07T20:02:20.1241210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:20.1241854Z 2025-05-07T20:02:20.1243505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.1246253Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:20.1247445Z ^ 2025-05-07T20:02:20.1247809Z 2025-05-07T20:02:22.0878701Z [370/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o 2025-05-07T20:02:22.0897534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.0901080Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.0902004Z ^ 2025-05-07T20:02:22.0902205Z 2025-05-07T20:02:22.0902580Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:22.0903197Z 2025-05-07T20:02:22.0904549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.0906571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.0907532Z ^ 2025-05-07T20:02:22.0907818Z 2025-05-07T20:02:22.0909103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.0911244Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.0912119Z ^ 2025-05-07T20:02:22.0912339Z 2025-05-07T20:02:22.0912675Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:22.0913182Z 2025-05-07T20:02:22.0914417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.0916395Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.0917393Z ^ 2025-05-07T20:02:22.0917701Z 2025-05-07T20:02:22.0918990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.0920964Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.0921905Z ^ 2025-05-07T20:02:22.0922413Z 2025-05-07T20:02:22.0922775Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:22.0923309Z 2025-05-07T20:02:22.0924592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.0926591Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.0927586Z ^ 2025-05-07T20:02:22.0927893Z 2025-05-07T20:02:22.0929204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.0931317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.0932490Z ^ 2025-05-07T20:02:22.0932686Z 2025-05-07T20:02:22.0933086Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:22.0933576Z 2025-05-07T20:02:22.0934955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.0937121Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.0938022Z ^ 2025-05-07T20:02:22.0938448Z 2025-05-07T20:02:22.0939994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.0941980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.0942850Z ^ 2025-05-07T20:02:22.0943070Z 2025-05-07T20:02:22.0943405Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:22.0943898Z 2025-05-07T20:02:22.0945175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.0947177Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.0948082Z ^ 2025-05-07T20:02:22.0948350Z 2025-05-07T20:02:23.9589511Z [371/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o 2025-05-07T20:02:23.9612135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.9615037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:23.9616242Z ^ 2025-05-07T20:02:23.9616513Z 2025-05-07T20:02:23.9616943Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:23.9617585Z 2025-05-07T20:02:23.9619299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.9622445Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:23.9623651Z ^ 2025-05-07T20:02:23.9623999Z 2025-05-07T20:02:23.9625660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.9628309Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:23.9629488Z ^ 2025-05-07T20:02:23.9629701Z 2025-05-07T20:02:23.9630089Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:23.9630661Z 2025-05-07T20:02:23.9632215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.9634869Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:23.9636052Z ^ 2025-05-07T20:02:23.9636422Z 2025-05-07T20:02:23.9638078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.9640728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:23.9641864Z ^ 2025-05-07T20:02:23.9642128Z 2025-05-07T20:02:23.9642581Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:23.9643248Z 2025-05-07T20:02:23.9644937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.9647613Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:23.9648700Z ^ 2025-05-07T20:02:23.9649013Z 2025-05-07T20:02:23.9650808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.9653454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:23.9654769Z ^ 2025-05-07T20:02:23.9655015Z 2025-05-07T20:02:23.9655451Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:23.9656148Z 2025-05-07T20:02:23.9657931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.9660951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:23.9662126Z ^ 2025-05-07T20:02:23.9662514Z 2025-05-07T20:02:23.9664188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.9666852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:23.9668023Z ^ 2025-05-07T20:02:23.9668292Z 2025-05-07T20:02:23.9668753Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:23.9669408Z 2025-05-07T20:02:23.9671156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.9673928Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:23.9675133Z ^ 2025-05-07T20:02:23.9675497Z 2025-05-07T20:02:25.9071840Z [372/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:02:25.9094368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.9097157Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:25.9098334Z ^ 2025-05-07T20:02:25.9098583Z 2025-05-07T20:02:25.9099053Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:25.9099916Z 2025-05-07T20:02:25.9101615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.9104287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:25.9105489Z ^ 2025-05-07T20:02:25.9105863Z 2025-05-07T20:02:25.9107475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.9110161Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:25.9111350Z ^ 2025-05-07T20:02:25.9111609Z 2025-05-07T20:02:25.9112062Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:25.9112742Z 2025-05-07T20:02:25.9114460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.9117275Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:25.9118457Z ^ 2025-05-07T20:02:25.9118819Z 2025-05-07T20:02:25.9120519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.9123505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:25.9124692Z ^ 2025-05-07T20:02:25.9124949Z 2025-05-07T20:02:25.9125388Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:25.9126066Z 2025-05-07T20:02:25.9127769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.9130851Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:25.9132043Z ^ 2025-05-07T20:02:25.9132540Z 2025-05-07T20:02:25.9134236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.9136893Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:25.9138189Z ^ 2025-05-07T20:02:25.9138549Z 2025-05-07T20:02:25.9138974Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:25.9139685Z 2025-05-07T20:02:25.9141170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.9143848Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:25.9145065Z ^ 2025-05-07T20:02:25.9157437Z 2025-05-07T20:02:25.9159198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.9161971Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:25.9163172Z ^ 2025-05-07T20:02:25.9163402Z 2025-05-07T20:02:25.9163846Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:25.9164517Z 2025-05-07T20:02:25.9166241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.9168937Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:25.9170105Z ^ 2025-05-07T20:02:25.9170424Z 2025-05-07T20:02:30.3834776Z [373/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:30.3861615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3864288Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3865450Z ^ 2025-05-07T20:02:30.3865724Z 2025-05-07T20:02:30.3866134Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:30.3866764Z 2025-05-07T20:02:30.3868355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3870910Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3872038Z ^ 2025-05-07T20:02:30.3872353Z 2025-05-07T20:02:30.3874040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3876632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3877760Z ^ 2025-05-07T20:02:30.3878001Z 2025-05-07T20:02:30.3878433Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:30.3879116Z 2025-05-07T20:02:30.3880792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3883472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3884706Z ^ 2025-05-07T20:02:30.3885066Z 2025-05-07T20:02:30.3886740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3889338Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3890658Z ^ 2025-05-07T20:02:30.3890885Z 2025-05-07T20:02:30.3891266Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:30.3891800Z 2025-05-07T20:02:30.3893246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3895571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3896706Z ^ 2025-05-07T20:02:30.3897049Z 2025-05-07T20:02:30.3898706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3901599Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3902810Z ^ 2025-05-07T20:02:30.3903078Z 2025-05-07T20:02:30.3903544Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:30.3904236Z 2025-05-07T20:02:30.3905978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3908768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3910013Z ^ 2025-05-07T20:02:30.3910390Z 2025-05-07T20:02:30.3912139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3914937Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3916147Z ^ 2025-05-07T20:02:30.3916404Z 2025-05-07T20:02:30.3916842Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:30.3917445Z 2025-05-07T20:02:30.3919070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3921644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3923049Z ^ 2025-05-07T20:02:30.3923449Z 2025-05-07T20:02:40.9684894Z [374/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o 2025-05-07T20:02:40.9709122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.9711856Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.9713075Z ^ 2025-05-07T20:02:40.9713338Z 2025-05-07T20:02:40.9713816Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.9714485Z 2025-05-07T20:02:40.9716090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.9718675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.9719882Z ^ 2025-05-07T20:02:40.9720246Z 2025-05-07T20:02:40.9722193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.9724902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.9726050Z ^ 2025-05-07T20:02:40.9726290Z 2025-05-07T20:02:40.9726734Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.9727378Z 2025-05-07T20:02:40.9728946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.9731642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.9732753Z ^ 2025-05-07T20:02:40.9733404Z 2025-05-07T20:02:40.9734985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.9737547Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.9738663Z ^ 2025-05-07T20:02:40.9738878Z 2025-05-07T20:02:40.9739281Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.9740030Z 2025-05-07T20:02:40.9741776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.9744582Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.9745702Z ^ 2025-05-07T20:02:40.9746062Z 2025-05-07T20:02:40.9747623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.9750315Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.9751444Z ^ 2025-05-07T20:02:40.9751701Z 2025-05-07T20:02:40.9752135Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.9752808Z 2025-05-07T20:02:40.9754484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.9757143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.9758333Z ^ 2025-05-07T20:02:40.9758693Z 2025-05-07T20:02:40.9760347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.9762943Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.9764127Z ^ 2025-05-07T20:02:40.9764385Z 2025-05-07T20:02:40.9764811Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.9765420Z 2025-05-07T20:02:40.9767031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.9769687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.9770781Z ^ 2025-05-07T20:02:40.9771103Z 2025-05-07T20:02:42.6112521Z [375/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:42.6136756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6139666Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6140880Z ^ 2025-05-07T20:02:42.6141151Z 2025-05-07T20:02:42.6141583Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:42.6142231Z 2025-05-07T20:02:42.6143918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6146687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6147855Z ^ 2025-05-07T20:02:42.6148224Z 2025-05-07T20:02:42.6149856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6152601Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6153780Z ^ 2025-05-07T20:02:42.6154028Z 2025-05-07T20:02:42.6154465Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:42.6155162Z 2025-05-07T20:02:42.6156948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6159880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6161061Z ^ 2025-05-07T20:02:42.6161436Z 2025-05-07T20:02:42.6163241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6165801Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6167084Z ^ 2025-05-07T20:02:42.6167349Z 2025-05-07T20:02:42.6167912Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:42.6168578Z 2025-05-07T20:02:42.6170155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6172745Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6173887Z ^ 2025-05-07T20:02:42.6174244Z 2025-05-07T20:02:42.6175882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6178726Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6180025Z ^ 2025-05-07T20:02:42.6180274Z 2025-05-07T20:02:42.6180662Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:42.6181295Z 2025-05-07T20:02:42.6182897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6185541Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6186690Z ^ 2025-05-07T20:02:42.6187068Z 2025-05-07T20:02:42.6188715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6191171Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6192321Z ^ 2025-05-07T20:02:42.6192577Z 2025-05-07T20:02:42.6193047Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:42.6193745Z 2025-05-07T20:02:42.6195410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6198053Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6199234Z ^ 2025-05-07T20:02:42.6199598Z 2025-05-07T20:02:49.9711313Z [376/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o 2025-05-07T20:02:49.9735887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.9738247Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.9739163Z ^ 2025-05-07T20:02:49.9739367Z 2025-05-07T20:02:49.9739842Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:49.9740410Z 2025-05-07T20:02:49.9741883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.9744454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.9745634Z ^ 2025-05-07T20:02:49.9745978Z 2025-05-07T20:02:49.9747524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.9750155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.9751572Z ^ 2025-05-07T20:02:49.9751809Z 2025-05-07T20:02:49.9752229Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:49.9752849Z 2025-05-07T20:02:49.9754612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.9757193Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.9758320Z ^ 2025-05-07T20:02:49.9758766Z 2025-05-07T20:02:49.9763526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.9766361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.9767485Z ^ 2025-05-07T20:02:49.9767736Z 2025-05-07T20:02:49.9768189Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:49.9768854Z 2025-05-07T20:02:49.9770511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.9773180Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.9774342Z ^ 2025-05-07T20:02:49.9774708Z 2025-05-07T20:02:49.9776364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.9778945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.9780249Z ^ 2025-05-07T20:02:49.9780515Z 2025-05-07T20:02:49.9780950Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:49.9781633Z 2025-05-07T20:02:49.9783252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.9785876Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.9787075Z ^ 2025-05-07T20:02:49.9787423Z 2025-05-07T20:02:49.9789077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.9791762Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.9792945Z ^ 2025-05-07T20:02:49.9793199Z 2025-05-07T20:02:49.9793655Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:49.9794352Z 2025-05-07T20:02:49.9796093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.9798868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.9800229Z ^ 2025-05-07T20:02:49.9800604Z 2025-05-07T20:02:52.9145263Z [377/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:52.9166134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9168570Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9169600Z ^ 2025-05-07T20:02:52.9169835Z 2025-05-07T20:02:52.9170236Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.9170811Z 2025-05-07T20:02:52.9172255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9174542Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9175605Z ^ 2025-05-07T20:02:52.9176294Z 2025-05-07T20:02:52.9177733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9180305Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9181324Z ^ 2025-05-07T20:02:52.9181541Z 2025-05-07T20:02:52.9181935Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.9182505Z 2025-05-07T20:02:52.9184021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9186461Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9187509Z ^ 2025-05-07T20:02:52.9187816Z 2025-05-07T20:02:52.9189199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9191424Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9192419Z ^ 2025-05-07T20:02:52.9192646Z 2025-05-07T20:02:52.9193019Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.9193572Z 2025-05-07T20:02:52.9195034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9197304Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9198335Z ^ 2025-05-07T20:02:52.9198647Z 2025-05-07T20:02:52.9200047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9202333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9203358Z ^ 2025-05-07T20:02:52.9203579Z 2025-05-07T20:02:52.9203965Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.9204563Z 2025-05-07T20:02:52.9205981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9208276Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9209276Z ^ 2025-05-07T20:02:52.9209590Z 2025-05-07T20:02:52.9210987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9213306Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9214297Z ^ 2025-05-07T20:02:52.9214537Z 2025-05-07T20:02:52.9214927Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.9215650Z 2025-05-07T20:02:52.9217090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9219626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9220652Z ^ 2025-05-07T20:02:52.9220958Z 2025-05-07T20:02:58.2509078Z [378/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:02:58.2533718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.2536303Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.2537504Z ^ 2025-05-07T20:02:58.2537764Z 2025-05-07T20:02:58.2538156Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.2538850Z 2025-05-07T20:02:58.2540601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.2543503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.2544726Z ^ 2025-05-07T20:02:58.2545246Z 2025-05-07T20:02:58.2546957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.2549838Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.2550957Z ^ 2025-05-07T20:02:58.2551321Z 2025-05-07T20:02:58.2551761Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.2552437Z 2025-05-07T20:02:58.2554159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.2556885Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.2558118Z ^ 2025-05-07T20:02:58.2558477Z 2025-05-07T20:02:58.2560176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.2562852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.2563898Z ^ 2025-05-07T20:02:58.2564142Z 2025-05-07T20:02:58.2564578Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.2565273Z 2025-05-07T20:02:58.2566937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.2569565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.2570717Z ^ 2025-05-07T20:02:58.2571083Z 2025-05-07T20:02:58.2572701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.2575326Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.2576464Z ^ 2025-05-07T20:02:58.2576717Z 2025-05-07T20:02:58.2577142Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.2577782Z 2025-05-07T20:02:58.2579221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.2581920Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.2583008Z ^ 2025-05-07T20:02:58.2583287Z 2025-05-07T20:02:58.2584859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.2587649Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.2588856Z ^ 2025-05-07T20:02:58.2589106Z 2025-05-07T20:02:58.2589705Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.2590396Z 2025-05-07T20:02:58.2592101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.2594973Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.2596127Z ^ 2025-05-07T20:02:58.2596495Z 2025-05-07T20:02:58.4803923Z [379/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o 2025-05-07T20:02:58.4827588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4830528Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4831982Z ^ 2025-05-07T20:02:58.4832268Z 2025-05-07T20:02:58.4832712Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.4833367Z 2025-05-07T20:02:58.4835297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4838046Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4839370Z ^ 2025-05-07T20:02:58.4839766Z 2025-05-07T20:02:58.4841670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4844334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4845467Z ^ 2025-05-07T20:02:58.4845719Z 2025-05-07T20:02:58.4846161Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.4846820Z 2025-05-07T20:02:58.4848540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4851242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4852427Z ^ 2025-05-07T20:02:58.4852820Z 2025-05-07T20:02:58.4854331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4856992Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4858175Z ^ 2025-05-07T20:02:58.4858466Z 2025-05-07T20:02:58.4858925Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.4859759Z 2025-05-07T20:02:58.4861584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4864423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4865658Z ^ 2025-05-07T20:02:58.4866032Z 2025-05-07T20:02:58.4867829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4870644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4871873Z ^ 2025-05-07T20:02:58.4872126Z 2025-05-07T20:02:58.4872570Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.4873235Z 2025-05-07T20:02:58.4874935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4877673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4879005Z ^ 2025-05-07T20:02:58.4879396Z 2025-05-07T20:02:58.4881154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4883937Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4885193Z ^ 2025-05-07T20:02:58.4885561Z 2025-05-07T20:02:58.4886020Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.4886726Z 2025-05-07T20:02:58.4888573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4891413Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4892679Z ^ 2025-05-07T20:02:58.4893057Z 2025-05-07T20:03:00.9127827Z [380/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:00.9152335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.9155122Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.9156675Z ^ 2025-05-07T20:03:00.9156943Z 2025-05-07T20:03:00.9157418Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:00.9158072Z 2025-05-07T20:03:00.9159735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.9162761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.9163979Z ^ 2025-05-07T20:03:00.9164347Z 2025-05-07T20:03:00.9166014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.9168722Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.9169870Z ^ 2025-05-07T20:03:00.9170133Z 2025-05-07T20:03:00.9170576Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:00.9171257Z 2025-05-07T20:03:00.9172960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.9175665Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.9176862Z ^ 2025-05-07T20:03:00.9177223Z 2025-05-07T20:03:00.9178603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.9181115Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.9182244Z ^ 2025-05-07T20:03:00.9182483Z 2025-05-07T20:03:00.9182919Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:00.9183595Z 2025-05-07T20:03:00.9185196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.9187997Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.9189203Z ^ 2025-05-07T20:03:00.9189572Z 2025-05-07T20:03:00.9191272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.9193915Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.9195181Z ^ 2025-05-07T20:03:00.9195467Z 2025-05-07T20:03:00.9195935Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:00.9196828Z 2025-05-07T20:03:00.9198581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.9201531Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.9202643Z ^ 2025-05-07T20:03:00.9202981Z 2025-05-07T20:03:00.9204575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.9207513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.9208634Z ^ 2025-05-07T20:03:00.9208864Z 2025-05-07T20:03:00.9209294Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:00.9210007Z 2025-05-07T20:03:00.9211780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.9214415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.9215663Z ^ 2025-05-07T20:03:00.9216040Z 2025-05-07T20:03:02.1488349Z [381/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:03:02.1511525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.1514007Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.1515226Z ^ 2025-05-07T20:03:02.1515472Z 2025-05-07T20:03:02.1515972Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.1516566Z 2025-05-07T20:03:02.1518058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.1520456Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.1521525Z ^ 2025-05-07T20:03:02.1521857Z 2025-05-07T20:03:02.1523832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.1526491Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.1527634Z ^ 2025-05-07T20:03:02.1527964Z 2025-05-07T20:03:02.1528407Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.1529037Z 2025-05-07T20:03:02.1530654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.1533085Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.1534147Z ^ 2025-05-07T20:03:02.1534470Z 2025-05-07T20:03:02.1535946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.1538378Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.1539437Z ^ 2025-05-07T20:03:02.1539786Z 2025-05-07T20:03:02.1540174Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.1540750Z 2025-05-07T20:03:02.1542198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.1544776Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.1546006Z ^ 2025-05-07T20:03:02.1546352Z 2025-05-07T20:03:02.1547895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.1550731Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.1551783Z ^ 2025-05-07T20:03:02.1551999Z 2025-05-07T20:03:02.1552577Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.1553247Z 2025-05-07T20:03:02.1554810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.1557606Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.1558915Z ^ 2025-05-07T20:03:02.1559289Z 2025-05-07T20:03:02.1560881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.1563319Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.1564508Z ^ 2025-05-07T20:03:02.1564751Z 2025-05-07T20:03:02.1565193Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.1565850Z 2025-05-07T20:03:02.1567482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.1570089Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.1571263Z ^ 2025-05-07T20:03:02.1571639Z 2025-05-07T20:03:02.4349828Z [382/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o 2025-05-07T20:03:02.4373821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4376472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4377564Z ^ 2025-05-07T20:03:02.4377819Z 2025-05-07T20:03:02.4378263Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.4378915Z 2025-05-07T20:03:02.4380687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4383292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4384452Z ^ 2025-05-07T20:03:02.4384805Z 2025-05-07T20:03:02.4386338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4388934Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4390058Z ^ 2025-05-07T20:03:02.4390344Z 2025-05-07T20:03:02.4390743Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.4391376Z 2025-05-07T20:03:02.4392972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4395583Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4396713Z ^ 2025-05-07T20:03:02.4397072Z 2025-05-07T20:03:02.4398635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4401212Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4402323Z ^ 2025-05-07T20:03:02.4402565Z 2025-05-07T20:03:02.4403027Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.4403635Z 2025-05-07T20:03:02.4405199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4407801Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4409211Z ^ 2025-05-07T20:03:02.4409586Z 2025-05-07T20:03:02.4412213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4414563Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4415679Z ^ 2025-05-07T20:03:02.4416061Z 2025-05-07T20:03:02.4416504Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.4417129Z 2025-05-07T20:03:02.4418674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4421367Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4422700Z ^ 2025-05-07T20:03:02.4423025Z 2025-05-07T20:03:02.4424468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4426974Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4428036Z ^ 2025-05-07T20:03:02.4428262Z 2025-05-07T20:03:02.4428637Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.4429264Z 2025-05-07T20:03:02.4430867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4433427Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4434514Z ^ 2025-05-07T20:03:02.4434865Z 2025-05-07T20:03:02.5572126Z [383/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o 2025-05-07T20:03:02.5594449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.5597080Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.5598257Z ^ 2025-05-07T20:03:02.5598593Z 2025-05-07T20:03:02.5599009Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.5599632Z 2025-05-07T20:03:02.5601261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.5603726Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.5604825Z ^ 2025-05-07T20:03:02.5605151Z 2025-05-07T20:03:02.5606693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.5609317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.5610404Z ^ 2025-05-07T20:03:02.5610641Z 2025-05-07T20:03:02.5611078Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.5611739Z 2025-05-07T20:03:02.5613269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.5615792Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.5616918Z ^ 2025-05-07T20:03:02.5617300Z 2025-05-07T20:03:02.5618947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.5621687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.5623039Z ^ 2025-05-07T20:03:02.5623300Z 2025-05-07T20:03:02.5623733Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.5624750Z 2025-05-07T20:03:02.5626409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.5629153Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.5630219Z ^ 2025-05-07T20:03:02.5630543Z 2025-05-07T20:03:02.5632139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.5634926Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.5635991Z ^ 2025-05-07T20:03:02.5636236Z 2025-05-07T20:03:02.5636562Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.5637114Z 2025-05-07T20:03:02.5638507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.5641065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.5642071Z ^ 2025-05-07T20:03:02.5642406Z 2025-05-07T20:03:02.5644025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.5646438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.5647565Z ^ 2025-05-07T20:03:02.5647828Z 2025-05-07T20:03:02.5648274Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.5648909Z 2025-05-07T20:03:02.5650520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.5652950Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.5654052Z ^ 2025-05-07T20:03:02.5654341Z 2025-05-07T20:03:07.2377150Z [384/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:07.2399215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.2401797Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.2402951Z ^ 2025-05-07T20:03:07.2403162Z 2025-05-07T20:03:07.2403536Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:07.2404192Z 2025-05-07T20:03:07.2405781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.2408149Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.2409185Z ^ 2025-05-07T20:03:07.2409524Z 2025-05-07T20:03:07.2410985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.2413506Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.2414649Z ^ 2025-05-07T20:03:07.2414884Z 2025-05-07T20:03:07.2415305Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:07.2415884Z 2025-05-07T20:03:07.2417391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.2420105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.2421260Z ^ 2025-05-07T20:03:07.2421603Z 2025-05-07T20:03:07.2423415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.2426324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.2427419Z ^ 2025-05-07T20:03:07.2427629Z 2025-05-07T20:03:07.2428020Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:07.2428638Z 2025-05-07T20:03:07.2430397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.2432914Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.2434228Z ^ 2025-05-07T20:03:07.2434690Z 2025-05-07T20:03:07.2436395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.2438933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.2440051Z ^ 2025-05-07T20:03:07.2440301Z 2025-05-07T20:03:07.2440787Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:07.2441372Z 2025-05-07T20:03:07.2442894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.2444958Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.2445749Z ^ 2025-05-07T20:03:07.2446024Z 2025-05-07T20:03:07.2447299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.2449453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.2450341Z ^ 2025-05-07T20:03:07.2450527Z 2025-05-07T20:03:07.2450826Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:07.2451385Z 2025-05-07T20:03:07.2452794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.2455011Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.2456136Z ^ 2025-05-07T20:03:07.2456390Z 2025-05-07T20:03:08.6049219Z [385/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o 2025-05-07T20:03:08.6071481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.6074134Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.6075248Z ^ 2025-05-07T20:03:08.6075524Z 2025-05-07T20:03:08.6075946Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:08.6076554Z 2025-05-07T20:03:08.6078103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.6080547Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.6081658Z ^ 2025-05-07T20:03:08.6081993Z 2025-05-07T20:03:08.6083490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.6086009Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.6087045Z ^ 2025-05-07T20:03:08.6087295Z 2025-05-07T20:03:08.6087729Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:08.6088369Z 2025-05-07T20:03:08.6089936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.6092608Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.6094040Z ^ 2025-05-07T20:03:08.6094425Z 2025-05-07T20:03:08.6096016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.6098703Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.6099921Z ^ 2025-05-07T20:03:08.6100157Z 2025-05-07T20:03:08.6100609Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:08.6101331Z 2025-05-07T20:03:08.6102959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.6105508Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.6106693Z ^ 2025-05-07T20:03:08.6107051Z 2025-05-07T20:03:08.6108645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.6111386Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.6112594Z ^ 2025-05-07T20:03:08.6112833Z 2025-05-07T20:03:08.6113276Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:08.6113924Z 2025-05-07T20:03:08.6115513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.6118073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.6119250Z ^ 2025-05-07T20:03:08.6119605Z 2025-05-07T20:03:08.6121262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.6124219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.6125420Z ^ 2025-05-07T20:03:08.6125690Z 2025-05-07T20:03:08.6126147Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:08.6126778Z 2025-05-07T20:03:08.6128348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.6130933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.6132115Z ^ 2025-05-07T20:03:08.6132440Z 2025-05-07T20:03:10.2596418Z [386/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T20:03:10.2611203Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:11.3171784Z [387/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T20:03:11.3193183Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:11.5843649Z [388/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T20:03:11.5863906Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:23.3578625Z [389/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:23.3604036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.3606971Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.3608141Z ^ 2025-05-07T20:03:23.3608439Z 2025-05-07T20:03:23.3608923Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.3609631Z 2025-05-07T20:03:23.3611399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.3614202Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.3615424Z ^ 2025-05-07T20:03:23.3615804Z 2025-05-07T20:03:23.3617555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.3620694Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.3622179Z ^ 2025-05-07T20:03:23.3622462Z 2025-05-07T20:03:23.3622910Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.3623656Z 2025-05-07T20:03:23.3625445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.3628350Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.3629617Z ^ 2025-05-07T20:03:23.3630031Z 2025-05-07T20:03:23.3631716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.3634485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.3635901Z ^ 2025-05-07T20:03:23.3636186Z 2025-05-07T20:03:23.3636656Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.3637355Z 2025-05-07T20:03:23.3639250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.3642029Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.3643285Z ^ 2025-05-07T20:03:23.3643743Z 2025-05-07T20:03:23.3645501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.3648308Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.3649566Z ^ 2025-05-07T20:03:23.3649832Z 2025-05-07T20:03:23.3650306Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.3651015Z 2025-05-07T20:03:23.3652767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.3655557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.3656774Z ^ 2025-05-07T20:03:23.3657156Z 2025-05-07T20:03:23.3658835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.3661830Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.3663065Z ^ 2025-05-07T20:03:23.3663327Z 2025-05-07T20:03:23.3663827Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.3664540Z 2025-05-07T20:03:23.3666300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.3668983Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.3670158Z ^ 2025-05-07T20:03:23.3670528Z 2025-05-07T20:03:23.6573983Z [390/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:23.6596881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.6599440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.6600548Z ^ 2025-05-07T20:03:23.6600796Z 2025-05-07T20:03:23.6601224Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.6601849Z 2025-05-07T20:03:23.6603351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.6605932Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.6607070Z ^ 2025-05-07T20:03:23.6607440Z 2025-05-07T20:03:23.6608989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.6611477Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.6612551Z ^ 2025-05-07T20:03:23.6612814Z 2025-05-07T20:03:23.6613317Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.6614019Z 2025-05-07T20:03:23.6615767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.6618621Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.6620076Z ^ 2025-05-07T20:03:23.6620693Z 2025-05-07T20:03:23.6622655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.6625665Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.6626933Z ^ 2025-05-07T20:03:23.6627202Z 2025-05-07T20:03:23.6627671Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.6628363Z 2025-05-07T20:03:23.6630257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.6633182Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.6634475Z ^ 2025-05-07T20:03:23.6634839Z 2025-05-07T20:03:23.6636381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.6639004Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.6640199Z ^ 2025-05-07T20:03:23.6640452Z 2025-05-07T20:03:23.6640925Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.6641603Z 2025-05-07T20:03:23.6643291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.6646108Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.6647370Z ^ 2025-05-07T20:03:23.6647751Z 2025-05-07T20:03:23.6649469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.6652234Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.6653460Z ^ 2025-05-07T20:03:23.6653760Z 2025-05-07T20:03:23.6654228Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.6654920Z 2025-05-07T20:03:23.6656673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.6659217Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.6660511Z ^ 2025-05-07T20:03:23.6660861Z 2025-05-07T20:03:24.0418937Z [391/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_dense.cpp 2025-05-07T20:03:24.0439715Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:25.4193188Z [392/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:25.4217691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.4220292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.4221640Z ^ 2025-05-07T20:03:25.4221901Z 2025-05-07T20:03:25.4222752Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:25.4223355Z 2025-05-07T20:03:25.4225010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.4227725Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.4228997Z ^ 2025-05-07T20:03:25.4229382Z 2025-05-07T20:03:25.4231201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.4234027Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.4235226Z ^ 2025-05-07T20:03:25.4235464Z 2025-05-07T20:03:25.4235834Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:25.4236461Z 2025-05-07T20:03:25.4237927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.4240352Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.4241418Z ^ 2025-05-07T20:03:25.4241748Z 2025-05-07T20:03:25.4243360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.4245825Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.4247041Z ^ 2025-05-07T20:03:25.4247309Z 2025-05-07T20:03:25.4247805Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:25.4248473Z 2025-05-07T20:03:25.4250198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.4253175Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.4254411Z ^ 2025-05-07T20:03:25.4254787Z 2025-05-07T20:03:25.4256503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.4259667Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.4260900Z ^ 2025-05-07T20:03:25.4261215Z 2025-05-07T20:03:25.4261670Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:25.4262334Z 2025-05-07T20:03:25.4264173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.4266934Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.4268241Z ^ 2025-05-07T20:03:25.4268737Z 2025-05-07T20:03:25.4270484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.4273409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.4274593Z ^ 2025-05-07T20:03:25.4274858Z 2025-05-07T20:03:25.4275341Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:25.4276023Z 2025-05-07T20:03:25.4277715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.4280430Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.4281634Z ^ 2025-05-07T20:03:25.4282047Z 2025-05-07T20:03:28.4874003Z [393/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:28.4900055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.4902919Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.4904180Z ^ 2025-05-07T20:03:28.4904448Z 2025-05-07T20:03:28.4904939Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:28.4905619Z 2025-05-07T20:03:28.4907357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.4910406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.4911736Z ^ 2025-05-07T20:03:28.4912142Z 2025-05-07T20:03:28.4913909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.4916504Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.4917782Z ^ 2025-05-07T20:03:28.4918055Z 2025-05-07T20:03:28.4918553Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:28.4919249Z 2025-05-07T20:03:28.4920993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.4924015Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.4925249Z ^ 2025-05-07T20:03:28.4925658Z 2025-05-07T20:03:28.4927350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.4930076Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.4931517Z ^ 2025-05-07T20:03:28.4931777Z 2025-05-07T20:03:28.4932252Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:28.4933005Z 2025-05-07T20:03:28.4934789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.4937695Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.4939198Z ^ 2025-05-07T20:03:28.4939711Z 2025-05-07T20:03:28.4941454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.4944477Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.4945718Z ^ 2025-05-07T20:03:28.4946018Z 2025-05-07T20:03:28.4946490Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:28.4947307Z 2025-05-07T20:03:28.4949233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.4952110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.4953380Z ^ 2025-05-07T20:03:28.4953778Z 2025-05-07T20:03:28.4955484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.4958357Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.4959628Z ^ 2025-05-07T20:03:28.4959908Z 2025-05-07T20:03:28.4960389Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:28.4961137Z 2025-05-07T20:03:28.4962688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.4965480Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.4966708Z ^ 2025-05-07T20:03:28.4967122Z 2025-05-07T20:03:28.6536908Z [394/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:28.6562372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.6565231Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.6566516Z ^ 2025-05-07T20:03:28.6566759Z 2025-05-07T20:03:28.6567249Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:28.6567956Z 2025-05-07T20:03:28.6569729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.6572503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.6573782Z ^ 2025-05-07T20:03:28.6574192Z 2025-05-07T20:03:28.6575986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.6578869Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.6580317Z ^ 2025-05-07T20:03:28.6580602Z 2025-05-07T20:03:28.6581130Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:28.6581832Z 2025-05-07T20:03:28.6583628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.6586267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.6587465Z ^ 2025-05-07T20:03:28.6587841Z 2025-05-07T20:03:28.6589607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.6592404Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.6593699Z ^ 2025-05-07T20:03:28.6593970Z 2025-05-07T20:03:28.6594468Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:28.6595134Z 2025-05-07T20:03:28.6596911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.6599978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.6601370Z ^ 2025-05-07T20:03:28.6601708Z 2025-05-07T20:03:28.6603460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.6609311Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.6610746Z ^ 2025-05-07T20:03:28.6611035Z 2025-05-07T20:03:28.6611544Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:28.6612304Z 2025-05-07T20:03:28.6614093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.6617006Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.6618298Z ^ 2025-05-07T20:03:28.6618727Z 2025-05-07T20:03:28.6620712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.6623754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.6625053Z ^ 2025-05-07T20:03:28.6625351Z 2025-05-07T20:03:28.6625847Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:28.6626573Z 2025-05-07T20:03:28.6628390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.6631241Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.6632562Z ^ 2025-05-07T20:03:28.6632971Z 2025-05-07T20:03:29.4839355Z [395/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:29.4860902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.4863263Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.4864319Z ^ 2025-05-07T20:03:29.4864579Z 2025-05-07T20:03:29.4865003Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:29.4865598Z 2025-05-07T20:03:29.4866925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.4869361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.4870434Z ^ 2025-05-07T20:03:29.4870745Z 2025-05-07T20:03:29.4872217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.4874549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.4875646Z ^ 2025-05-07T20:03:29.4875894Z 2025-05-07T20:03:29.4876357Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:29.4876916Z 2025-05-07T20:03:29.4878487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.4880956Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.4882094Z ^ 2025-05-07T20:03:29.4882459Z 2025-05-07T20:03:29.4883963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.4886280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.4887672Z ^ 2025-05-07T20:03:29.4887918Z 2025-05-07T20:03:29.4888302Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:29.4888958Z 2025-05-07T20:03:29.4890642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.4893205Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.4894432Z ^ 2025-05-07T20:03:29.4894798Z 2025-05-07T20:03:29.4896387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.4898920Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.4900239Z ^ 2025-05-07T20:03:29.4900510Z 2025-05-07T20:03:29.4900924Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:29.4901546Z 2025-05-07T20:03:29.4902977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.4905481Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.4906619Z ^ 2025-05-07T20:03:29.4906968Z 2025-05-07T20:03:29.4908481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.4910754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.4911712Z ^ 2025-05-07T20:03:29.4911935Z 2025-05-07T20:03:29.4912325Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:29.4912940Z 2025-05-07T20:03:29.4914434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.4916869Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.4917911Z ^ 2025-05-07T20:03:29.4918250Z 2025-05-07T20:03:33.0637136Z [396/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:03:33.0658686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.0661361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.0662494Z ^ 2025-05-07T20:03:33.0662761Z 2025-05-07T20:03:33.0663205Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:33.0663820Z 2025-05-07T20:03:33.0665483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.0667969Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.0669144Z ^ 2025-05-07T20:03:33.0669430Z 2025-05-07T20:03:33.0670790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.0673193Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.0674215Z ^ 2025-05-07T20:03:33.0674445Z 2025-05-07T20:03:33.0674853Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:33.0675502Z 2025-05-07T20:03:33.0677073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.0679576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.0680943Z ^ 2025-05-07T20:03:33.0681300Z 2025-05-07T20:03:33.0682858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.0685510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.0686604Z ^ 2025-05-07T20:03:33.0687047Z 2025-05-07T20:03:33.0687459Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:33.0688135Z 2025-05-07T20:03:33.0689744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.0692357Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.0693537Z ^ 2025-05-07T20:03:33.0693905Z 2025-05-07T20:03:33.0695541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.0698087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.0699264Z ^ 2025-05-07T20:03:33.0699654Z 2025-05-07T20:03:33.0700086Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:33.0700735Z 2025-05-07T20:03:33.0702269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.0704977Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.0706170Z ^ 2025-05-07T20:03:33.0706544Z 2025-05-07T20:03:33.0708149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.0710718Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.0711775Z ^ 2025-05-07T20:03:33.0711998Z 2025-05-07T20:03:33.0712362Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:33.0712883Z 2025-05-07T20:03:33.0714225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.0716304Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.0717358Z ^ 2025-05-07T20:03:33.0717669Z 2025-05-07T20:03:34.3292274Z [397/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:03:34.3309014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.3310889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.3311712Z ^ 2025-05-07T20:03:34.3311934Z 2025-05-07T20:03:34.3312272Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.3312725Z 2025-05-07T20:03:34.3313921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.3315770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.3316627Z ^ 2025-05-07T20:03:34.3316887Z 2025-05-07T20:03:34.3318050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.3319879Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.3320733Z ^ 2025-05-07T20:03:34.3320925Z 2025-05-07T20:03:34.3321253Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.3321762Z 2025-05-07T20:03:34.3323417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.3325266Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.3326230Z ^ 2025-05-07T20:03:34.3326525Z 2025-05-07T20:03:34.3327631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.3329628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.3330443Z ^ 2025-05-07T20:03:34.3330659Z 2025-05-07T20:03:34.3330977Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.3331436Z 2025-05-07T20:03:34.3332571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.3334462Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.3335345Z ^ 2025-05-07T20:03:34.3335616Z 2025-05-07T20:03:34.3336725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.3338516Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.3339359Z ^ 2025-05-07T20:03:34.3339671Z 2025-05-07T20:03:34.3339992Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.3340491Z 2025-05-07T20:03:34.3341625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.3343463Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.3344284Z ^ 2025-05-07T20:03:34.3344556Z 2025-05-07T20:03:34.3345669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.3347460Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.3348317Z ^ 2025-05-07T20:03:34.3348506Z 2025-05-07T20:03:34.3348853Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.3349310Z 2025-05-07T20:03:34.3350418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.3352293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.3353150Z ^ 2025-05-07T20:03:34.3353403Z 2025-05-07T20:03:36.2905069Z [398/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:36.2927961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:36.2930565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:36.2931721Z ^ 2025-05-07T20:03:36.2931980Z 2025-05-07T20:03:36.2932513Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:36.2933144Z 2025-05-07T20:03:36.2934723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:36.2937268Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:36.2938445Z ^ 2025-05-07T20:03:36.2938819Z 2025-05-07T20:03:36.2940534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:36.2943464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:36.2944576Z ^ 2025-05-07T20:03:36.2944824Z 2025-05-07T20:03:36.2945234Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:36.2945848Z 2025-05-07T20:03:36.2947715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:36.2950284Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:36.2951547Z ^ 2025-05-07T20:03:36.2951862Z 2025-05-07T20:03:36.2953570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:36.2956040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:36.2957178Z ^ 2025-05-07T20:03:36.2957424Z 2025-05-07T20:03:36.2957890Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:36.2958492Z 2025-05-07T20:03:36.2960028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:36.2962623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:36.2963780Z ^ 2025-05-07T20:03:36.2964165Z 2025-05-07T20:03:36.2965726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:36.2968216Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:36.2969323Z ^ 2025-05-07T20:03:36.2969610Z 2025-05-07T20:03:36.2970033Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:36.2970652Z 2025-05-07T20:03:36.2972219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:36.2974744Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:36.2975930Z ^ 2025-05-07T20:03:36.2976282Z 2025-05-07T20:03:36.2977865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:36.2980486Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:36.2981619Z ^ 2025-05-07T20:03:36.2981877Z 2025-05-07T20:03:36.2982300Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:36.2982949Z 2025-05-07T20:03:36.2984523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:36.2987317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:36.2988447Z ^ 2025-05-07T20:03:36.2988819Z 2025-05-07T20:03:37.9720108Z [399/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:37.9742106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9744776Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9745925Z ^ 2025-05-07T20:03:37.9746295Z 2025-05-07T20:03:37.9746739Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:37.9747383Z 2025-05-07T20:03:37.9748845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9751371Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9752494Z ^ 2025-05-07T20:03:37.9753172Z 2025-05-07T20:03:37.9754781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9757479Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9758609Z ^ 2025-05-07T20:03:37.9758849Z 2025-05-07T20:03:37.9759281Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:37.9759889Z 2025-05-07T20:03:37.9764705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9767423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9768623Z ^ 2025-05-07T20:03:37.9768985Z 2025-05-07T20:03:37.9770570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9773066Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9774124Z ^ 2025-05-07T20:03:37.9774337Z 2025-05-07T20:03:37.9774760Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:37.9775407Z 2025-05-07T20:03:37.9776972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9779655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9780794Z ^ 2025-05-07T20:03:37.9781150Z 2025-05-07T20:03:37.9782594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9785262Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9786438Z ^ 2025-05-07T20:03:37.9786699Z 2025-05-07T20:03:37.9787175Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:37.9787810Z 2025-05-07T20:03:37.9789420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9792049Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9793210Z ^ 2025-05-07T20:03:37.9793579Z 2025-05-07T20:03:37.9795125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9797762Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9798895Z ^ 2025-05-07T20:03:37.9799184Z 2025-05-07T20:03:37.9799828Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:37.9800457Z 2025-05-07T20:03:37.9802054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9804947Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9806022Z ^ 2025-05-07T20:03:37.9806342Z 2025-05-07T20:03:46.3230728Z [400/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:46.3255680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3258491Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3259979Z ^ 2025-05-07T20:03:46.3260280Z 2025-05-07T20:03:46.3260760Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:46.3261497Z 2025-05-07T20:03:46.3263303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3266446Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3267686Z ^ 2025-05-07T20:03:46.3268097Z 2025-05-07T20:03:46.3269971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3272544Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3273801Z ^ 2025-05-07T20:03:46.3274056Z 2025-05-07T20:03:46.3274617Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:46.3275334Z 2025-05-07T20:03:46.3276987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3279665Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3280885Z ^ 2025-05-07T20:03:46.3281247Z 2025-05-07T20:03:46.3282923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3285605Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3286814Z ^ 2025-05-07T20:03:46.3287071Z 2025-05-07T20:03:46.3287515Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:46.3288187Z 2025-05-07T20:03:46.3289848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3292342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3293405Z ^ 2025-05-07T20:03:46.3293747Z 2025-05-07T20:03:46.3295293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3297707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3298835Z ^ 2025-05-07T20:03:46.3299105Z 2025-05-07T20:03:46.3299732Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:46.3300385Z 2025-05-07T20:03:46.3302037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3304716Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3305944Z ^ 2025-05-07T20:03:46.3306296Z 2025-05-07T20:03:46.3307952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3311002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3312185Z ^ 2025-05-07T20:03:46.3312469Z 2025-05-07T20:03:46.3312912Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:46.3313667Z 2025-05-07T20:03:46.3315373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3318114Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3319397Z ^ 2025-05-07T20:03:46.3319742Z 2025-05-07T20:03:49.4164347Z [401/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adagrad.cpp 2025-05-07T20:03:49.4183153Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:50.9645861Z [402/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:50.9668094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.9670779Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.9671932Z ^ 2025-05-07T20:03:50.9672201Z 2025-05-07T20:03:50.9672666Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:50.9673316Z 2025-05-07T20:03:50.9674825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.9677468Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.9678659Z ^ 2025-05-07T20:03:50.9679002Z 2025-05-07T20:03:50.9680600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.9683190Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.9684334Z ^ 2025-05-07T20:03:50.9684575Z 2025-05-07T20:03:50.9685014Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:50.9685627Z 2025-05-07T20:03:50.9687053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.9689452Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.9690816Z ^ 2025-05-07T20:03:50.9691163Z 2025-05-07T20:03:50.9692672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.9695282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.9696411Z ^ 2025-05-07T20:03:50.9696647Z 2025-05-07T20:03:50.9697091Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:50.9697825Z 2025-05-07T20:03:50.9699670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.9702133Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.9703225Z ^ 2025-05-07T20:03:50.9703550Z 2025-05-07T20:03:50.9705106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.9707519Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.9708590Z ^ 2025-05-07T20:03:50.9708833Z 2025-05-07T20:03:50.9709291Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:50.9709921Z 2025-05-07T20:03:50.9711396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.9713868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.9715044Z ^ 2025-05-07T20:03:50.9715405Z 2025-05-07T20:03:50.9716932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.9719510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.9720535Z ^ 2025-05-07T20:03:50.9720793Z 2025-05-07T20:03:50.9721203Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:50.9721834Z 2025-05-07T20:03:50.9723686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.9726125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.9727224Z ^ 2025-05-07T20:03:50.9727564Z 2025-05-07T20:03:51.5431538Z [403/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:51.5457535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.5460397Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.5461536Z ^ 2025-05-07T20:03:51.5461787Z 2025-05-07T20:03:51.5462247Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:51.5462872Z 2025-05-07T20:03:51.5464411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.5467017Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.5468175Z ^ 2025-05-07T20:03:51.5468524Z 2025-05-07T20:03:51.5470002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.5472587Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.5473735Z ^ 2025-05-07T20:03:51.5474042Z 2025-05-07T20:03:51.5474506Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:51.5475110Z 2025-05-07T20:03:51.5476702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.5479522Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.5480836Z ^ 2025-05-07T20:03:51.5481206Z 2025-05-07T20:03:51.5482684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.5485292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.5486590Z ^ 2025-05-07T20:03:51.5486848Z 2025-05-07T20:03:51.5487284Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:51.5487951Z 2025-05-07T20:03:51.5489555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.5492098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.5493236Z ^ 2025-05-07T20:03:51.5493616Z 2025-05-07T20:03:51.5495129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.5497782Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.5499103Z ^ 2025-05-07T20:03:51.5499454Z 2025-05-07T20:03:51.5500129Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:51.5500658Z 2025-05-07T20:03:51.5502050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.5504412Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.5505472Z ^ 2025-05-07T20:03:51.5505825Z 2025-05-07T20:03:51.5507270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.5509492Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.5510590Z ^ 2025-05-07T20:03:51.5510837Z 2025-05-07T20:03:51.5511274Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:51.5511902Z 2025-05-07T20:03:51.5513369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.5515801Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.5516887Z ^ 2025-05-07T20:03:51.5517250Z 2025-05-07T20:03:52.5294575Z [404/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_sgd.cpp 2025-05-07T20:03:52.5314945Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:53.0677433Z [405/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:53.0702594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.0705506Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.0706774Z ^ 2025-05-07T20:03:53.0707044Z 2025-05-07T20:03:53.0707533Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.0708259Z 2025-05-07T20:03:53.0710039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.0712785Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.0713996Z ^ 2025-05-07T20:03:53.0714483Z 2025-05-07T20:03:53.0716194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.0719007Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.0720251Z ^ 2025-05-07T20:03:53.0720534Z 2025-05-07T20:03:53.0720979Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.0721474Z 2025-05-07T20:03:53.0723335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.0725947Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.0727096Z ^ 2025-05-07T20:03:53.0727474Z 2025-05-07T20:03:53.0729119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.0731860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.0733099Z ^ 2025-05-07T20:03:53.0733360Z 2025-05-07T20:03:53.0733824Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.0734541Z 2025-05-07T20:03:53.0736290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.0739043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.0740244Z ^ 2025-05-07T20:03:53.0740598Z 2025-05-07T20:03:53.0742254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.0744787Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.0745839Z ^ 2025-05-07T20:03:53.0746253Z 2025-05-07T20:03:53.0746735Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.0747424Z 2025-05-07T20:03:53.0749250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.0752077Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.0753320Z ^ 2025-05-07T20:03:53.0753686Z 2025-05-07T20:03:53.0755420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.0758206Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.0759447Z ^ 2025-05-07T20:03:53.0759709Z 2025-05-07T20:03:53.0760175Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.0760854Z 2025-05-07T20:03:53.0762578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.0765336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.0766552Z ^ 2025-05-07T20:03:53.0766925Z 2025-05-07T20:03:56.3823802Z [406/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lamb.cpp 2025-05-07T20:03:56.3843919Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:57.0351591Z [407/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T20:03:57.0372099Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:57.2386885Z [408/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:57.2409042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2411545Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2412673Z ^ 2025-05-07T20:03:57.2412928Z 2025-05-07T20:03:57.2413368Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.2414054Z 2025-05-07T20:03:57.2415710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2418285Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2419427Z ^ 2025-05-07T20:03:57.2419903Z 2025-05-07T20:03:57.2421470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2424236Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2425395Z ^ 2025-05-07T20:03:57.2425650Z 2025-05-07T20:03:57.2426111Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.2426757Z 2025-05-07T20:03:57.2428359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2430974Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2432463Z ^ 2025-05-07T20:03:57.2432820Z 2025-05-07T20:03:57.2434342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2436812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2437955Z ^ 2025-05-07T20:03:57.2438190Z 2025-05-07T20:03:57.2438610Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.2439334Z 2025-05-07T20:03:57.2440864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2443424Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2444521Z ^ 2025-05-07T20:03:57.2444828Z 2025-05-07T20:03:57.2446393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2448770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2449874Z ^ 2025-05-07T20:03:57.2450127Z 2025-05-07T20:03:57.2450594Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.2451221Z 2025-05-07T20:03:57.2452702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2455137Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2456226Z ^ 2025-05-07T20:03:57.2456612Z 2025-05-07T20:03:57.2458153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2460732Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2461792Z ^ 2025-05-07T20:03:57.2462058Z 2025-05-07T20:03:57.2462476Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.2463089Z 2025-05-07T20:03:57.2464643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2467096Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2468172Z ^ 2025-05-07T20:03:57.2468508Z 2025-05-07T20:03:59.5017721Z [409/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T20:03:59.5036102Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:59.6555159Z [410/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T20:03:59.6573399Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:00.1110805Z [411/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:00.1133515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.1136311Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.1137456Z ^ 2025-05-07T20:04:00.1137709Z 2025-05-07T20:04:00.1138172Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.1138870Z 2025-05-07T20:04:00.1140570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.1143223Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.1144358Z ^ 2025-05-07T20:04:00.1144739Z 2025-05-07T20:04:00.1146229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.1149171Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.1150509Z ^ 2025-05-07T20:04:00.1150796Z 2025-05-07T20:04:00.1151200Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.1151815Z 2025-05-07T20:04:00.1153378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.1158621Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.1159970Z ^ 2025-05-07T20:04:00.1160312Z 2025-05-07T20:04:00.1161800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.1164255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.1165363Z ^ 2025-05-07T20:04:00.1165601Z 2025-05-07T20:04:00.1166028Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.1166656Z 2025-05-07T20:04:00.1168238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.1170726Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.1171924Z ^ 2025-05-07T20:04:00.1172288Z 2025-05-07T20:04:00.1173958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.1176465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.1177610Z ^ 2025-05-07T20:04:00.1177901Z 2025-05-07T20:04:00.1178316Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.1178909Z 2025-05-07T20:04:00.1180664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.1183219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.1184345Z ^ 2025-05-07T20:04:00.1184712Z 2025-05-07T20:04:00.1186258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.1188251Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.1189146Z ^ 2025-05-07T20:04:00.1189344Z 2025-05-07T20:04:00.1189740Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.1190602Z 2025-05-07T20:04:00.1192163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.1194828Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.1195972Z ^ 2025-05-07T20:04:00.1196331Z 2025-05-07T20:04:00.7496709Z [412/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adam.cpp 2025-05-07T20:04:00.7515196Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:01.7281842Z [413/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:01.7305163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.7307966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.7309062Z ^ 2025-05-07T20:04:01.7309293Z 2025-05-07T20:04:01.7309705Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.7310284Z 2025-05-07T20:04:01.7311828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.7314291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.7315356Z ^ 2025-05-07T20:04:01.7315695Z 2025-05-07T20:04:01.7317175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.7319581Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.7320684Z ^ 2025-05-07T20:04:01.7320908Z 2025-05-07T20:04:01.7321329Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.7321911Z 2025-05-07T20:04:01.7323714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.7326109Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.7327207Z ^ 2025-05-07T20:04:01.7327541Z 2025-05-07T20:04:01.7328954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.7331681Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.7332783Z ^ 2025-05-07T20:04:01.7333014Z 2025-05-07T20:04:01.7333576Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.7334207Z 2025-05-07T20:04:01.7335711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.7338170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.7339357Z ^ 2025-05-07T20:04:01.7339835Z 2025-05-07T20:04:01.7341277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.7343651Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.7344730Z ^ 2025-05-07T20:04:01.7344964Z 2025-05-07T20:04:01.7345405Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.7346004Z 2025-05-07T20:04:01.7347452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.7349916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.7350999Z ^ 2025-05-07T20:04:01.7351371Z 2025-05-07T20:04:01.7352861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.7355159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.7356186Z ^ 2025-05-07T20:04:01.7356436Z 2025-05-07T20:04:01.7356822Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.7357412Z 2025-05-07T20:04:01.7358868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.7361287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.7362353Z ^ 2025-05-07T20:04:01.7362678Z 2025-05-07T20:04:01.8524410Z [414/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:01.8546256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.8548788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.8549862Z ^ 2025-05-07T20:04:01.8550105Z 2025-05-07T20:04:01.8550530Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.8551107Z 2025-05-07T20:04:01.8552614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.8555048Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.8556136Z ^ 2025-05-07T20:04:01.8556490Z 2025-05-07T20:04:01.8557939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.8560294Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.8561335Z ^ 2025-05-07T20:04:01.8561570Z 2025-05-07T20:04:01.8561987Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.8562624Z 2025-05-07T20:04:01.8564073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.8566497Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.8567839Z ^ 2025-05-07T20:04:01.8568178Z 2025-05-07T20:04:01.8569638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.8572126Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.8573268Z ^ 2025-05-07T20:04:01.8573493Z 2025-05-07T20:04:01.8573925Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.8574673Z 2025-05-07T20:04:01.8576220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.8578666Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.8579902Z ^ 2025-05-07T20:04:01.8580241Z 2025-05-07T20:04:01.8581718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.8584126Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.8585201Z ^ 2025-05-07T20:04:01.8585463Z 2025-05-07T20:04:01.8585882Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.8586480Z 2025-05-07T20:04:01.8587956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.8590433Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.8591585Z ^ 2025-05-07T20:04:01.8591936Z 2025-05-07T20:04:01.8593383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.8595694Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.8596763Z ^ 2025-05-07T20:04:01.8596994Z 2025-05-07T20:04:01.8597372Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.8597975Z 2025-05-07T20:04:01.8599403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.8601858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.8602876Z ^ 2025-05-07T20:04:01.8603233Z 2025-05-07T20:04:02.5134978Z [415/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:02.5157158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.5159777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.5160939Z ^ 2025-05-07T20:04:02.5161210Z 2025-05-07T20:04:02.5161545Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:02.5162122Z 2025-05-07T20:04:02.5163601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.5166116Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.5167216Z ^ 2025-05-07T20:04:02.5167587Z 2025-05-07T20:04:02.5169126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.5171634Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.5172707Z ^ 2025-05-07T20:04:02.5172935Z 2025-05-07T20:04:02.5173409Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:02.5174295Z 2025-05-07T20:04:02.5175975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.5178640Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.5179962Z ^ 2025-05-07T20:04:02.5180314Z 2025-05-07T20:04:02.5181825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.5184638Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.5185769Z ^ 2025-05-07T20:04:02.5186038Z 2025-05-07T20:04:02.5186461Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:02.5187116Z 2025-05-07T20:04:02.5188710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.5191327Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.5192488Z ^ 2025-05-07T20:04:02.5192845Z 2025-05-07T20:04:02.5194454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.5196867Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.5198020Z ^ 2025-05-07T20:04:02.5198274Z 2025-05-07T20:04:02.5198710Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:02.5199298Z 2025-05-07T20:04:02.5200736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.5203208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.5204345Z ^ 2025-05-07T20:04:02.5204681Z 2025-05-07T20:04:02.5206217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.5208743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.5209854Z ^ 2025-05-07T20:04:02.5210102Z 2025-05-07T20:04:02.5210528Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:02.5211137Z 2025-05-07T20:04:02.5212619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.5215093Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.5216195Z ^ 2025-05-07T20:04:02.5216537Z 2025-05-07T20:04:04.4404100Z [416/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T20:04:05.4075776Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:05.4097108Z [417/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T20:04:05.4118461Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:06.1142797Z [418/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:06.1163671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1166170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1167317Z ^ 2025-05-07T20:04:06.1167558Z 2025-05-07T20:04:06.1167960Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.1168612Z 2025-05-07T20:04:06.1170096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1172767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1173842Z ^ 2025-05-07T20:04:06.1174404Z 2025-05-07T20:04:06.1176004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1178521Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1179693Z ^ 2025-05-07T20:04:06.1180096Z 2025-05-07T20:04:06.1180481Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.1181059Z 2025-05-07T20:04:06.1182552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1184956Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1186055Z ^ 2025-05-07T20:04:06.1186386Z 2025-05-07T20:04:06.1187880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1190321Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1191415Z ^ 2025-05-07T20:04:06.1191656Z 2025-05-07T20:04:06.1192041Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.1192682Z 2025-05-07T20:04:06.1194223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1196685Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1197764Z ^ 2025-05-07T20:04:06.1198125Z 2025-05-07T20:04:06.1199619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1201917Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1202943Z ^ 2025-05-07T20:04:06.1203178Z 2025-05-07T20:04:06.1203575Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.1204155Z 2025-05-07T20:04:06.1205655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1207806Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1208709Z ^ 2025-05-07T20:04:06.1208982Z 2025-05-07T20:04:06.1210281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1212820Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1213897Z ^ 2025-05-07T20:04:06.1214149Z 2025-05-07T20:04:06.1214715Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.1215370Z 2025-05-07T20:04:06.1216840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1219722Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1220826Z ^ 2025-05-07T20:04:06.1221151Z 2025-05-07T20:04:06.1940439Z [419/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:06.1959989Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:06.4197681Z [420/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:06.4216248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.4218686Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.4219876Z ^ 2025-05-07T20:04:06.4220111Z 2025-05-07T20:04:06.4220541Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.4221144Z 2025-05-07T20:04:06.4223011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.4225683Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.4226968Z ^ 2025-05-07T20:04:06.4227322Z 2025-05-07T20:04:06.4228655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.4230849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.4231749Z ^ 2025-05-07T20:04:06.4231968Z 2025-05-07T20:04:06.4232327Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.4232873Z 2025-05-07T20:04:06.4234501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.4237058Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.4238030Z ^ 2025-05-07T20:04:06.4238344Z 2025-05-07T20:04:06.4239790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.4241886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.4242948Z ^ 2025-05-07T20:04:06.4243160Z 2025-05-07T20:04:06.4243621Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.4244162Z 2025-05-07T20:04:06.4245416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.4247497Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.4248451Z ^ 2025-05-07T20:04:06.4248738Z 2025-05-07T20:04:06.4250074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.4252261Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.4253208Z ^ 2025-05-07T20:04:06.4253424Z 2025-05-07T20:04:06.4253785Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.4254322Z 2025-05-07T20:04:06.4255710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.4257888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.4258923Z ^ 2025-05-07T20:04:06.4259209Z 2025-05-07T20:04:06.4260777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.4263017Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.4263945Z ^ 2025-05-07T20:04:06.4264151Z 2025-05-07T20:04:06.4264512Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.4265092Z 2025-05-07T20:04:06.4266448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.4268629Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.4269596Z ^ 2025-05-07T20:04:06.4269891Z 2025-05-07T20:04:08.2400327Z [421/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T20:04:08.2418703Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:08.3275150Z [422/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:08.3298172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:08.3300987Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:08.3302102Z ^ 2025-05-07T20:04:08.3302393Z 2025-05-07T20:04:08.3302839Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:08.3303480Z 2025-05-07T20:04:08.3305109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:08.3307603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:08.3308765Z ^ 2025-05-07T20:04:08.3309132Z 2025-05-07T20:04:08.3310751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:08.3313354Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:08.3314450Z ^ 2025-05-07T20:04:08.3314703Z 2025-05-07T20:04:08.3315162Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:08.3315724Z 2025-05-07T20:04:08.3317144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:08.3319425Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:08.3320545Z ^ 2025-05-07T20:04:08.3320928Z 2025-05-07T20:04:08.3322753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:08.3325312Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:08.3326412Z ^ 2025-05-07T20:04:08.3326668Z 2025-05-07T20:04:08.3327092Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:08.3327729Z 2025-05-07T20:04:08.3329342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:08.3331887Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:08.3333313Z ^ 2025-05-07T20:04:08.3333654Z 2025-05-07T20:04:08.3335314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:08.3337861Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:08.3338998Z ^ 2025-05-07T20:04:08.3339245Z 2025-05-07T20:04:08.3339806Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:08.3340604Z 2025-05-07T20:04:08.3342264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:08.3344705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:08.3345798Z ^ 2025-05-07T20:04:08.3346168Z 2025-05-07T20:04:08.3347611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:08.3350022Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:08.3351096Z ^ 2025-05-07T20:04:08.3351330Z 2025-05-07T20:04:08.3351759Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:08.3352283Z 2025-05-07T20:04:08.3353606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:08.3355820Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:08.3356823Z ^ 2025-05-07T20:04:08.3357127Z 2025-05-07T20:04:08.5366418Z [423/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:08.5386312Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:09.1142267Z [424/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T20:04:09.1162558Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:09.6672882Z [425/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_none.cpp 2025-05-07T20:04:09.6693762Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:09.7124247Z [426/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:09.7146155Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:09.7493838Z [427/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:09.7515578Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:09.8309598Z [428/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T20:04:09.8331277Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:10.1328150Z [429/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:10.1349395Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:10.6480240Z [430/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:10.6502979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.6505691Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.6506800Z ^ 2025-05-07T20:04:10.6507024Z 2025-05-07T20:04:10.6507454Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:10.6508154Z 2025-05-07T20:04:10.6509762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.6512204Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.6513309Z ^ 2025-05-07T20:04:10.6513695Z 2025-05-07T20:04:10.6515198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.6517575Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.6518690Z ^ 2025-05-07T20:04:10.6518975Z 2025-05-07T20:04:10.6519404Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:10.6520077Z 2025-05-07T20:04:10.6521873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.6524697Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.6525796Z ^ 2025-05-07T20:04:10.6526459Z 2025-05-07T20:04:10.6528030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.6530603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.6531642Z ^ 2025-05-07T20:04:10.6531851Z 2025-05-07T20:04:10.6532231Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:10.6532920Z 2025-05-07T20:04:10.6534434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.6536798Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.6537848Z ^ 2025-05-07T20:04:10.6538186Z 2025-05-07T20:04:10.6539857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.6542310Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.6543491Z ^ 2025-05-07T20:04:10.6543757Z 2025-05-07T20:04:10.6544255Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:10.6544948Z 2025-05-07T20:04:10.6546574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.6549339Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.6550593Z ^ 2025-05-07T20:04:10.6550895Z 2025-05-07T20:04:10.6552630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.6555247Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.6556342Z ^ 2025-05-07T20:04:10.6556599Z 2025-05-07T20:04:10.6557056Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:10.6557737Z 2025-05-07T20:04:10.6559461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.6561884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.6563005Z ^ 2025-05-07T20:04:10.6563380Z 2025-05-07T20:04:10.7563534Z [431/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T20:04:10.7583404Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:10.9742024Z [432/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T20:04:10.9762172Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:11.1025476Z [433/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:11.1050001Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.1052784Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.1053993Z ^ 2025-05-07T20:04:11.1054260Z 2025-05-07T20:04:11.1054769Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:11.1055455Z 2025-05-07T20:04:11.1057169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.1059880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.1061071Z ^ 2025-05-07T20:04:11.1061694Z 2025-05-07T20:04:11.1063348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.1066168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.1067401Z ^ 2025-05-07T20:04:11.1067671Z 2025-05-07T20:04:11.1068149Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:11.1068849Z 2025-05-07T20:04:11.1070573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.1073200Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.1074394Z ^ 2025-05-07T20:04:11.1074768Z 2025-05-07T20:04:11.1076449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.1079180Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.1080392Z ^ 2025-05-07T20:04:11.1080656Z 2025-05-07T20:04:11.1081087Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:11.1081708Z 2025-05-07T20:04:11.1083324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.1086037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.1087263Z ^ 2025-05-07T20:04:11.1087648Z 2025-05-07T20:04:11.1089288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.1092070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.1093245Z ^ 2025-05-07T20:04:11.1093482Z 2025-05-07T20:04:11.1093954Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:11.1094585Z 2025-05-07T20:04:11.1096213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.1098949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.1100373Z ^ 2025-05-07T20:04:11.1100727Z 2025-05-07T20:04:11.1102453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.1105158Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.1106276Z ^ 2025-05-07T20:04:11.1106520Z 2025-05-07T20:04:11.1107147Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:11.1107815Z 2025-05-07T20:04:11.1109435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.1112276Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.1113474Z ^ 2025-05-07T20:04:11.1113857Z 2025-05-07T20:04:11.5871857Z [434/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:11.5894108Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:13.4088425Z [435/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T20:04:13.4110837Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:13.5716295Z [436/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:13.5742603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.5745944Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.5747447Z ^ 2025-05-07T20:04:13.5747752Z 2025-05-07T20:04:13.5748244Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:13.5748970Z 2025-05-07T20:04:13.5750769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.5753943Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.5755291Z ^ 2025-05-07T20:04:13.5755697Z 2025-05-07T20:04:13.5757508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.5760421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.5761729Z ^ 2025-05-07T20:04:13.5762013Z 2025-05-07T20:04:13.5762528Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:13.5763274Z 2025-05-07T20:04:13.5764868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.5767787Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.5769118Z ^ 2025-05-07T20:04:13.5769520Z 2025-05-07T20:04:13.5771309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.5774208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.5775505Z ^ 2025-05-07T20:04:13.5775814Z 2025-05-07T20:04:13.5776296Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:13.5777026Z 2025-05-07T20:04:13.5778859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.5781976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.5783251Z ^ 2025-05-07T20:04:13.5783663Z 2025-05-07T20:04:13.5785462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.5788323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.5789652Z ^ 2025-05-07T20:04:13.5789955Z 2025-05-07T20:04:13.5790445Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:13.5791424Z 2025-05-07T20:04:13.5793233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.5796297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.5797608Z ^ 2025-05-07T20:04:13.5798037Z 2025-05-07T20:04:13.5799817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.5802986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.5804298Z ^ 2025-05-07T20:04:13.5804612Z 2025-05-07T20:04:13.5805096Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:13.5805813Z 2025-05-07T20:04:13.5807650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.5810555Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.5811883Z ^ 2025-05-07T20:04:13.5812294Z 2025-05-07T20:04:14.4191087Z [437/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T20:04:14.4213519Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:15.1037462Z [438/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:15.1058984Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:15.2980897Z [439/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T20:04:15.3000132Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:15.4045315Z [440/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T20:04:15.4063575Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:15.7373591Z [441/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T20:04:15.7391702Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:17.3138368Z [442/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:17.3156974Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:17.3640100Z [443/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T20:04:17.3659720Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:17.4069309Z [444/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T20:04:17.4088631Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:17.6898902Z [445/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:17.6925409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.6928433Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.6929805Z ^ 2025-05-07T20:04:17.6930105Z 2025-05-07T20:04:17.6930659Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:17.6931476Z 2025-05-07T20:04:17.6933264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.6936505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.6937997Z ^ 2025-05-07T20:04:17.6938410Z 2025-05-07T20:04:17.6940377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.6943586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.6944908Z ^ 2025-05-07T20:04:17.6945195Z 2025-05-07T20:04:17.6945681Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:17.6946446Z 2025-05-07T20:04:17.6948245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.6951197Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.6952505Z ^ 2025-05-07T20:04:17.6952913Z 2025-05-07T20:04:17.6954729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.6957642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.6958957Z ^ 2025-05-07T20:04:17.6959239Z 2025-05-07T20:04:17.6959761Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:17.6960486Z 2025-05-07T20:04:17.6962289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.6965246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.6966574Z ^ 2025-05-07T20:04:17.6966982Z 2025-05-07T20:04:17.6968800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.6971771Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.6973106Z ^ 2025-05-07T20:04:17.6973393Z 2025-05-07T20:04:17.6973901Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:17.6974653Z 2025-05-07T20:04:17.6976491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.6979221Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.6980715Z ^ 2025-05-07T20:04:17.6981096Z 2025-05-07T20:04:17.6982906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.6985996Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.6987320Z ^ 2025-05-07T20:04:17.6987713Z 2025-05-07T20:04:17.6988237Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:17.6988964Z 2025-05-07T20:04:17.6990751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.6993901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.6995204Z ^ 2025-05-07T20:04:17.6995647Z 2025-05-07T20:04:18.2738862Z [446/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T20:04:18.2761128Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.6971454Z [447/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T20:04:18.6990614Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.7505134Z [448/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:18.7527746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7530743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7531924Z ^ 2025-05-07T20:04:18.7532175Z 2025-05-07T20:04:18.7532606Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.7533275Z 2025-05-07T20:04:18.7534972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7537743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7538824Z ^ 2025-05-07T20:04:18.7539169Z 2025-05-07T20:04:18.7540951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7543361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7544507Z ^ 2025-05-07T20:04:18.7544821Z 2025-05-07T20:04:18.7545295Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.7545999Z 2025-05-07T20:04:18.7547494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7549709Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7550830Z ^ 2025-05-07T20:04:18.7551177Z 2025-05-07T20:04:18.7552597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7555144Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7556268Z ^ 2025-05-07T20:04:18.7556527Z 2025-05-07T20:04:18.7556948Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.7557549Z 2025-05-07T20:04:18.7559044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7561514Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7562794Z ^ 2025-05-07T20:04:18.7563153Z 2025-05-07T20:04:18.7564765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7567648Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7568758Z ^ 2025-05-07T20:04:18.7569046Z 2025-05-07T20:04:18.7570844Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.7571506Z 2025-05-07T20:04:18.7572985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7575753Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7576866Z ^ 2025-05-07T20:04:18.7577177Z 2025-05-07T20:04:18.7578631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7581398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7582552Z ^ 2025-05-07T20:04:18.7582840Z 2025-05-07T20:04:18.7583241Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.7583833Z 2025-05-07T20:04:18.7585402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7587905Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7589097Z ^ 2025-05-07T20:04:18.7589456Z 2025-05-07T20:04:18.7642103Z [449/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T20:04:18.7661169Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.7996928Z [450/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:18.8016010Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.8159519Z [451/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:18.8182744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.8185389Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.8186670Z ^ 2025-05-07T20:04:18.8186908Z 2025-05-07T20:04:18.8187285Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.8187867Z 2025-05-07T20:04:18.8189599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.8192061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.8193094Z ^ 2025-05-07T20:04:18.8193427Z 2025-05-07T20:04:18.8194986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.8197581Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.8198759Z ^ 2025-05-07T20:04:18.8199045Z 2025-05-07T20:04:18.8199513Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.8200092Z 2025-05-07T20:04:18.8201495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.8204074Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.8205061Z ^ 2025-05-07T20:04:18.8205386Z 2025-05-07T20:04:18.8206774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.8209377Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.8210603Z ^ 2025-05-07T20:04:18.8210869Z 2025-05-07T20:04:18.8211274Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.8211870Z 2025-05-07T20:04:18.8213480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.8216265Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.8217464Z ^ 2025-05-07T20:04:18.8217815Z 2025-05-07T20:04:18.8219303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.8222334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.8223588Z ^ 2025-05-07T20:04:18.8223797Z 2025-05-07T20:04:18.8224174Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.8224767Z 2025-05-07T20:04:18.8226385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.8228889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.8229892Z ^ 2025-05-07T20:04:18.8230308Z 2025-05-07T20:04:18.8232024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.8234543Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.8235715Z ^ 2025-05-07T20:04:18.8236029Z 2025-05-07T20:04:18.8236498Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.8237135Z 2025-05-07T20:04:18.8238770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.8241227Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.8242432Z ^ 2025-05-07T20:04:18.8242784Z 2025-05-07T20:04:18.9850927Z [452/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T20:04:18.9869954Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:19.1313296Z [453/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T20:04:19.1332427Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:20.7973692Z [454/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:20.7998176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8001060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8002340Z ^ 2025-05-07T20:04:20.8002617Z 2025-05-07T20:04:20.8003097Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:20.8003823Z 2025-05-07T20:04:20.8005539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8008094Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8009559Z ^ 2025-05-07T20:04:20.8009962Z 2025-05-07T20:04:20.8011789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8014729Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8015929Z ^ 2025-05-07T20:04:20.8016226Z 2025-05-07T20:04:20.8016697Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:20.8017475Z 2025-05-07T20:04:20.8019305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8022529Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8023825Z ^ 2025-05-07T20:04:20.8024186Z 2025-05-07T20:04:20.8025879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8028543Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8029707Z ^ 2025-05-07T20:04:20.8029937Z 2025-05-07T20:04:20.8030397Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:20.8031078Z 2025-05-07T20:04:20.8032708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8035535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8036673Z ^ 2025-05-07T20:04:20.8037065Z 2025-05-07T20:04:20.8038736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8041335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8042558Z ^ 2025-05-07T20:04:20.8042821Z 2025-05-07T20:04:20.8043329Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:20.8044037Z 2025-05-07T20:04:20.8045799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8048670Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8049958Z ^ 2025-05-07T20:04:20.8050343Z 2025-05-07T20:04:20.8052108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8054888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8056111Z ^ 2025-05-07T20:04:20.8056591Z 2025-05-07T20:04:20.8057080Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:20.8057752Z 2025-05-07T20:04:20.8059326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8062070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8063334Z ^ 2025-05-07T20:04:20.8063705Z 2025-05-07T20:04:21.6149163Z [455/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:21.6170319Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:23.0656626Z [456/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:23.0675546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.0677727Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.0678779Z ^ 2025-05-07T20:04:23.0678993Z 2025-05-07T20:04:23.0679377Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:23.0679920Z 2025-05-07T20:04:23.0681251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.0683386Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.0684371Z ^ 2025-05-07T20:04:23.0684657Z 2025-05-07T20:04:23.0685967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.0688066Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.0689000Z ^ 2025-05-07T20:04:23.0689243Z 2025-05-07T20:04:23.0689608Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:23.0690139Z 2025-05-07T20:04:23.0691457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.0693600Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.0694599Z ^ 2025-05-07T20:04:23.0694900Z 2025-05-07T20:04:23.0696213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.0698535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.0699767Z ^ 2025-05-07T20:04:23.0699983Z 2025-05-07T20:04:23.0700344Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:23.0700897Z 2025-05-07T20:04:23.0702198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.0704522Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.0705484Z ^ 2025-05-07T20:04:23.0705802Z 2025-05-07T20:04:23.0707062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.0709165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.0710099Z ^ 2025-05-07T20:04:23.0710340Z 2025-05-07T20:04:23.0710698Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:23.0711227Z 2025-05-07T20:04:23.0712539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.0714651Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.0715639Z ^ 2025-05-07T20:04:23.0715922Z 2025-05-07T20:04:23.0717206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.0719309Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.0720286Z ^ 2025-05-07T20:04:23.0720492Z 2025-05-07T20:04:23.0720843Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:23.0721405Z 2025-05-07T20:04:23.0722975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.0725079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.0726022Z ^ 2025-05-07T20:04:23.0726329Z 2025-05-07T20:04:24.1920044Z [457/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T20:04:24.1936428Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:24.2140892Z [458/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:24.2160860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.2163133Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.2164295Z ^ 2025-05-07T20:04:24.2164522Z 2025-05-07T20:04:24.2165046Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:24.2165622Z 2025-05-07T20:04:24.2167119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.2169762Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.2170975Z ^ 2025-05-07T20:04:24.2171334Z 2025-05-07T20:04:24.2172956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.2175716Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.2176760Z ^ 2025-05-07T20:04:24.2177005Z 2025-05-07T20:04:24.2177429Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:24.2178113Z 2025-05-07T20:04:24.2180081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.2182882Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.2184109Z ^ 2025-05-07T20:04:24.2184485Z 2025-05-07T20:04:24.2186247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.2189038Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.2190284Z ^ 2025-05-07T20:04:24.2190545Z 2025-05-07T20:04:24.2191041Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:24.2191729Z 2025-05-07T20:04:24.2193436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.2196184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.2197397Z ^ 2025-05-07T20:04:24.2197773Z 2025-05-07T20:04:24.2199459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.2202333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.2203412Z ^ 2025-05-07T20:04:24.2203664Z 2025-05-07T20:04:24.2204100Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:24.2204812Z 2025-05-07T20:04:24.2206275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.2208878Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.2210156Z ^ 2025-05-07T20:04:24.2210589Z 2025-05-07T20:04:24.2212254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.2214903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.2216064Z ^ 2025-05-07T20:04:24.2216306Z 2025-05-07T20:04:24.2216790Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:24.2217468Z 2025-05-07T20:04:24.2219139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.2222172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.2223305Z ^ 2025-05-07T20:04:24.2223697Z 2025-05-07T20:04:24.8383616Z [459/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_split_host.so -o fbgemm_gpu_tbe_training_backward_split_host.so CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T20:04:25.4754718Z [460/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cpp 2025-05-07T20:04:25.4768752Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:25.6851435Z [461/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:25.6882907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:25.6886341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:25.6887856Z ^ 2025-05-07T20:04:25.6888170Z 2025-05-07T20:04:25.6888721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:25.6889568Z 2025-05-07T20:04:25.6891661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:25.6895049Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:25.6896538Z ^ 2025-05-07T20:04:25.6897045Z 2025-05-07T20:04:25.6899073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:25.6902507Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:25.6903971Z ^ 2025-05-07T20:04:25.6904287Z 2025-05-07T20:04:25.6904874Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:25.6905719Z 2025-05-07T20:04:25.6907975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:25.6911528Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:25.6913071Z ^ 2025-05-07T20:04:25.6913514Z 2025-05-07T20:04:25.6915504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:25.6918892Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:25.6920401Z ^ 2025-05-07T20:04:25.6920714Z 2025-05-07T20:04:25.6921243Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:25.6922376Z 2025-05-07T20:04:25.6924867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:25.6928580Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:25.6930153Z ^ 2025-05-07T20:04:25.6930615Z 2025-05-07T20:04:25.6932863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:25.6936395Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:25.6937889Z ^ 2025-05-07T20:04:25.6938233Z 2025-05-07T20:04:25.6938793Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:25.6939758Z 2025-05-07T20:04:25.6941815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:25.6945095Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:25.6946531Z ^ 2025-05-07T20:04:25.6947006Z 2025-05-07T20:04:25.6949017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:25.6952320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:25.6953734Z ^ 2025-05-07T20:04:25.6954059Z 2025-05-07T20:04:25.6954572Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:25.6955074Z 2025-05-07T20:04:25.6956571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:25.6959772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:25.6961417Z ^ 2025-05-07T20:04:25.6961902Z 2025-05-07T20:04:26.2853845Z [462/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cpp 2025-05-07T20:04:26.2867974Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:26.9115086Z [463/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_models.cpp 2025-05-07T20:04:26.9131282Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:26.9588563Z [464/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:26.9605465Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:28.6465775Z [465/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T20:04:28.6485179Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:29.2314702Z [466/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T20:04:29.2330597Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:29.3230461Z [467/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T20:04:29.3244572Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:29.6814609Z [468/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T20:04:29.6830693Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:30.7871823Z [469/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T20:04:30.7890121Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:30.8877411Z [470/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T20:04:30.8895471Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:31.3709722Z [471/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T20:04:31.3728649Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:32.4571127Z [472/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T20:04:32.4590470Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:32.8366330Z [473/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:32.8384329Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:34.5415711Z [474/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T20:04:34.5433689Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:34.5579885Z [475/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T20:04:34.5597603Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:34.5936096Z [476/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T20:04:34.5954028Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:35.5215858Z [477/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T20:04:35.5231503Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:35.8354372Z [478/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T20:04:35.8368773Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:35.9585901Z [479/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T20:04:35.9600755Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:36.6514992Z [480/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T20:04:36.6534180Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:38.3346490Z [481/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/topology_utils.cpp 2025-05-07T20:04:38.3364940Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:39.9498027Z [482/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o 2025-05-07T20:04:39.9521830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9524981Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9526204Z ^ 2025-05-07T20:04:39.9526519Z 2025-05-07T20:04:39.9526982Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:39.9527680Z 2025-05-07T20:04:39.9529334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9531964Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9533204Z ^ 2025-05-07T20:04:39.9533571Z 2025-05-07T20:04:39.9535167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9537687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9538891Z ^ 2025-05-07T20:04:39.9539153Z 2025-05-07T20:04:39.9539775Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:39.9540417Z 2025-05-07T20:04:39.9542171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9544826Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9546307Z ^ 2025-05-07T20:04:39.9546697Z 2025-05-07T20:04:39.9548551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9551299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9552480Z ^ 2025-05-07T20:04:39.9552748Z 2025-05-07T20:04:39.9553358Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:39.9554017Z 2025-05-07T20:04:39.9555755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9558480Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9559628Z ^ 2025-05-07T20:04:39.9559973Z 2025-05-07T20:04:39.9561653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9564405Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9565654Z ^ 2025-05-07T20:04:39.9565919Z 2025-05-07T20:04:39.9566396Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:39.9567100Z 2025-05-07T20:04:39.9568764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9571385Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9572507Z ^ 2025-05-07T20:04:39.9572892Z 2025-05-07T20:04:39.9574260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9576984Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9578024Z ^ 2025-05-07T20:04:39.9578270Z 2025-05-07T20:04:39.9578761Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:39.9579377Z 2025-05-07T20:04:39.9581002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9583612Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9584831Z ^ 2025-05-07T20:04:39.9585185Z 2025-05-07T20:04:40.9525466Z [483/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T20:04:40.9544458Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:42.2432825Z [484/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_utils.cpp 2025-05-07T20:04:42.2450729Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:42.3182877Z [485/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T20:04:42.3201011Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:42.9923252Z [486/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T20:04:42.9941564Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:43.1642104Z [487/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops_host.cpp 2025-05-07T20:04:43.1660970Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:43.6898115Z [488/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T20:04:43.6917546Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:44.0888929Z [489/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o 2025-05-07T20:04:44.0912999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.0915846Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.0917035Z ^ 2025-05-07T20:04:44.0917298Z 2025-05-07T20:04:44.0917771Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.0918486Z 2025-05-07T20:04:44.0920205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.0923642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.0924899Z ^ 2025-05-07T20:04:44.0925286Z 2025-05-07T20:04:44.0927158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.0929919Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.0931156Z ^ 2025-05-07T20:04:44.0931409Z 2025-05-07T20:04:44.0931982Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.0932664Z 2025-05-07T20:04:44.0934471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.0937203Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.0938416Z ^ 2025-05-07T20:04:44.0938798Z 2025-05-07T20:04:44.0940690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.0943421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.0944620Z ^ 2025-05-07T20:04:44.0944862Z 2025-05-07T20:04:44.0945229Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.0945861Z 2025-05-07T20:04:44.0947498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.0950088Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.0951263Z ^ 2025-05-07T20:04:44.0951629Z 2025-05-07T20:04:44.0953129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.0955775Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.0956915Z ^ 2025-05-07T20:04:44.0957157Z 2025-05-07T20:04:44.0957603Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.0958290Z 2025-05-07T20:04:44.0959924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.0962398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.0963447Z ^ 2025-05-07T20:04:44.0963774Z 2025-05-07T20:04:44.0965383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.0968204Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.0969381Z ^ 2025-05-07T20:04:44.0969634Z 2025-05-07T20:04:44.0970109Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.0970762Z 2025-05-07T20:04:44.0972619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.0975466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.0976788Z ^ 2025-05-07T20:04:44.0977174Z 2025-05-07T20:04:44.2387230Z [490/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator.cpp 2025-05-07T20:04:44.2405294Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:44.7485792Z [491/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T20:04:44.7504324Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:45.0677306Z [492/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:04:45.0700846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.0703834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0705043Z ^ 2025-05-07T20:04:45.0705307Z 2025-05-07T20:04:45.0705751Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.0706607Z 2025-05-07T20:04:45.0708334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.0711110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0712382Z ^ 2025-05-07T20:04:45.0713883Z 2025-05-07T20:04:45.0715557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.0718120Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0719268Z ^ 2025-05-07T20:04:45.0719526Z 2025-05-07T20:04:45.0719952Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.0720659Z 2025-05-07T20:04:45.0722694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.0725502Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0726619Z ^ 2025-05-07T20:04:45.0726965Z 2025-05-07T20:04:45.0728555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.0730994Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0732038Z ^ 2025-05-07T20:04:45.0732292Z 2025-05-07T20:04:45.0732709Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.0733363Z 2025-05-07T20:04:45.0734954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.0737634Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0738765Z ^ 2025-05-07T20:04:45.0739053Z 2025-05-07T20:04:45.0740667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.0743232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0744311Z ^ 2025-05-07T20:04:45.0744559Z 2025-05-07T20:04:45.0745002Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.0745672Z 2025-05-07T20:04:45.0747203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.0749816Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0750961Z ^ 2025-05-07T20:04:45.0751284Z 2025-05-07T20:04:45.0752979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.0755911Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0757271Z ^ 2025-05-07T20:04:45.0757544Z 2025-05-07T20:04:45.0758094Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.0758737Z 2025-05-07T20:04:45.0760505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.0763100Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0764268Z ^ 2025-05-07T20:04:45.0764622Z 2025-05-07T20:04:45.3082308Z [493/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator.cpp 2025-05-07T20:04:45.3100403Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:45.9311308Z [494/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T20:04:45.9331718Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:46.2092965Z [495/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o 2025-05-07T20:04:46.2115828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.2118850Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.2119978Z ^ 2025-05-07T20:04:46.2120210Z 2025-05-07T20:04:46.2120668Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:46.2121299Z 2025-05-07T20:04:46.2123224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.2125767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.2126919Z ^ 2025-05-07T20:04:46.2127256Z 2025-05-07T20:04:46.2128897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.2131465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.2132579Z ^ 2025-05-07T20:04:46.2132834Z 2025-05-07T20:04:46.2133274Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:46.2133953Z 2025-05-07T20:04:46.2135603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.2138196Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.2139320Z ^ 2025-05-07T20:04:46.2139824Z 2025-05-07T20:04:46.2141162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:46.2142883Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:46.2143439Z ^ 2025-05-07T20:04:46.2143676Z 2025-05-07T20:04:46.2145251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.2147805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.2148964Z ^ 2025-05-07T20:04:46.2149202Z 2025-05-07T20:04:46.2149612Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:46.2150290Z 2025-05-07T20:04:46.2151823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.2154818Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.2155950Z ^ 2025-05-07T20:04:46.2156300Z 2025-05-07T20:04:46.2157786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:46.2159559Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:46.2160096Z ^ 2025-05-07T20:04:46.2160519Z 2025-05-07T20:04:46.2162235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.2164852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.2165969Z ^ 2025-05-07T20:04:46.2166192Z 2025-05-07T20:04:46.2166612Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:46.2167217Z 2025-05-07T20:04:46.2168815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.2171162Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.2172229Z ^ 2025-05-07T20:04:46.2172570Z 2025-05-07T20:04:46.2173923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:46.2175587Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:46.2176143Z ^ 2025-05-07T20:04:46.2176417Z 2025-05-07T20:04:46.2177955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.2180907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.2182060Z ^ 2025-05-07T20:04:46.2182350Z 2025-05-07T20:04:46.2182791Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:46.2183404Z 2025-05-07T20:04:46.2185073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.2187778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.2189006Z ^ 2025-05-07T20:04:46.2189359Z 2025-05-07T20:04:46.2190781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:46.2192596Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:46.2193187Z ^ 2025-05-07T20:04:46.2193453Z 2025-05-07T20:04:46.5236877Z [496/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T20:04:46.5255589Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:46.5686336Z [497/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T20:04:46.5704985Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:47.0741651Z [498/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T20:04:47.0760301Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:47.2817372Z [499/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o 2025-05-07T20:04:47.2839422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.2842006Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.2843119Z ^ 2025-05-07T20:04:47.2843351Z 2025-05-07T20:04:47.2843877Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:47.2844509Z 2025-05-07T20:04:47.2846074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.2848609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.2849775Z ^ 2025-05-07T20:04:47.2850087Z 2025-05-07T20:04:47.2851540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.2853995Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.2855070Z ^ 2025-05-07T20:04:47.2855339Z 2025-05-07T20:04:47.2855763Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:47.2856383Z 2025-05-07T20:04:47.2857897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.2860481Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.2861595Z ^ 2025-05-07T20:04:47.2861948Z 2025-05-07T20:04:47.2863254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:47.2864951Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:47.2865516Z ^ 2025-05-07T20:04:47.2865775Z 2025-05-07T20:04:47.2867324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.2870141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.2871264Z ^ 2025-05-07T20:04:47.2871503Z 2025-05-07T20:04:47.2871921Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:47.2872588Z 2025-05-07T20:04:47.2874342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.2876962Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.2878246Z ^ 2025-05-07T20:04:47.2878598Z 2025-05-07T20:04:47.2879985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:47.2881750Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:47.2882269Z ^ 2025-05-07T20:04:47.2882518Z 2025-05-07T20:04:47.2884043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.2886491Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.2887593Z ^ 2025-05-07T20:04:47.2887829Z 2025-05-07T20:04:47.2888223Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:47.2888864Z 2025-05-07T20:04:47.2890416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.2892983Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.2894045Z ^ 2025-05-07T20:04:47.2894444Z 2025-05-07T20:04:47.2895678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:47.2897240Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:47.2897732Z ^ 2025-05-07T20:04:47.2898011Z 2025-05-07T20:04:47.2899535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.2902133Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.2903203Z ^ 2025-05-07T20:04:47.2903449Z 2025-05-07T20:04:47.2903896Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:47.2904518Z 2025-05-07T20:04:47.2906028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.2908389Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.2909559Z ^ 2025-05-07T20:04:47.2909922Z 2025-05-07T20:04:47.2911205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:47.2913116Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:47.2913608Z ^ 2025-05-07T20:04:47.2913880Z 2025-05-07T20:04:47.8358368Z [500/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T20:04:47.8373707Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:48.5570244Z [501/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_forward.so -o fbgemm_gpu_tbe_training_forward.so CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_common.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T20:04:51.4564063Z [502/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T20:04:51.4582130Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:52.4269589Z [503/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o 2025-05-07T20:04:52.4291162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:52.4293728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:52.4294703Z ^ 2025-05-07T20:04:52.4294938Z 2025-05-07T20:04:52.4295349Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:52.4295957Z 2025-05-07T20:04:52.4297354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:52.4299969Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:52.4301068Z ^ 2025-05-07T20:04:52.4301408Z 2025-05-07T20:04:52.4302932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:52.4305761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:52.4306945Z ^ 2025-05-07T20:04:52.4307207Z 2025-05-07T20:04:52.4307763Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:52.4308443Z 2025-05-07T20:04:52.4310045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:52.4312752Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:52.4313815Z ^ 2025-05-07T20:04:52.4314194Z 2025-05-07T20:04:52.4315750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:52.4318351Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:52.4319488Z ^ 2025-05-07T20:04:52.4319747Z 2025-05-07T20:04:52.4320226Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:52.4320868Z 2025-05-07T20:04:52.4322713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:52.4325326Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:52.4326357Z ^ 2025-05-07T20:04:52.4326690Z 2025-05-07T20:04:52.4328131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:52.4330559Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:52.4331681Z ^ 2025-05-07T20:04:52.4331927Z 2025-05-07T20:04:52.4332339Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:52.4333001Z 2025-05-07T20:04:52.4334509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:52.4337059Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:52.4338021Z ^ 2025-05-07T20:04:52.4338334Z 2025-05-07T20:04:52.4340024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:52.4342367Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:52.4343508Z ^ 2025-05-07T20:04:52.4343764Z 2025-05-07T20:04:52.4344225Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:52.4344770Z 2025-05-07T20:04:52.4346269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:52.4348619Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:52.4349911Z ^ 2025-05-07T20:04:52.4350263Z 2025-05-07T20:04:54.8583185Z [504/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:04:54.8604267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8606627Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8607661Z ^ 2025-05-07T20:04:54.8607885Z 2025-05-07T20:04:54.8608210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.8608818Z 2025-05-07T20:04:54.8610300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8612701Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8614124Z ^ 2025-05-07T20:04:54.8614472Z 2025-05-07T20:04:54.8616108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8618610Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8619925Z ^ 2025-05-07T20:04:54.8620290Z 2025-05-07T20:04:54.8620700Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.8621324Z 2025-05-07T20:04:54.8623286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8625708Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8626797Z ^ 2025-05-07T20:04:54.8627104Z 2025-05-07T20:04:54.8628598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8631032Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8632139Z ^ 2025-05-07T20:04:54.8632398Z 2025-05-07T20:04:54.8632821Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.8633380Z 2025-05-07T20:04:54.8634895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8637296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8638402Z ^ 2025-05-07T20:04:54.8638738Z 2025-05-07T20:04:54.8640150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8642524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8643649Z ^ 2025-05-07T20:04:54.8643884Z 2025-05-07T20:04:54.8644314Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.8644922Z 2025-05-07T20:04:54.8646394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8648814Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8649846Z ^ 2025-05-07T20:04:54.8650174Z 2025-05-07T20:04:54.8651624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8654040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8655382Z ^ 2025-05-07T20:04:54.8655611Z 2025-05-07T20:04:54.8656023Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.8656642Z 2025-05-07T20:04:54.8658381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8660590Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8661752Z ^ 2025-05-07T20:04:54.8662119Z 2025-05-07T20:04:57.5135640Z [505/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:57.5160421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.5163348Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.5164562Z ^ 2025-05-07T20:04:57.5164800Z 2025-05-07T20:04:57.5165526Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:57.5166193Z 2025-05-07T20:04:57.5167864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.5170642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.5171810Z ^ 2025-05-07T20:04:57.5172180Z 2025-05-07T20:04:57.5173734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.5176627Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.5177856Z ^ 2025-05-07T20:04:57.5178159Z 2025-05-07T20:04:57.5178627Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:57.5179305Z 2025-05-07T20:04:57.5181112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.5183797Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.5185017Z ^ 2025-05-07T20:04:57.5185375Z 2025-05-07T20:04:57.5187088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.5189623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.5190793Z ^ 2025-05-07T20:04:57.5191062Z 2025-05-07T20:04:57.5191562Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:57.5192242Z 2025-05-07T20:04:57.5193991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.5196651Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.5197602Z ^ 2025-05-07T20:04:57.5197892Z 2025-05-07T20:04:57.5199218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.5201707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.5202796Z ^ 2025-05-07T20:04:57.5203059Z 2025-05-07T20:04:57.5203499Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:57.5204187Z 2025-05-07T20:04:57.5205831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.5208517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.5209792Z ^ 2025-05-07T20:04:57.5210375Z 2025-05-07T20:04:57.5212129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.5214747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.5215951Z ^ 2025-05-07T20:04:57.5216200Z 2025-05-07T20:04:57.5216650Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:57.5217335Z 2025-05-07T20:04:57.5220497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.5223319Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.5224583Z ^ 2025-05-07T20:04:57.5224987Z 2025-05-07T20:04:57.6599387Z [506/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T20:04:57.6618611Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:57.7753982Z [507/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:57.7778665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.7781718Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.7782890Z ^ 2025-05-07T20:04:57.7783173Z 2025-05-07T20:04:57.7783578Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:57.7784194Z 2025-05-07T20:04:57.7785804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.7788593Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.7789771Z ^ 2025-05-07T20:04:57.7790216Z 2025-05-07T20:04:57.7791879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.7794563Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.7795743Z ^ 2025-05-07T20:04:57.7795992Z 2025-05-07T20:04:57.7796417Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:57.7797082Z 2025-05-07T20:04:57.7798739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.7801607Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.7802837Z ^ 2025-05-07T20:04:57.7803246Z 2025-05-07T20:04:57.7805087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.7807764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.7809058Z ^ 2025-05-07T20:04:57.7809337Z 2025-05-07T20:04:57.7809884Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:57.7810564Z 2025-05-07T20:04:57.7812249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.7814858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.7815806Z ^ 2025-05-07T20:04:57.7816142Z 2025-05-07T20:04:57.7817603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.7820245Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.7821484Z ^ 2025-05-07T20:04:57.7821720Z 2025-05-07T20:04:57.7822375Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:57.7823056Z 2025-05-07T20:04:57.7824777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.7827502Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.7828666Z ^ 2025-05-07T20:04:57.7829055Z 2025-05-07T20:04:57.7830717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.7833423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.7834615Z ^ 2025-05-07T20:04:57.7834865Z 2025-05-07T20:04:57.7835313Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:57.7835980Z 2025-05-07T20:04:57.7837736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:57.7840509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:57.7841768Z ^ 2025-05-07T20:04:57.7842156Z 2025-05-07T20:05:02.0943219Z [508/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o 2025-05-07T20:05:02.0962910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.0965284Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.0966285Z ^ 2025-05-07T20:05:02.0966525Z 2025-05-07T20:05:02.0966906Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:02.0967456Z 2025-05-07T20:05:02.0968829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.0971193Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.0972165Z ^ 2025-05-07T20:05:02.0972470Z 2025-05-07T20:05:02.0973777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.0976099Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.0977142Z ^ 2025-05-07T20:05:02.0977357Z 2025-05-07T20:05:02.0977898Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:02.0978472Z 2025-05-07T20:05:02.0979979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.0982483Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.0983451Z ^ 2025-05-07T20:05:02.0983783Z 2025-05-07T20:05:02.0985167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.0987589Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.0988555Z ^ 2025-05-07T20:05:02.0988765Z 2025-05-07T20:05:02.0989156Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:02.0989696Z 2025-05-07T20:05:02.0991051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.0993459Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.0994448Z ^ 2025-05-07T20:05:02.0994745Z 2025-05-07T20:05:02.0996081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.0998407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.0999417Z ^ 2025-05-07T20:05:02.0999623Z 2025-05-07T20:05:02.0999986Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:02.1000558Z 2025-05-07T20:05:02.1001885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.1004283Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.1005236Z ^ 2025-05-07T20:05:02.1005538Z 2025-05-07T20:05:02.1006880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.1009120Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.1010198Z ^ 2025-05-07T20:05:02.1010406Z 2025-05-07T20:05:02.1010795Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:02.1011340Z 2025-05-07T20:05:02.1012677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.1015019Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.1016175Z ^ 2025-05-07T20:05:02.1016475Z 2025-05-07T20:05:07.5524079Z [509/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o 2025-05-07T20:05:07.5546632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:07.5549114Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:07.5550228Z ^ 2025-05-07T20:05:07.5550488Z 2025-05-07T20:05:07.5550933Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:07.5551617Z 2025-05-07T20:05:07.5553147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:07.5555628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:07.5556735Z ^ 2025-05-07T20:05:07.5557113Z 2025-05-07T20:05:07.5558622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:07.5561488Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:07.5562642Z ^ 2025-05-07T20:05:07.5562900Z 2025-05-07T20:05:07.5563511Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:07.5564161Z 2025-05-07T20:05:07.5565802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:07.5568567Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:07.5569740Z ^ 2025-05-07T20:05:07.5570074Z 2025-05-07T20:05:07.5571540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:07.5573949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:07.5575044Z ^ 2025-05-07T20:05:07.5575278Z 2025-05-07T20:05:07.5575701Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:07.5576371Z 2025-05-07T20:05:07.5577824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:07.5580456Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:07.5581556Z ^ 2025-05-07T20:05:07.5581885Z 2025-05-07T20:05:07.5583431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:07.5585874Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:07.5587046Z ^ 2025-05-07T20:05:07.5587282Z 2025-05-07T20:05:07.5587720Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:07.5588317Z 2025-05-07T20:05:07.5589867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:07.5592373Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:07.5593500Z ^ 2025-05-07T20:05:07.5593885Z 2025-05-07T20:05:07.5595365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:07.5597860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:07.5598915Z ^ 2025-05-07T20:05:07.5599189Z 2025-05-07T20:05:07.5599615Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:07.5600241Z 2025-05-07T20:05:07.5601772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:07.5604474Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:07.5605654Z ^ 2025-05-07T20:05:07.5606126Z 2025-05-07T20:05:12.2598809Z [510/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:05:12.2621610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.2624492Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.2625646Z ^ 2025-05-07T20:05:12.2625888Z 2025-05-07T20:05:12.2626317Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:12.2626989Z 2025-05-07T20:05:12.2628607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.2631214Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.2632597Z ^ 2025-05-07T20:05:12.2632977Z 2025-05-07T20:05:12.2634724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.2637288Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.2638453Z ^ 2025-05-07T20:05:12.2638712Z 2025-05-07T20:05:12.2639271Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:12.2639929Z 2025-05-07T20:05:12.2641701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.2644374Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.2645533Z ^ 2025-05-07T20:05:12.2645894Z 2025-05-07T20:05:12.2647573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.2650131Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.2651240Z ^ 2025-05-07T20:05:12.2651496Z 2025-05-07T20:05:12.2651951Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:12.2652630Z 2025-05-07T20:05:12.2654320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.2656985Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.2658160Z ^ 2025-05-07T20:05:12.2658524Z 2025-05-07T20:05:12.2660259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.2662537Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.2663633Z ^ 2025-05-07T20:05:12.2663898Z 2025-05-07T20:05:12.2664312Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:12.2664935Z 2025-05-07T20:05:12.2666443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.2668914Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.2670053Z ^ 2025-05-07T20:05:12.2670405Z 2025-05-07T20:05:12.2671971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.2674575Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.2675868Z ^ 2025-05-07T20:05:12.2676091Z 2025-05-07T20:05:12.2676503Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:12.2677178Z 2025-05-07T20:05:12.2678849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.2681393Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.2682505Z ^ 2025-05-07T20:05:12.2682938Z 2025-05-07T20:05:14.8814287Z [511/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_grad_index_select.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o 2025-05-07T20:05:14.8833103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.8835121Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.8836118Z ^ 2025-05-07T20:05:14.8836343Z 2025-05-07T20:05:14.8836721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.8837289Z 2025-05-07T20:05:14.8838545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.8841142Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.8842056Z ^ 2025-05-07T20:05:14.8842475Z 2025-05-07T20:05:14.8843823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.8846022Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.8846959Z ^ 2025-05-07T20:05:14.8847328Z 2025-05-07T20:05:14.8847703Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.8848252Z 2025-05-07T20:05:14.8849550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.8851755Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.8852755Z ^ 2025-05-07T20:05:14.8853086Z 2025-05-07T20:05:14.8854401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.8856622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.8857585Z ^ 2025-05-07T20:05:14.8857830Z 2025-05-07T20:05:14.8858202Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.8858818Z 2025-05-07T20:05:14.8860265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.8862475Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.8863422Z ^ 2025-05-07T20:05:14.8863718Z 2025-05-07T20:05:14.8864941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.8867089Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.8868097Z ^ 2025-05-07T20:05:14.8868289Z 2025-05-07T20:05:14.8868676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.8869261Z 2025-05-07T20:05:14.8870564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.8872553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.8873425Z ^ 2025-05-07T20:05:14.8873732Z 2025-05-07T20:05:14.8874934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.8877041Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.8877895Z ^ 2025-05-07T20:05:14.8878135Z 2025-05-07T20:05:14.8878590Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.8879089Z 2025-05-07T20:05:14.8880291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.8882540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.8883454Z ^ 2025-05-07T20:05:14.8883767Z 2025-05-07T20:05:16.8877678Z [512/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_forward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o 2025-05-07T20:05:16.8899794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:16.8902592Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:16.8903774Z ^ 2025-05-07T20:05:16.8904461Z 2025-05-07T20:05:16.8904936Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:16.8905604Z 2025-05-07T20:05:16.8907405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:16.8910070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:16.8911279Z ^ 2025-05-07T20:05:16.8911650Z 2025-05-07T20:05:16.8913543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:16.8916133Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:16.8917313Z ^ 2025-05-07T20:05:16.8917580Z 2025-05-07T20:05:16.8918018Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:16.8918698Z 2025-05-07T20:05:16.8920505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:16.8923357Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:16.8924474Z ^ 2025-05-07T20:05:16.8924897Z 2025-05-07T20:05:16.8926403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:16.8929003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:16.8930107Z ^ 2025-05-07T20:05:16.8930384Z 2025-05-07T20:05:16.8930837Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:16.8931489Z 2025-05-07T20:05:16.8933138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:16.8935803Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:16.8937042Z ^ 2025-05-07T20:05:16.8937424Z 2025-05-07T20:05:16.8939059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:16.8941884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:16.8943117Z ^ 2025-05-07T20:05:16.8943399Z 2025-05-07T20:05:16.8943848Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:16.8944413Z 2025-05-07T20:05:16.8946012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:16.8948638Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:16.8950040Z ^ 2025-05-07T20:05:16.8950427Z 2025-05-07T20:05:16.8952069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:16.8954888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:16.8956097Z ^ 2025-05-07T20:05:16.8956355Z 2025-05-07T20:05:16.8956816Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:16.8957650Z 2025-05-07T20:05:16.8959352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:16.8961881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:16.8963021Z ^ 2025-05-07T20:05:16.8963387Z 2025-05-07T20:05:20.3817082Z [513/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o 2025-05-07T20:05:20.3839666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.3842492Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.3843541Z ^ 2025-05-07T20:05:20.3843787Z 2025-05-07T20:05:20.3844401Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:20.3844984Z 2025-05-07T20:05:20.3846457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.3849299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.3850425Z ^ 2025-05-07T20:05:20.3850762Z 2025-05-07T20:05:20.3852211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.3854635Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.3855749Z ^ 2025-05-07T20:05:20.3855992Z 2025-05-07T20:05:20.3856448Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:20.3857064Z 2025-05-07T20:05:20.3858598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.3861295Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.3862417Z ^ 2025-05-07T20:05:20.3862766Z 2025-05-07T20:05:20.3864287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.3866836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.3867906Z ^ 2025-05-07T20:05:20.3868169Z 2025-05-07T20:05:20.3868554Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:20.3869165Z 2025-05-07T20:05:20.3870744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.3873276Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.3874441Z ^ 2025-05-07T20:05:20.3874815Z 2025-05-07T20:05:20.3876397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.3878808Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.3879899Z ^ 2025-05-07T20:05:20.3880136Z 2025-05-07T20:05:20.3880559Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:20.3881165Z 2025-05-07T20:05:20.3882576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.3885168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.3886396Z ^ 2025-05-07T20:05:20.3886766Z 2025-05-07T20:05:20.3888276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.3890929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.3892101Z ^ 2025-05-07T20:05:20.3892366Z 2025-05-07T20:05:20.3892779Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:20.3893428Z 2025-05-07T20:05:20.3894966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.3897488Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.3898682Z ^ 2025-05-07T20:05:20.3899035Z 2025-05-07T20:05:21.2888793Z [514/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_forward_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o 2025-05-07T20:05:21.2910808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.2913626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.2914785Z ^ 2025-05-07T20:05:21.2915049Z 2025-05-07T20:05:21.2915511Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:21.2916297Z 2025-05-07T20:05:21.2917976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.2920557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.2921691Z ^ 2025-05-07T20:05:21.2922280Z 2025-05-07T20:05:21.2923747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.2926210Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.2927315Z ^ 2025-05-07T20:05:21.2927593Z 2025-05-07T20:05:21.2928012Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:21.2928590Z 2025-05-07T20:05:21.2930221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.2932821Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.2933841Z ^ 2025-05-07T20:05:21.2934141Z 2025-05-07T20:05:21.2935587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.2937931Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.2939057Z ^ 2025-05-07T20:05:21.2939314Z 2025-05-07T20:05:21.2939927Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:21.2940524Z 2025-05-07T20:05:21.2941992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.2944535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.2945679Z ^ 2025-05-07T20:05:21.2946003Z 2025-05-07T20:05:21.2947468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.2949998Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.2951128Z ^ 2025-05-07T20:05:21.2951684Z 2025-05-07T20:05:21.2952122Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:21.2952754Z 2025-05-07T20:05:21.2954434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.2956927Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.2958119Z ^ 2025-05-07T20:05:21.2958475Z 2025-05-07T20:05:21.2960359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.2962832Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.2963978Z ^ 2025-05-07T20:05:21.2964225Z 2025-05-07T20:05:21.2964579Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:21.2965228Z 2025-05-07T20:05:21.2966785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.2969298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.2970458Z ^ 2025-05-07T20:05:21.2970857Z 2025-05-07T20:05:23.3260010Z [515/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T20:05:23.3278046Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:25.8158661Z [516/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o 2025-05-07T20:05:29.7071482Z [517/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o 2025-05-07T20:05:35.9514922Z [518/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_forward_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o 2025-05-07T20:05:35.9538791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:35.9541896Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:35.9543175Z ^ 2025-05-07T20:05:35.9543467Z 2025-05-07T20:05:35.9543944Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:35.9544667Z 2025-05-07T20:05:35.9546474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:35.9549624Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:35.9550814Z ^ 2025-05-07T20:05:35.9551251Z 2025-05-07T20:05:35.9552731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:35.9555264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:35.9556282Z ^ 2025-05-07T20:05:35.9556639Z 2025-05-07T20:05:35.9557112Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:35.9557794Z 2025-05-07T20:05:35.9559291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:35.9561814Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:35.9562904Z ^ 2025-05-07T20:05:35.9563298Z 2025-05-07T20:05:35.9564826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:35.9567489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:35.9568618Z ^ 2025-05-07T20:05:35.9568906Z 2025-05-07T20:05:35.9569362Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:35.9570017Z 2025-05-07T20:05:35.9571702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:35.9574387Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:35.9575647Z ^ 2025-05-07T20:05:35.9576035Z 2025-05-07T20:05:35.9577747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:35.9580521Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:35.9581720Z ^ 2025-05-07T20:05:35.9581979Z 2025-05-07T20:05:35.9582429Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:35.9583136Z 2025-05-07T20:05:35.9584836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:35.9587488Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:35.9588695Z ^ 2025-05-07T20:05:35.9589099Z 2025-05-07T20:05:35.9590724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:35.9593569Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:35.9594793Z ^ 2025-05-07T20:05:35.9595045Z 2025-05-07T20:05:35.9595607Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:35.9596300Z 2025-05-07T20:05:35.9597982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:35.9600800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:35.9602031Z ^ 2025-05-07T20:05:35.9602416Z 2025-05-07T20:05:37.5861164Z [519/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o 2025-05-07T20:05:47.1988592Z [520/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/histogram_binning_calibration_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o 2025-05-07T20:05:48.9661440Z [521/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_backward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o 2025-05-07T20:05:48.9684378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.9687098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.9688556Z ^ 2025-05-07T20:05:48.9688820Z 2025-05-07T20:05:48.9689296Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:48.9690125Z 2025-05-07T20:05:48.9691903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.9694718Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.9695962Z ^ 2025-05-07T20:05:48.9696334Z 2025-05-07T20:05:48.9698077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.9700935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.9702387Z ^ 2025-05-07T20:05:48.9702645Z 2025-05-07T20:05:48.9703115Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:48.9703812Z 2025-05-07T20:05:48.9705639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.9708453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.9709696Z ^ 2025-05-07T20:05:48.9710069Z 2025-05-07T20:05:48.9711786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.9714572Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.9715818Z ^ 2025-05-07T20:05:48.9716070Z 2025-05-07T20:05:48.9716531Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:48.9717242Z 2025-05-07T20:05:48.9718994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.9721820Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.9723308Z ^ 2025-05-07T20:05:48.9723681Z 2025-05-07T20:05:48.9725414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.9728419Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.9729652Z ^ 2025-05-07T20:05:48.9729905Z 2025-05-07T20:05:48.9730362Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:48.9731048Z 2025-05-07T20:05:48.9732902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.9735723Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.9736971Z ^ 2025-05-07T20:05:48.9737340Z 2025-05-07T20:05:48.9739162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.9742079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.9743284Z ^ 2025-05-07T20:05:48.9743556Z 2025-05-07T20:05:48.9744027Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:48.9744720Z 2025-05-07T20:05:48.9746444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.9749208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.9750657Z ^ 2025-05-07T20:05:48.9751029Z 2025-05-07T20:05:49.2224480Z [522/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o 2025-05-07T20:05:51.7024511Z [523/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_backward_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o 2025-05-07T20:05:51.7046219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.7048833Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.7049920Z ^ 2025-05-07T20:05:51.7050154Z 2025-05-07T20:05:51.7050603Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:51.7051229Z 2025-05-07T20:05:51.7052827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.7055176Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.7056357Z ^ 2025-05-07T20:05:51.7056692Z 2025-05-07T20:05:51.7058182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.7060775Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.7061823Z ^ 2025-05-07T20:05:51.7062086Z 2025-05-07T20:05:51.7062510Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:51.7063173Z 2025-05-07T20:05:51.7064903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.7067385Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.7068550Z ^ 2025-05-07T20:05:51.7068902Z 2025-05-07T20:05:51.7070534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.7073130Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.7074283Z ^ 2025-05-07T20:05:51.7074658Z 2025-05-07T20:05:51.7075098Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:51.7075757Z 2025-05-07T20:05:51.7077366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.7080010Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.7081167Z ^ 2025-05-07T20:05:51.7081541Z 2025-05-07T20:05:51.7083117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.7085724Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.7086802Z ^ 2025-05-07T20:05:51.7087026Z 2025-05-07T20:05:51.7087494Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:51.7088139Z 2025-05-07T20:05:51.7089760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.7092381Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.7093589Z ^ 2025-05-07T20:05:51.7093951Z 2025-05-07T20:05:51.7095519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.7098089Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.7099286Z ^ 2025-05-07T20:05:51.7099658Z 2025-05-07T20:05:51.7100046Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:51.7100652Z 2025-05-07T20:05:51.7102274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.7104881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.7106015Z ^ 2025-05-07T20:05:51.7106375Z 2025-05-07T20:05:56.5643509Z [524/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o 2025-05-07T20:05:57.0100676Z [525/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o 2025-05-07T20:05:59.2849803Z [526/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o 2025-05-07T20:05:59.2973039Z [527/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o 2025-05-07T20:05:59.3111122Z [528/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o 2025-05-07T20:05:59.6301653Z [529/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o 2025-05-07T20:06:00.1880779Z [530/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o 2025-05-07T20:06:00.5922508Z [531/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o 2025-05-07T20:06:00.9415444Z [532/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o 2025-05-07T20:06:01.8744686Z [533/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o 2025-05-07T20:06:02.2792248Z [534/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o 2025-05-07T20:06:02.2874415Z [535/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:06:02.2876504Z ################################################################################ 2025-05-07T20:06:02.2877182Z [CMAKE] Running post-build script ... 2025-05-07T20:06:02.2878089Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:06:02.2878996Z Removing all RPATHs ... 2025-05-07T20:06:02.2879678Z ################################################################################ 2025-05-07T20:06:02.2990909Z [536/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 1 2025-05-07T20:06:02.2993226Z ################################################################################ 2025-05-07T20:06:02.2993858Z [CMAKE] Running post-build script ... 2025-05-07T20:06:02.2994785Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:06:02.2995694Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:02.2996353Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:02.2997221Z ################################################################################ 2025-05-07T20:06:02.3715089Z [537/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:02.3717516Z ################################################################################ 2025-05-07T20:06:02.3718174Z [CMAKE] Running post-build script ... 2025-05-07T20:06:02.3719299Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:02.3720364Z Removing all RPATHs ... 2025-05-07T20:06:02.3720900Z ################################################################################ 2025-05-07T20:06:02.7991890Z [538/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o 2025-05-07T20:06:02.8008000Z In file included from tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:1: 2025-05-07T20:06:02.8009988Z /tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:46:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:02.8018118Z static void __device_stub__ZN10fbgemm_gpu28unique_indices_length_kernelIlLl9223372036854775807ELln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_S5_S5_(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par2, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArg(__par2, 32UL);__cudaSetupArg(__par3, 48UL);__cudaLaunch(((char *)((void ( *)(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::unique_indices_length_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:06:02.8026323Z ^ 2025-05-07T20:06:02.8028460Z /tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:46:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:02.8030916Z /tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:46:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:02.8033283Z /tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:52:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:02.8039975Z static void __device_stub__ZN10fbgemm_gpu24compute_hash_size_kernelIlLln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_lS5_(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const int64_t __par2, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArgSimple(__par2, 32UL);__cudaSetupArg(__par3, 40UL);__cudaLaunch(((char *)((void ( *)(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const int64_t, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::compute_hash_size_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:06:02.8046610Z ^ 2025-05-07T20:06:02.8048508Z /tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:52:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:02.8051007Z /tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:52:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:02.8053443Z /tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:55:445: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:02.8056031Z /tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:55:1476: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:02.8057484Z 8 warnings generated. 2025-05-07T20:06:02.8059092Z [539/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 1 2025-05-07T20:06:02.8061025Z ################################################################################ 2025-05-07T20:06:02.8061528Z [CMAKE] Running post-build script ... 2025-05-07T20:06:02.8062365Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:06:02.8063195Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:02.8063739Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:02.8064330Z ################################################################################ 2025-05-07T20:06:02.8066009Z [540/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:02.8067646Z ################################################################################ 2025-05-07T20:06:02.8068130Z [CMAKE] Running post-build script ... 2025-05-07T20:06:02.8068921Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:02.8069707Z Removing all RPATHs ... 2025-05-07T20:06:02.8070089Z ################################################################################ 2025-05-07T20:06:02.8112723Z [541/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 1 2025-05-07T20:06:02.8114631Z ################################################################################ 2025-05-07T20:06:02.8115101Z [CMAKE] Running post-build script ... 2025-05-07T20:06:02.8115942Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:06:02.8116864Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:02.8117402Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:02.8117980Z ################################################################################ 2025-05-07T20:06:02.8195213Z [542/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 1 2025-05-07T20:06:02.8197233Z ################################################################################ 2025-05-07T20:06:02.8197886Z [CMAKE] Running post-build script ... 2025-05-07T20:06:02.8198969Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:06:02.8200061Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:02.8200776Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:02.8201591Z ################################################################################ 2025-05-07T20:06:02.8864254Z [543/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:02.8865936Z ################################################################################ 2025-05-07T20:06:02.8866692Z [CMAKE] Running post-build script ... 2025-05-07T20:06:02.8867451Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:02.8868183Z Removing all RPATHs ... 2025-05-07T20:06:02.8868567Z ################################################################################ 2025-05-07T20:06:02.8878528Z [544/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:02.8880130Z ################################################################################ 2025-05-07T20:06:02.8880616Z [CMAKE] Running post-build script ... 2025-05-07T20:06:02.8881382Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:02.8882201Z Removing all RPATHs ... 2025-05-07T20:06:02.8882609Z ################################################################################ 2025-05-07T20:06:02.9045757Z [545/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 1 2025-05-07T20:06:02.9047811Z ################################################################################ 2025-05-07T20:06:02.9048445Z [CMAKE] Running post-build script ... 2025-05-07T20:06:02.9049596Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:06:02.9050870Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:02.9051612Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:02.9052377Z ################################################################################ 2025-05-07T20:06:02.9168974Z [546/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 1 2025-05-07T20:06:02.9172050Z ################################################################################ 2025-05-07T20:06:02.9172828Z [CMAKE] Running post-build script ... 2025-05-07T20:06:02.9174321Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:06:02.9175730Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:02.9176493Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:02.9177360Z ################################################################################ 2025-05-07T20:06:03.2068383Z [547/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 1 2025-05-07T20:06:03.2070209Z ################################################################################ 2025-05-07T20:06:03.2070737Z [CMAKE] Running post-build script ... 2025-05-07T20:06:03.2071542Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:06:03.2072396Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:03.2072908Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:03.2073518Z ################################################################################ 2025-05-07T20:06:05.1354266Z [548/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T20:06:05.1373387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:05.1375514Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:05.1376456Z ^ 2025-05-07T20:06:05.1376664Z 2025-05-07T20:06:05.1377049Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:05.1377588Z 2025-05-07T20:06:05.1379004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:05.1381267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:05.1382203Z ^ 2025-05-07T20:06:05.1382519Z 2025-05-07T20:06:05.1383863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:05.1385939Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:05.1386994Z ^ 2025-05-07T20:06:05.1387195Z 2025-05-07T20:06:05.1387550Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:05.1388076Z 2025-05-07T20:06:05.1389375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:05.1391432Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:05.1392383Z ^ 2025-05-07T20:06:05.1392667Z 2025-05-07T20:06:05.1393953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:05.1396017Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:05.1396952Z ^ 2025-05-07T20:06:05.1397158Z 2025-05-07T20:06:05.1397533Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:05.1398051Z 2025-05-07T20:06:05.1399338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:05.1401415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:05.1402346Z ^ 2025-05-07T20:06:05.1402657Z 2025-05-07T20:06:05.1403938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:05.1406028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:05.1407041Z ^ 2025-05-07T20:06:05.1407277Z 2025-05-07T20:06:05.1407636Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:05.1408159Z 2025-05-07T20:06:05.1409560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:05.1411630Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:05.1412589Z ^ 2025-05-07T20:06:05.1412881Z 2025-05-07T20:06:05.1414245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:05.1416301Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:05.1417229Z ^ 2025-05-07T20:06:05.1417433Z 2025-05-07T20:06:05.1417791Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:05.1418353Z 2025-05-07T20:06:05.1419776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:05.1421868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:05.1423185Z ^ 2025-05-07T20:06:05.1423503Z 2025-05-07T20:06:05.2726545Z [549/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o 2025-05-07T20:06:06.7832377Z [550/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o 2025-05-07T20:06:07.9140464Z [551/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o 2025-05-07T20:06:08.1498942Z [552/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.cu -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o 2025-05-07T20:06:08.7263129Z [553/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_embedding_inplace_ops.so -o fbgemm_gpu_embedding_inplace_ops.so CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:06:08.7322588Z [554/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:08.7325046Z ################################################################################ 2025-05-07T20:06:08.7325711Z [CMAKE] Running post-build script ... 2025-05-07T20:06:08.7326907Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:08.7328126Z Removing all RPATHs ... 2025-05-07T20:06:08.7328617Z ################################################################################ 2025-05-07T20:06:08.7675462Z [555/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o 2025-05-07T20:06:08.7696656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7698291Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:08.7698985Z ^ 2025-05-07T20:06:08.7699250Z 2025-05-07T20:06:08.7699779Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:08.7700717Z 2025-05-07T20:06:08.7701640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7703139Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:08.7703823Z ^ 2025-05-07T20:06:08.7704065Z 2025-05-07T20:06:08.7704966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7706465Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:08.7707179Z ^ 2025-05-07T20:06:08.7707392Z 2025-05-07T20:06:08.7708341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7709833Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:08.7710553Z ^ 2025-05-07T20:06:08.7710821Z 2025-05-07T20:06:08.7711754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7713292Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:08.7713997Z ^ 2025-05-07T20:06:08.7714235Z 2025-05-07T20:06:08.7714659Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:08.7715305Z 2025-05-07T20:06:08.7716233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7717692Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:08.7718371Z ^ 2025-05-07T20:06:08.7718594Z 2025-05-07T20:06:08.7719494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7720982Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:08.7721840Z ^ 2025-05-07T20:06:08.7722399Z 2025-05-07T20:06:08.7723237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7724645Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:08.7725545Z ^ 2025-05-07T20:06:08.7725767Z 2025-05-07T20:06:08.7726658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7728083Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:08.7728812Z ^ 2025-05-07T20:06:08.7729042Z 2025-05-07T20:06:08.7729498Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:08.7730297Z 2025-05-07T20:06:08.7731218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7732757Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:08.7733421Z ^ 2025-05-07T20:06:08.7733648Z 2025-05-07T20:06:08.7734512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7736081Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:08.7736878Z ^ 2025-05-07T20:06:08.7737126Z 2025-05-07T20:06:08.7738068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7739753Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:08.7740488Z ^ 2025-05-07T20:06:08.7740731Z 2025-05-07T20:06:08.7741652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7743141Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:08.7743871Z ^ 2025-05-07T20:06:08.7744112Z 2025-05-07T20:06:08.7744560Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:08.7745275Z 2025-05-07T20:06:08.7746194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7747652Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:08.7748350Z ^ 2025-05-07T20:06:08.7748581Z 2025-05-07T20:06:08.7749542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7751052Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:08.7751812Z ^ 2025-05-07T20:06:08.7752050Z 2025-05-07T20:06:08.7752974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7754547Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:08.7755300Z ^ 2025-05-07T20:06:08.7755543Z 2025-05-07T20:06:08.7756472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7757977Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:08.7758711Z ^ 2025-05-07T20:06:08.7758989Z 2025-05-07T20:06:08.7759431Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:08.7760069Z 2025-05-07T20:06:08.7760957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7762598Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:08.7763279Z ^ 2025-05-07T20:06:08.7763513Z 2025-05-07T20:06:08.7764503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7766051Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:08.7766690Z ^ 2025-05-07T20:06:08.7766885Z 2025-05-07T20:06:08.7767616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:08.7768810Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:08.7769517Z ^ 2025-05-07T20:06:08.7769741Z 2025-05-07T20:06:08.8397695Z [556/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o 2025-05-07T20:06:09.6592224Z [557/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o 2025-05-07T20:06:11.1954514Z [558/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o 2025-05-07T20:06:11.3027194Z [559/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:06:11.3039882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:11.3041264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:11.3041890Z ^ 2025-05-07T20:06:11.3042028Z 2025-05-07T20:06:11.3042284Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:11.3042633Z 2025-05-07T20:06:11.3043493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:11.3044890Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:11.3045601Z ^ 2025-05-07T20:06:11.3045796Z 2025-05-07T20:06:11.3046646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:11.3048932Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:11.3049552Z ^ 2025-05-07T20:06:11.3049703Z 2025-05-07T20:06:11.3049940Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:11.3050292Z 2025-05-07T20:06:11.3051203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:11.3052582Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:11.3065062Z ^ 2025-05-07T20:06:11.3065318Z 2025-05-07T20:06:11.3066195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:11.3067598Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:11.3068214Z ^ 2025-05-07T20:06:11.3068354Z 2025-05-07T20:06:11.3068613Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:11.3069093Z 2025-05-07T20:06:11.3069958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:11.3071364Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:11.3072002Z ^ 2025-05-07T20:06:11.3072199Z 2025-05-07T20:06:11.3073046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:11.3074437Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:11.3075063Z ^ 2025-05-07T20:06:11.3075201Z 2025-05-07T20:06:11.3075437Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:11.3075794Z 2025-05-07T20:06:11.3076665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:11.3078039Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:11.3078668Z ^ 2025-05-07T20:06:11.3078861Z 2025-05-07T20:06:11.3079718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:11.3081100Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:11.3081718Z ^ 2025-05-07T20:06:11.3081913Z 2025-05-07T20:06:11.3082159Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:11.3082505Z 2025-05-07T20:06:11.3083405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:11.3084794Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:11.3085422Z ^ 2025-05-07T20:06:11.3085611Z 2025-05-07T20:06:11.5252970Z [560/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o 2025-05-07T20:06:14.1347354Z [561/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_bfloat16.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o 2025-05-07T20:06:14.7103307Z [562/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o 2025-05-07T20:06:14.7114771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7115716Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:14.7116157Z ^ 2025-05-07T20:06:14.7116321Z 2025-05-07T20:06:14.7116650Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:14.7117010Z 2025-05-07T20:06:14.7117540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7118395Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:14.7118813Z ^ 2025-05-07T20:06:14.7118953Z 2025-05-07T20:06:14.7119551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7120435Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:14.7120853Z ^ 2025-05-07T20:06:14.7121016Z 2025-05-07T20:06:14.7121555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7122656Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:14.7123084Z ^ 2025-05-07T20:06:14.7123254Z 2025-05-07T20:06:14.7123918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7124824Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:14.7125382Z ^ 2025-05-07T20:06:14.7125521Z 2025-05-07T20:06:14.7126069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7126939Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:14.7127398Z ^ 2025-05-07T20:06:14.7127537Z 2025-05-07T20:06:14.7127784Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:14.7128166Z 2025-05-07T20:06:14.7128692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7129540Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:14.7129920Z ^ 2025-05-07T20:06:14.7130086Z 2025-05-07T20:06:14.7130620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7131463Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:14.7131904Z ^ 2025-05-07T20:06:14.7132038Z 2025-05-07T20:06:14.7132592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7133452Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:14.7133893Z ^ 2025-05-07T20:06:14.7134031Z 2025-05-07T20:06:14.7134559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7135482Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:14.7135966Z ^ 2025-05-07T20:06:14.7136108Z 2025-05-07T20:06:14.7136631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7137573Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:14.7137999Z ^ 2025-05-07T20:06:14.7138159Z 2025-05-07T20:06:14.7138406Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:14.7138764Z 2025-05-07T20:06:14.7139382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7140302Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:14.7140717Z ^ 2025-05-07T20:06:14.7140855Z 2025-05-07T20:06:14.7141381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7142305Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:14.7142740Z ^ 2025-05-07T20:06:14.7142874Z 2025-05-07T20:06:14.7143401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7144281Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:14.7144694Z ^ 2025-05-07T20:06:14.7144858Z 2025-05-07T20:06:14.7145381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7146292Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:14.7146747Z ^ 2025-05-07T20:06:14.7146912Z 2025-05-07T20:06:14.7147463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7148315Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:14.7148758Z ^ 2025-05-07T20:06:14.7148893Z 2025-05-07T20:06:14.7149162Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:14.7149518Z 2025-05-07T20:06:14.7150040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7150886Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:14.7151277Z ^ 2025-05-07T20:06:14.7151435Z 2025-05-07T20:06:14.7151967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7152846Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:14.7153255Z ^ 2025-05-07T20:06:14.7153394Z 2025-05-07T20:06:14.7153947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7154812Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:14.7155257Z ^ 2025-05-07T20:06:14.7155392Z 2025-05-07T20:06:14.7155941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7156833Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:14.7157315Z ^ 2025-05-07T20:06:14.7157448Z 2025-05-07T20:06:14.7157973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7158856Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:14.7159347Z ^ 2025-05-07T20:06:14.7159481Z 2025-05-07T20:06:14.7159725Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:14.7160078Z 2025-05-07T20:06:14.7160627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7161475Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:14.7161886Z ^ 2025-05-07T20:06:14.7162020Z 2025-05-07T20:06:14.7162572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7163419Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:14.7163861Z ^ 2025-05-07T20:06:14.7164118Z 2025-05-07T20:06:14.7164644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7165539Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:14.7165991Z ^ 2025-05-07T20:06:14.7166127Z 2025-05-07T20:06:14.7166662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:14.7167582Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:14.7168049Z ^ 2025-05-07T20:06:14.7168207Z 2025-05-07T20:06:16.8697937Z [563/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_backward_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o 2025-05-07T20:06:16.8710487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.8712011Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.8712679Z ^ 2025-05-07T20:06:16.8712880Z 2025-05-07T20:06:16.8713130Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:16.8713528Z 2025-05-07T20:06:16.8715093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.8716555Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.8717202Z ^ 2025-05-07T20:06:16.8717432Z 2025-05-07T20:06:16.8718299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.8719722Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.8720350Z ^ 2025-05-07T20:06:16.8720501Z 2025-05-07T20:06:16.8720830Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:16.8721198Z 2025-05-07T20:06:16.8722281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.8723718Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.8724392Z ^ 2025-05-07T20:06:16.8724596Z 2025-05-07T20:06:16.8725452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.8726854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.8727510Z ^ 2025-05-07T20:06:16.8727659Z 2025-05-07T20:06:16.8727910Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:16.8728272Z 2025-05-07T20:06:16.8729160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.8730552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.8731212Z ^ 2025-05-07T20:06:16.8731419Z 2025-05-07T20:06:16.8732300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.8733689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.8734426Z ^ 2025-05-07T20:06:16.8734574Z 2025-05-07T20:06:16.8734849Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:16.8735210Z 2025-05-07T20:06:16.8736124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.8737555Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.8738192Z ^ 2025-05-07T20:06:16.8738415Z 2025-05-07T20:06:16.8739318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.8740819Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.8741448Z ^ 2025-05-07T20:06:16.8741617Z 2025-05-07T20:06:16.8741866Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:16.8742225Z 2025-05-07T20:06:16.8743120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.8744529Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.8745238Z ^ 2025-05-07T20:06:16.8745445Z 2025-05-07T20:06:17.5102970Z [564/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_index_select.so -o fbgemm_gpu_tbe_index_select.so CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:06:17.5562495Z [565/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 1 2025-05-07T20:06:17.5564002Z ################################################################################ 2025-05-07T20:06:17.5564372Z [CMAKE] Running post-build script ... 2025-05-07T20:06:17.5565033Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:06:17.5565636Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:17.5566046Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:17.5566468Z ################################################################################ 2025-05-07T20:06:17.9762809Z [566/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o 2025-05-07T20:06:17.9775555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:17.9776998Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:17.9777632Z ^ 2025-05-07T20:06:17.9777823Z 2025-05-07T20:06:17.9778156Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:17.9778521Z 2025-05-07T20:06:17.9779416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:17.9780950Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:17.9781631Z ^ 2025-05-07T20:06:17.9781836Z 2025-05-07T20:06:17.9782732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:17.9784117Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:17.9784825Z ^ 2025-05-07T20:06:17.9784974Z 2025-05-07T20:06:17.9785224Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:17.9785620Z 2025-05-07T20:06:17.9786494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:17.9787917Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:17.9788562Z ^ 2025-05-07T20:06:17.9788791Z 2025-05-07T20:06:17.9789643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:17.9791048Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:17.9791702Z ^ 2025-05-07T20:06:17.9791852Z 2025-05-07T20:06:17.9792099Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:17.9792477Z 2025-05-07T20:06:17.9793339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:17.9794761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:17.9795403Z ^ 2025-05-07T20:06:17.9795606Z 2025-05-07T20:06:17.9796486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:17.9797933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:17.9798582Z ^ 2025-05-07T20:06:17.9798731Z 2025-05-07T20:06:17.9798999Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:17.9799355Z 2025-05-07T20:06:17.9800266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:17.9801678Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:17.9802334Z ^ 2025-05-07T20:06:17.9802538Z 2025-05-07T20:06:17.9803424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:17.9804832Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:17.9805455Z ^ 2025-05-07T20:06:17.9805629Z 2025-05-07T20:06:17.9805876Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:17.9806230Z 2025-05-07T20:06:17.9807108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:17.9808538Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:17.9809193Z ^ 2025-05-07T20:06:17.9809391Z 2025-05-07T20:06:19.1400679Z [567/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o 2025-05-07T20:06:19.1421269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:19.1423073Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:19.1423885Z ^ 2025-05-07T20:06:19.1424153Z 2025-05-07T20:06:19.1424822Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:19.1425502Z 2025-05-07T20:06:19.1426463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:19.1428031Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:19.1428810Z ^ 2025-05-07T20:06:19.1429061Z 2025-05-07T20:06:19.1430070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:19.1431663Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:19.1432448Z ^ 2025-05-07T20:06:19.1432705Z 2025-05-07T20:06:19.1433156Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:19.1434119Z 2025-05-07T20:06:19.1435076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:19.1436676Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:19.1437457Z ^ 2025-05-07T20:06:19.1437710Z 2025-05-07T20:06:19.1438665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:19.1440251Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:19.1441036Z ^ 2025-05-07T20:06:19.1441305Z 2025-05-07T20:06:19.1441721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:19.1442362Z 2025-05-07T20:06:19.1443338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:19.1444916Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:19.1445694Z ^ 2025-05-07T20:06:19.1445928Z 2025-05-07T20:06:19.1446883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:19.1448476Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:19.1449247Z ^ 2025-05-07T20:06:19.1449505Z 2025-05-07T20:06:19.1449948Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:19.1450564Z 2025-05-07T20:06:19.1451530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:19.1453044Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:19.1453858Z ^ 2025-05-07T20:06:19.1454103Z 2025-05-07T20:06:19.1455064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:19.1456863Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:19.1457605Z ^ 2025-05-07T20:06:19.1457833Z 2025-05-07T20:06:19.1458265Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:19.1458928Z 2025-05-07T20:06:19.1460171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:19.1461714Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:19.1462494Z ^ 2025-05-07T20:06:19.1462742Z 2025-05-07T20:06:22.6957536Z [568/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_mx.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o 2025-05-07T20:06:26.9933365Z [569/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o 2025-05-07T20:06:26.9951686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:26.9953066Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:26.9953735Z ^ 2025-05-07T20:06:26.9954153Z 2025-05-07T20:06:26.9954597Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:26.9955153Z 2025-05-07T20:06:26.9955949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:26.9957271Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:26.9957916Z ^ 2025-05-07T20:06:26.9958123Z 2025-05-07T20:06:26.9959000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:26.9960401Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:26.9961038Z ^ 2025-05-07T20:06:26.9961269Z 2025-05-07T20:06:26.9962123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:26.9963594Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:26.9964307Z ^ 2025-05-07T20:06:26.9964512Z 2025-05-07T20:06:26.9965389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:26.9966717Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:26.9967514Z ^ 2025-05-07T20:06:26.9967733Z 2025-05-07T20:06:26.9968585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:26.9969906Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:26.9970597Z ^ 2025-05-07T20:06:26.9970797Z 2025-05-07T20:06:26.9971225Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:26.9971800Z 2025-05-07T20:06:26.9972667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:26.9974087Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:26.9974718Z ^ 2025-05-07T20:06:26.9974942Z 2025-05-07T20:06:26.9975783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:26.9977255Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:26.9977948Z ^ 2025-05-07T20:06:26.9978148Z 2025-05-07T20:06:26.9979028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:26.9980518Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:26.9981224Z ^ 2025-05-07T20:06:26.9981548Z 2025-05-07T20:06:26.9982398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:26.9983816Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:26.9984495Z ^ 2025-05-07T20:06:26.9984682Z 2025-05-07T20:06:26.9985505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:26.9986866Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:26.9987507Z ^ 2025-05-07T20:06:26.9987708Z 2025-05-07T20:06:26.9988086Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:26.9988693Z 2025-05-07T20:06:26.9989651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:26.9990977Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:26.9991599Z ^ 2025-05-07T20:06:26.9991801Z 2025-05-07T20:06:26.9992672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:26.9993966Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:26.9994615Z ^ 2025-05-07T20:06:26.9994832Z 2025-05-07T20:06:26.9995640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:26.9997004Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:26.9997677Z ^ 2025-05-07T20:06:26.9997930Z 2025-05-07T20:06:26.9998760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:27.0000164Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:27.0000832Z ^ 2025-05-07T20:06:27.0001066Z 2025-05-07T20:06:27.0001891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:27.0003221Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:27.0003857Z ^ 2025-05-07T20:06:27.0004047Z 2025-05-07T20:06:27.0004420Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:27.0004966Z 2025-05-07T20:06:27.0005835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:27.0007160Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:27.0007756Z ^ 2025-05-07T20:06:27.0008108Z 2025-05-07T20:06:27.0008933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:27.0010288Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:27.0010920Z ^ 2025-05-07T20:06:27.0011144Z 2025-05-07T20:06:27.0012061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:27.0013425Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:27.0014113Z ^ 2025-05-07T20:06:27.0014315Z 2025-05-07T20:06:27.0015255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:27.0016615Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:27.0017340Z ^ 2025-05-07T20:06:27.0017535Z 2025-05-07T20:06:27.0018293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:27.0019675Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:27.0020453Z ^ 2025-05-07T20:06:27.0020694Z 2025-05-07T20:06:27.0021053Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:27.0021648Z 2025-05-07T20:06:27.0022698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:27.0024179Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:27.0024791Z ^ 2025-05-07T20:06:27.0024988Z 2025-05-07T20:06:27.0025853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:27.0027218Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:27.0027901Z ^ 2025-05-07T20:06:27.0028123Z 2025-05-07T20:06:27.0028961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:27.0030329Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:27.0031050Z ^ 2025-05-07T20:06:27.0031249Z 2025-05-07T20:06:27.0032110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:27.0033550Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:27.0034200Z ^ 2025-05-07T20:06:27.0034458Z 2025-05-07T20:06:27.1286510Z [570/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o 2025-05-07T20:06:28.7136681Z [571/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_hfp8.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o 2025-05-07T20:06:29.9009160Z [572/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o 2025-05-07T20:06:29.9030700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.9032350Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:29.9033159Z ^ 2025-05-07T20:06:29.9033430Z 2025-05-07T20:06:29.9033884Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.9034572Z 2025-05-07T20:06:29.9035490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.9037079Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:29.9037923Z ^ 2025-05-07T20:06:29.9038178Z 2025-05-07T20:06:29.9038643Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.9039346Z 2025-05-07T20:06:29.9040280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.9041892Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:29.9042678Z ^ 2025-05-07T20:06:29.9042953Z 2025-05-07T20:06:29.9043478Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.9044161Z 2025-05-07T20:06:29.9045052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.9046650Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:29.9047502Z ^ 2025-05-07T20:06:29.9047749Z 2025-05-07T20:06:29.9048199Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.9049146Z 2025-05-07T20:06:29.9050086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.9051690Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:29.9052583Z ^ 2025-05-07T20:06:29.9052828Z 2025-05-07T20:06:29.9053301Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.9053941Z 2025-05-07T20:06:36.7674541Z [573/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o 2025-05-07T20:06:40.4245416Z [574/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o 2025-05-07T20:06:40.4265069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.4267262Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.4268497Z ^ 2025-05-07T20:06:40.4268721Z 2025-05-07T20:06:40.4269115Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.4269776Z 2025-05-07T20:06:40.4271154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.4273389Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.4274320Z ^ 2025-05-07T20:06:40.4274635Z 2025-05-07T20:06:40.4275928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.4278280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.4279208Z ^ 2025-05-07T20:06:40.4279467Z 2025-05-07T20:06:40.4279891Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.4280446Z 2025-05-07T20:06:40.4281905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.4284170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.4285146Z ^ 2025-05-07T20:06:40.4285438Z 2025-05-07T20:06:40.4286768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.4289084Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.4290160Z ^ 2025-05-07T20:06:40.4290372Z 2025-05-07T20:06:40.4290815Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.4291412Z 2025-05-07T20:06:40.4292857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.4295062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.4296000Z ^ 2025-05-07T20:06:40.4296323Z 2025-05-07T20:06:40.4297804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.4300130Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.4301077Z ^ 2025-05-07T20:06:40.4301354Z 2025-05-07T20:06:40.4301777Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.4302371Z 2025-05-07T20:06:40.4303752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.4305863Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.4306925Z ^ 2025-05-07T20:06:40.4307242Z 2025-05-07T20:06:40.4308661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.4310978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.4311975Z ^ 2025-05-07T20:06:40.4312237Z 2025-05-07T20:06:40.4312607Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.4313145Z 2025-05-07T20:06:40.4314514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.4316656Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.4317656Z ^ 2025-05-07T20:06:40.4318044Z 2025-05-07T20:06:40.5614489Z [575/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o 2025-05-07T20:06:40.5633660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.5638601Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.5639781Z ^ 2025-05-07T20:06:40.5640080Z 2025-05-07T20:06:40.5640538Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.5641200Z 2025-05-07T20:06:40.5642836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.5645610Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.5646865Z ^ 2025-05-07T20:06:40.5647239Z 2025-05-07T20:06:40.5648965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.5651649Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.5652906Z ^ 2025-05-07T20:06:40.5653172Z 2025-05-07T20:06:40.5653643Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.5654365Z 2025-05-07T20:06:40.5656122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.5659025Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.5660342Z ^ 2025-05-07T20:06:40.5660752Z 2025-05-07T20:06:40.5661828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:40.5663556Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:40.5664172Z ^ 2025-05-07T20:06:40.5664513Z 2025-05-07T20:06:40.5666319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.5668923Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.5670138Z ^ 2025-05-07T20:06:40.5670403Z 2025-05-07T20:06:40.5670883Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.5671519Z 2025-05-07T20:06:40.5673369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.5676087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.5677277Z ^ 2025-05-07T20:06:40.5677649Z 2025-05-07T20:06:40.5678625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:40.5679954Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:40.5680570Z ^ 2025-05-07T20:06:40.5680869Z 2025-05-07T20:06:40.5682451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.5685311Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.5686534Z ^ 2025-05-07T20:06:40.5686826Z 2025-05-07T20:06:40.5687295Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.5687968Z 2025-05-07T20:06:40.5689695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.5692417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.5693683Z ^ 2025-05-07T20:06:40.5694056Z 2025-05-07T20:06:40.5695133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:40.5696643Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:40.5697273Z ^ 2025-05-07T20:06:40.5697580Z 2025-05-07T20:06:40.5699296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.5702119Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.5703340Z ^ 2025-05-07T20:06:40.5703595Z 2025-05-07T20:06:40.5704053Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.5704758Z 2025-05-07T20:06:40.5706494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.5709326Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.5710530Z ^ 2025-05-07T20:06:40.5710894Z 2025-05-07T20:06:40.5712053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:40.5713541Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:40.5714174Z ^ 2025-05-07T20:06:40.5714480Z 2025-05-07T20:06:43.9197174Z [576/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o 2025-05-07T20:06:43.9217977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.9220618Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.9221681Z ^ 2025-05-07T20:06:43.9222195Z 2025-05-07T20:06:43.9222638Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:43.9223260Z 2025-05-07T20:06:43.9224847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.9227359Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.9228783Z ^ 2025-05-07T20:06:43.9229134Z 2025-05-07T20:06:43.9230817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.9233381Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.9234518Z ^ 2025-05-07T20:06:43.9234763Z 2025-05-07T20:06:43.9235181Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:43.9235845Z 2025-05-07T20:06:43.9237580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.9240160Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.9241265Z ^ 2025-05-07T20:06:43.9241631Z 2025-05-07T20:06:43.9243209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.9245773Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.9247039Z ^ 2025-05-07T20:06:43.9247286Z 2025-05-07T20:06:43.9247738Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:43.9248366Z 2025-05-07T20:06:43.9249957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.9252417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.9253456Z ^ 2025-05-07T20:06:43.9253798Z 2025-05-07T20:06:43.9255337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.9257823Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.9258896Z ^ 2025-05-07T20:06:43.9259145Z 2025-05-07T20:06:43.9259566Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:43.9260288Z 2025-05-07T20:06:43.9261834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.9264401Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.9265517Z ^ 2025-05-07T20:06:43.9265843Z 2025-05-07T20:06:43.9267322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.9269718Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.9271006Z ^ 2025-05-07T20:06:43.9271261Z 2025-05-07T20:06:43.9271702Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:43.9272317Z 2025-05-07T20:06:43.9273922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.9276485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.9277602Z ^ 2025-05-07T20:06:43.9277961Z 2025-05-07T20:06:47.2434132Z [577/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o 2025-05-07T20:06:47.2457069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.2459891Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.2461157Z ^ 2025-05-07T20:06:47.2461424Z 2025-05-07T20:06:47.2461899Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:47.2462589Z 2025-05-07T20:06:47.2464303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.2467290Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.2468489Z ^ 2025-05-07T20:06:47.2468870Z 2025-05-07T20:06:47.2470659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.2473386Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.2474576Z ^ 2025-05-07T20:06:47.2474842Z 2025-05-07T20:06:47.2475367Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:47.2476047Z 2025-05-07T20:06:47.2477763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.2480482Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.2481703Z ^ 2025-05-07T20:06:47.2482073Z 2025-05-07T20:06:47.2483768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.2486550Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.2487728Z ^ 2025-05-07T20:06:47.2487990Z 2025-05-07T20:06:47.2488434Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:47.2489108Z 2025-05-07T20:06:47.2490808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.2493535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.2494730Z ^ 2025-05-07T20:06:47.2495104Z 2025-05-07T20:06:47.2496780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.2499508Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.2500737Z ^ 2025-05-07T20:06:47.2500994Z 2025-05-07T20:06:47.2501466Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:47.2502133Z 2025-05-07T20:06:47.2503841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.2506577Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.2507780Z ^ 2025-05-07T20:06:47.2508151Z 2025-05-07T20:06:47.2509833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.2512665Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.2513851Z ^ 2025-05-07T20:06:47.2514097Z 2025-05-07T20:06:47.2514615Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:47.2515298Z 2025-05-07T20:06:47.2516990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.2519732Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.2521012Z ^ 2025-05-07T20:06:47.2521374Z 2025-05-07T20:06:49.8840082Z [578/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o 2025-05-07T20:06:49.8862169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.8864686Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.8865765Z ^ 2025-05-07T20:06:49.8866007Z 2025-05-07T20:06:49.8866440Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:49.8867286Z 2025-05-07T20:06:49.8868932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.8871444Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.8872596Z ^ 2025-05-07T20:06:49.8872947Z 2025-05-07T20:06:49.8874492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.8877217Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.8878383Z ^ 2025-05-07T20:06:49.8878642Z 2025-05-07T20:06:49.8879103Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:49.8879764Z 2025-05-07T20:06:49.8881394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.8883970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.8885157Z ^ 2025-05-07T20:06:49.8885509Z 2025-05-07T20:06:49.8887171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.8889879Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.8891033Z ^ 2025-05-07T20:06:49.8891281Z 2025-05-07T20:06:49.8891716Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:49.8892379Z 2025-05-07T20:06:49.8893901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.8896471Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.8897584Z ^ 2025-05-07T20:06:49.8897933Z 2025-05-07T20:06:49.8899442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.8902083Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.8903193Z ^ 2025-05-07T20:06:49.8903431Z 2025-05-07T20:06:49.8903881Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:49.8904523Z 2025-05-07T20:06:49.8906160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.8908662Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.8909784Z ^ 2025-05-07T20:06:49.8910115Z 2025-05-07T20:06:49.8911623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.8914278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.8915465Z ^ 2025-05-07T20:06:49.8915706Z 2025-05-07T20:06:49.8916128Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:49.8916765Z 2025-05-07T20:06:49.8918409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.8921050Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.8922366Z ^ 2025-05-07T20:06:49.8922727Z 2025-05-07T20:06:57.0511792Z [579/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o 2025-05-07T20:06:57.0533930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0536532Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0537638Z ^ 2025-05-07T20:06:57.0538103Z 2025-05-07T20:06:57.0538545Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:57.0539177Z 2025-05-07T20:06:57.0541120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0543658Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0544788Z ^ 2025-05-07T20:06:57.0545133Z 2025-05-07T20:06:57.0546934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0549494Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0550641Z ^ 2025-05-07T20:06:57.0550890Z 2025-05-07T20:06:57.0551311Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:57.0551906Z 2025-05-07T20:06:57.0553460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0555995Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0557171Z ^ 2025-05-07T20:06:57.0557699Z 2025-05-07T20:06:57.0559241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0561780Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0562813Z ^ 2025-05-07T20:06:57.0563054Z 2025-05-07T20:06:57.0563453Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:57.0564052Z 2025-05-07T20:06:57.0565557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0568033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0569118Z ^ 2025-05-07T20:06:57.0569443Z 2025-05-07T20:06:57.0570930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0573458Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0574661Z ^ 2025-05-07T20:06:57.0574902Z 2025-05-07T20:06:57.0575324Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:57.0576048Z 2025-05-07T20:06:57.0577616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0580260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0581538Z ^ 2025-05-07T20:06:57.0581900Z 2025-05-07T20:06:57.0583483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0586208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0587301Z ^ 2025-05-07T20:06:57.0587560Z 2025-05-07T20:06:57.0587983Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:57.0588591Z 2025-05-07T20:06:57.0590288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0592877Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0594074Z ^ 2025-05-07T20:06:57.0594436Z 2025-05-07T20:07:00.2298123Z [580/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_group_index.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o 2025-05-07T20:07:00.2318297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.2320774Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.2322325Z ^ 2025-05-07T20:07:00.2322572Z 2025-05-07T20:07:00.2323013Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.2323553Z 2025-05-07T20:07:00.2324990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.2327489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.2328609Z ^ 2025-05-07T20:07:00.2328994Z 2025-05-07T20:07:00.2330603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.2332997Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.2334103Z ^ 2025-05-07T20:07:00.2334359Z 2025-05-07T20:07:00.2334792Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.2335446Z 2025-05-07T20:07:00.2337017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.2339498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.2340654Z ^ 2025-05-07T20:07:00.2340918Z 2025-05-07T20:07:00.2342252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.2344497Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.2345640Z ^ 2025-05-07T20:07:00.2345887Z 2025-05-07T20:07:00.2346337Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.2346939Z 2025-05-07T20:07:00.2348492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.2350737Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.2351596Z ^ 2025-05-07T20:07:00.2351929Z 2025-05-07T20:07:00.2353384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.2355739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.2356836Z ^ 2025-05-07T20:07:00.2357068Z 2025-05-07T20:07:00.2357427Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.2358033Z 2025-05-07T20:07:00.2359520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.2361981Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.2363253Z ^ 2025-05-07T20:07:00.2363581Z 2025-05-07T20:07:00.2365217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.2367639Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.2368660Z ^ 2025-05-07T20:07:00.2368904Z 2025-05-07T20:07:00.2369332Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.2369896Z 2025-05-07T20:07:00.2371519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.2374012Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.2375142Z ^ 2025-05-07T20:07:00.2375535Z 2025-05-07T20:07:01.1681839Z [581/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_add.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o 2025-05-07T20:07:01.1702653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1707155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1708277Z ^ 2025-05-07T20:07:01.1708514Z 2025-05-07T20:07:01.1708938Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.1709587Z 2025-05-07T20:07:01.1711385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1714016Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1715213Z ^ 2025-05-07T20:07:01.1715579Z 2025-05-07T20:07:01.1717286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1719975Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1721139Z ^ 2025-05-07T20:07:01.1721388Z 2025-05-07T20:07:01.1721831Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.1722691Z 2025-05-07T20:07:01.1724317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1727100Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1728294Z ^ 2025-05-07T20:07:01.1728695Z 2025-05-07T20:07:01.1730355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1733010Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1734142Z ^ 2025-05-07T20:07:01.1734424Z 2025-05-07T20:07:01.1734887Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.1735581Z 2025-05-07T20:07:01.1737335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1740001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1741187Z ^ 2025-05-07T20:07:01.1741549Z 2025-05-07T20:07:01.1743170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1745703Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1746847Z ^ 2025-05-07T20:07:01.1747092Z 2025-05-07T20:07:01.1747537Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.1748224Z 2025-05-07T20:07:01.1749833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1752524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1753670Z ^ 2025-05-07T20:07:01.1754053Z 2025-05-07T20:07:01.1755770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1758322Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1759432Z ^ 2025-05-07T20:07:01.1759689Z 2025-05-07T20:07:01.1760241Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.1760895Z 2025-05-07T20:07:01.1762499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1765071Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1766306Z ^ 2025-05-07T20:07:01.1766677Z 2025-05-07T20:07:01.6114992Z [582/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_select.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o 2025-05-07T20:07:01.6134494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.6137232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.6138323Z ^ 2025-05-07T20:07:01.6138590Z 2025-05-07T20:07:01.6139146Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.6139832Z 2025-05-07T20:07:01.6141362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.6143972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.6145053Z ^ 2025-05-07T20:07:01.6145378Z 2025-05-07T20:07:01.6146861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.6149113Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.6150172Z ^ 2025-05-07T20:07:01.6150390Z 2025-05-07T20:07:01.6150793Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.6151396Z 2025-05-07T20:07:01.6152786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.6155259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.6156270Z ^ 2025-05-07T20:07:01.6156605Z 2025-05-07T20:07:01.6157844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.6159831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.6160842Z ^ 2025-05-07T20:07:01.6161068Z 2025-05-07T20:07:01.6161378Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.6161911Z 2025-05-07T20:07:01.6163383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.6165316Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.6166237Z ^ 2025-05-07T20:07:01.6166554Z 2025-05-07T20:07:01.6168002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.6170348Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.6171371Z ^ 2025-05-07T20:07:01.6171626Z 2025-05-07T20:07:01.6172008Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.6172572Z 2025-05-07T20:07:01.6173930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.6176385Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.6177510Z ^ 2025-05-07T20:07:01.6177874Z 2025-05-07T20:07:01.6179354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.6181638Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.6182639Z ^ 2025-05-07T20:07:01.6182875Z 2025-05-07T20:07:01.6183306Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.6183902Z 2025-05-07T20:07:01.6185364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.6187662Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.6188748Z ^ 2025-05-07T20:07:01.6189083Z 2025-05-07T20:07:02.6557907Z [583/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_invert_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o 2025-05-07T20:07:02.6574586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.6577573Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.6578884Z ^ 2025-05-07T20:07:02.6579151Z 2025-05-07T20:07:02.6579718Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.6580428Z 2025-05-07T20:07:02.6582228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.6585127Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.6586323Z ^ 2025-05-07T20:07:02.6586738Z 2025-05-07T20:07:02.6588435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.6591233Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.6592458Z ^ 2025-05-07T20:07:02.6592727Z 2025-05-07T20:07:02.6593222Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.6593880Z 2025-05-07T20:07:02.6595590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.6598567Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.6599850Z ^ 2025-05-07T20:07:02.6600266Z 2025-05-07T20:07:02.6602112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.6604902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.6606115Z ^ 2025-05-07T20:07:02.6606392Z 2025-05-07T20:07:02.6606883Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.6607648Z 2025-05-07T20:07:02.6609537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.6611745Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.6612773Z ^ 2025-05-07T20:07:02.6613107Z 2025-05-07T20:07:02.6614289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.6616373Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.6617396Z ^ 2025-05-07T20:07:02.6617618Z 2025-05-07T20:07:02.6618057Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.6618680Z 2025-05-07T20:07:02.6620008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.6623057Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.6624231Z ^ 2025-05-07T20:07:02.6624595Z 2025-05-07T20:07:02.6626045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.6628599Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.6629572Z ^ 2025-05-07T20:07:02.6629825Z 2025-05-07T20:07:02.6630230Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.6630829Z 2025-05-07T20:07:02.6632435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.6634947Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.6636054Z ^ 2025-05-07T20:07:02.6636407Z 2025-05-07T20:07:09.4663450Z [584/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o 2025-05-07T20:07:09.4675070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.4676535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.4677188Z ^ 2025-05-07T20:07:09.4677339Z 2025-05-07T20:07:09.4677592Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.4677979Z 2025-05-07T20:07:09.4678913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.4680350Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.4680988Z ^ 2025-05-07T20:07:09.4681213Z 2025-05-07T20:07:09.4682066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.4683472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.4684093Z ^ 2025-05-07T20:07:09.4684261Z 2025-05-07T20:07:09.4684509Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.4684902Z 2025-05-07T20:07:09.4685785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.4687179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.4687832Z ^ 2025-05-07T20:07:09.4688038Z 2025-05-07T20:07:09.4688884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.4690282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.4690920Z ^ 2025-05-07T20:07:09.4691071Z 2025-05-07T20:07:09.4691314Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.4691693Z 2025-05-07T20:07:09.4692558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.4693970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.4694600Z ^ 2025-05-07T20:07:09.4694818Z 2025-05-07T20:07:09.4695672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.4697068Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.4697687Z ^ 2025-05-07T20:07:09.4697870Z 2025-05-07T20:07:09.4698135Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.4698489Z 2025-05-07T20:07:09.4699394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.4700887Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.4701536Z ^ 2025-05-07T20:07:09.4701737Z 2025-05-07T20:07:09.4702625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.4704035Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.4704685Z ^ 2025-05-07T20:07:09.4704828Z 2025-05-07T20:07:09.4705071Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.4705447Z 2025-05-07T20:07:09.4706307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.4707726Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.4708361Z ^ 2025-05-07T20:07:09.4708592Z 2025-05-07T20:07:09.6291864Z [585/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute102.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o 2025-05-07T20:07:09.6304934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.6306409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.6307045Z ^ 2025-05-07T20:07:09.6307220Z 2025-05-07T20:07:09.6307509Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.6307895Z 2025-05-07T20:07:09.6308850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.6310283Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.6310933Z ^ 2025-05-07T20:07:09.6311137Z 2025-05-07T20:07:09.6312027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.6313424Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.6314126Z ^ 2025-05-07T20:07:09.6314271Z 2025-05-07T20:07:09.6314539Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.6314903Z 2025-05-07T20:07:09.6315774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.6317194Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.6317857Z ^ 2025-05-07T20:07:09.6318057Z 2025-05-07T20:07:09.6328215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.6329790Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.6330454Z ^ 2025-05-07T20:07:09.6330607Z 2025-05-07T20:07:09.6330872Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.6331257Z 2025-05-07T20:07:09.6332132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.6333558Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.6334202Z ^ 2025-05-07T20:07:09.6334428Z 2025-05-07T20:07:09.6335282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.6336705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.6337464Z ^ 2025-05-07T20:07:09.6337635Z 2025-05-07T20:07:09.6337882Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.6338248Z 2025-05-07T20:07:09.6339203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.6340695Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.6341361Z ^ 2025-05-07T20:07:09.6341566Z 2025-05-07T20:07:09.6342482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.6343887Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.6344541Z ^ 2025-05-07T20:07:09.6344691Z 2025-05-07T20:07:09.6344946Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.6345331Z 2025-05-07T20:07:09.6346200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.6347616Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.6348315Z ^ 2025-05-07T20:07:09.6348537Z 2025-05-07T20:07:10.8988455Z [586/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o 2025-05-07T20:07:10.8999989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.9001400Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.9002060Z ^ 2025-05-07T20:07:10.9002212Z 2025-05-07T20:07:10.9002555Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:10.9002916Z 2025-05-07T20:07:10.9003785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.9005210Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.9005884Z ^ 2025-05-07T20:07:10.9006086Z 2025-05-07T20:07:10.9006941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.9008347Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.9009024Z ^ 2025-05-07T20:07:10.9009198Z 2025-05-07T20:07:10.9009450Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:10.9009807Z 2025-05-07T20:07:10.9010702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.9012158Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.9012802Z ^ 2025-05-07T20:07:10.9013003Z 2025-05-07T20:07:10.9013890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.9015271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.9015921Z ^ 2025-05-07T20:07:10.9016067Z 2025-05-07T20:07:10.9016343Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:10.9016700Z 2025-05-07T20:07:10.9017572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.9018994Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.9019734Z ^ 2025-05-07T20:07:10.9019944Z 2025-05-07T20:07:10.9020796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.9022417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.9023041Z ^ 2025-05-07T20:07:10.9023216Z 2025-05-07T20:07:10.9023462Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:10.9023868Z 2025-05-07T20:07:10.9024764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.9026164Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.9026864Z ^ 2025-05-07T20:07:10.9027063Z 2025-05-07T20:07:10.9027940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.9029327Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.9029974Z ^ 2025-05-07T20:07:10.9030117Z 2025-05-07T20:07:10.9030362Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:10.9030746Z 2025-05-07T20:07:10.9031612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.9033092Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.9033730Z ^ 2025-05-07T20:07:10.9033955Z 2025-05-07T20:07:11.1858986Z [587/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_range.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o 2025-05-07T20:07:11.1870215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.1871605Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.1872300Z ^ 2025-05-07T20:07:11.1872443Z 2025-05-07T20:07:11.1872704Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.1873091Z 2025-05-07T20:07:11.1873963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.1875356Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.1875991Z ^ 2025-05-07T20:07:11.1876185Z 2025-05-07T20:07:11.1877032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.1878484Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.1879111Z ^ 2025-05-07T20:07:11.1879249Z 2025-05-07T20:07:11.1879486Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.1879850Z 2025-05-07T20:07:11.1880704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.1882097Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.1882710Z ^ 2025-05-07T20:07:11.1882916Z 2025-05-07T20:07:11.1883769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.1885155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.1885761Z ^ 2025-05-07T20:07:11.1885898Z 2025-05-07T20:07:11.1886144Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.1886492Z 2025-05-07T20:07:11.1887346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.1888741Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.1889369Z ^ 2025-05-07T20:07:11.1889558Z 2025-05-07T20:07:11.1890401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.1891814Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.1892437Z ^ 2025-05-07T20:07:11.1892571Z 2025-05-07T20:07:11.1892838Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.1893183Z 2025-05-07T20:07:11.1894046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.1895452Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.1896087Z ^ 2025-05-07T20:07:11.1896280Z 2025-05-07T20:07:11.1897133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.1898501Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.1899118Z ^ 2025-05-07T20:07:11.1899253Z 2025-05-07T20:07:11.1899501Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.1899916Z 2025-05-07T20:07:11.1900770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.1902189Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.1902804Z ^ 2025-05-07T20:07:11.1903008Z 2025-05-07T20:07:11.9956540Z [588/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_zipf.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o 2025-05-07T20:07:11.9967605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9969057Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9969684Z ^ 2025-05-07T20:07:11.9969823Z 2025-05-07T20:07:11.9970078Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.9970432Z 2025-05-07T20:07:11.9971293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9972686Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9973324Z ^ 2025-05-07T20:07:11.9973519Z 2025-05-07T20:07:11.9974364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9977078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9977690Z ^ 2025-05-07T20:07:11.9977844Z 2025-05-07T20:07:11.9978082Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.9978429Z 2025-05-07T20:07:11.9979315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9980779Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9981418Z ^ 2025-05-07T20:07:11.9981617Z 2025-05-07T20:07:11.9982482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9983840Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9984463Z ^ 2025-05-07T20:07:11.9984599Z 2025-05-07T20:07:11.9984837Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.9985198Z 2025-05-07T20:07:11.9986050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9987431Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9988049Z ^ 2025-05-07T20:07:11.9988253Z 2025-05-07T20:07:11.9989094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9990515Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9991157Z ^ 2025-05-07T20:07:11.9991311Z 2025-05-07T20:07:11.9991549Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.9991897Z 2025-05-07T20:07:11.9992768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9994197Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9994831Z ^ 2025-05-07T20:07:11.9995023Z 2025-05-07T20:07:11.9995868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9997245Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9997866Z ^ 2025-05-07T20:07:11.9998001Z 2025-05-07T20:07:11.9998238Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.9998597Z 2025-05-07T20:07:11.9999451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:12.0000875Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:12.0001497Z ^ 2025-05-07T20:07:12.0001698Z 2025-05-07T20:07:12.5571896Z [589/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o 2025-05-07T20:07:13.6717803Z [590/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o 2025-05-07T20:07:13.6729470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.6730867Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.6731482Z ^ 2025-05-07T20:07:13.6731634Z 2025-05-07T20:07:13.6731875Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.6732224Z 2025-05-07T20:07:13.6733102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.6734572Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.6735203Z ^ 2025-05-07T20:07:13.6735397Z 2025-05-07T20:07:13.6736313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.6737682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.6738300Z ^ 2025-05-07T20:07:13.6738436Z 2025-05-07T20:07:13.6738675Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.6739075Z 2025-05-07T20:07:13.6740007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.6741393Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.6742011Z ^ 2025-05-07T20:07:13.6742219Z 2025-05-07T20:07:13.6743065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.6744437Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.6745083Z ^ 2025-05-07T20:07:13.6745235Z 2025-05-07T20:07:13.6745471Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.6745819Z 2025-05-07T20:07:13.6746685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.6748065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.6748699Z ^ 2025-05-07T20:07:13.6748891Z 2025-05-07T20:07:13.6749738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.6751128Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.6751745Z ^ 2025-05-07T20:07:13.6751879Z 2025-05-07T20:07:13.6752117Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.6752475Z 2025-05-07T20:07:13.6753330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.6754715Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.6755331Z ^ 2025-05-07T20:07:13.6755534Z 2025-05-07T20:07:13.6756372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.6757745Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.6758378Z ^ 2025-05-07T20:07:13.6758516Z 2025-05-07T20:07:13.6758761Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.6759125Z 2025-05-07T20:07:13.6760002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.6761390Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.6762004Z ^ 2025-05-07T20:07:13.6762205Z 2025-05-07T20:07:14.0014701Z [591/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o 2025-05-07T20:07:14.0026448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.0027846Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.0028460Z ^ 2025-05-07T20:07:14.0028615Z 2025-05-07T20:07:14.0028887Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.0029245Z 2025-05-07T20:07:14.0030118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.0031586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.0032223Z ^ 2025-05-07T20:07:14.0032474Z 2025-05-07T20:07:14.0033328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.0034693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.0035322Z ^ 2025-05-07T20:07:14.0035495Z 2025-05-07T20:07:14.0035747Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.0036096Z 2025-05-07T20:07:14.0036957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.0038350Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.0038965Z ^ 2025-05-07T20:07:14.0039171Z 2025-05-07T20:07:14.0040013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.0041456Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.0042053Z ^ 2025-05-07T20:07:14.0042203Z 2025-05-07T20:07:14.0042438Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.0042783Z 2025-05-07T20:07:14.0043648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.0045017Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.0045642Z ^ 2025-05-07T20:07:14.0045830Z 2025-05-07T20:07:14.0046684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.0048037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.0048654Z ^ 2025-05-07T20:07:14.0048787Z 2025-05-07T20:07:14.0049021Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.0049380Z 2025-05-07T20:07:14.0050230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.0051612Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.0052223Z ^ 2025-05-07T20:07:14.0052428Z 2025-05-07T20:07:14.0053269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.0054681Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.0055282Z ^ 2025-05-07T20:07:14.0055432Z 2025-05-07T20:07:14.0055692Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.0056038Z 2025-05-07T20:07:14.0056891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.0058307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.0058937Z ^ 2025-05-07T20:07:14.0059126Z 2025-05-07T20:07:14.3252046Z [592/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_1d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o 2025-05-07T20:07:14.3263704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.3265123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.3265755Z ^ 2025-05-07T20:07:14.3265926Z 2025-05-07T20:07:14.3266181Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.3266540Z 2025-05-07T20:07:14.3267503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.3268986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.3269645Z ^ 2025-05-07T20:07:14.3269846Z 2025-05-07T20:07:14.3270702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.3272200Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.3272948Z ^ 2025-05-07T20:07:14.3273090Z 2025-05-07T20:07:14.3273332Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.3273698Z 2025-05-07T20:07:14.3274540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.3275928Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.3276554Z ^ 2025-05-07T20:07:14.3276774Z 2025-05-07T20:07:14.3277610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.3279006Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.3279609Z ^ 2025-05-07T20:07:14.3279772Z 2025-05-07T20:07:14.3280007Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.3280351Z 2025-05-07T20:07:14.3281185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.3282564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.3283204Z ^ 2025-05-07T20:07:14.3283404Z 2025-05-07T20:07:14.3284233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.3285613Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.3286246Z ^ 2025-05-07T20:07:14.3286386Z 2025-05-07T20:07:14.3286627Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.3286993Z 2025-05-07T20:07:14.3287835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.3289215Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.3289834Z ^ 2025-05-07T20:07:14.3290026Z 2025-05-07T20:07:14.3290872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.3292248Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.3292904Z ^ 2025-05-07T20:07:14.3293043Z 2025-05-07T20:07:14.3293300Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.3293644Z 2025-05-07T20:07:14.3294490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.3296118Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.3296791Z ^ 2025-05-07T20:07:14.3296991Z 2025-05-07T20:07:15.5233175Z [593/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o 2025-05-07T20:07:15.5244460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5245820Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5246420Z ^ 2025-05-07T20:07:15.5246570Z 2025-05-07T20:07:15.5246915Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.5247260Z 2025-05-07T20:07:15.5248119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5249528Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5250149Z ^ 2025-05-07T20:07:15.5250337Z 2025-05-07T20:07:15.5251232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5252558Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5253162Z ^ 2025-05-07T20:07:15.5253296Z 2025-05-07T20:07:15.5253527Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.5253878Z 2025-05-07T20:07:15.5254714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5256061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5256662Z ^ 2025-05-07T20:07:15.5256888Z 2025-05-07T20:07:15.5257718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5259057Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5259709Z ^ 2025-05-07T20:07:15.5259855Z 2025-05-07T20:07:15.5260246Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.5260593Z 2025-05-07T20:07:15.5261462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5262833Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5263472Z ^ 2025-05-07T20:07:15.5263660Z 2025-05-07T20:07:15.5264503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5265884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5266501Z ^ 2025-05-07T20:07:15.5266636Z 2025-05-07T20:07:15.5266868Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.5267230Z 2025-05-07T20:07:15.5268078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5269473Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5270124Z ^ 2025-05-07T20:07:15.5270326Z 2025-05-07T20:07:15.5271169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5272668Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5273260Z ^ 2025-05-07T20:07:15.5273389Z 2025-05-07T20:07:15.5273629Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.5273966Z 2025-05-07T20:07:15.5274826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5276184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5276792Z ^ 2025-05-07T20:07:15.5276976Z 2025-05-07T20:07:18.7054559Z [594/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_2d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o 2025-05-07T20:07:18.7066154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:18.7067600Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:18.7068333Z ^ 2025-05-07T20:07:18.7068474Z 2025-05-07T20:07:18.7068715Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:18.7069081Z 2025-05-07T20:07:18.7070000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:18.7071400Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:18.7072026Z ^ 2025-05-07T20:07:18.7072340Z 2025-05-07T20:07:18.7073216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:18.7074560Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:18.7075151Z ^ 2025-05-07T20:07:18.7075298Z 2025-05-07T20:07:18.7075530Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:18.7075869Z 2025-05-07T20:07:18.7076710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:18.7078061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:18.7078704Z ^ 2025-05-07T20:07:18.7078889Z 2025-05-07T20:07:18.7079706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:18.7081054Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:18.7081660Z ^ 2025-05-07T20:07:18.7081793Z 2025-05-07T20:07:18.7082027Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:18.7082379Z 2025-05-07T20:07:18.7083212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:18.7084571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:18.7085169Z ^ 2025-05-07T20:07:18.7085357Z 2025-05-07T20:07:18.7086191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:18.7087519Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:18.7088120Z ^ 2025-05-07T20:07:18.7088251Z 2025-05-07T20:07:18.7088490Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:18.7088827Z 2025-05-07T20:07:18.7089664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:18.7091021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:18.7091671Z ^ 2025-05-07T20:07:18.7091855Z 2025-05-07T20:07:18.7092727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:18.7094067Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:18.7094653Z ^ 2025-05-07T20:07:18.7094798Z 2025-05-07T20:07:18.7095027Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:18.7095364Z 2025-05-07T20:07:18.7096235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:18.7097566Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:18.7098180Z ^ 2025-05-07T20:07:18.7098363Z 2025-05-07T20:07:19.4050896Z [595/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_py.so -o fbgemm_gpu_py.so CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_embedding_inplace_ops.so fbgemm_gpu_tbe_index_select.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_optimizers.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T20:07:19.4699302Z [596/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 1 2025-05-07T20:07:19.4700752Z ################################################################################ 2025-05-07T20:07:19.4701114Z [CMAKE] Running post-build script ... 2025-05-07T20:07:19.4701699Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:07:19.4702246Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:07:19.4702643Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:07:19.4703096Z ################################################################################ 2025-05-07T20:08:48.4996961Z [597/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T20:08:48.5009418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:48.5010803Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:48.5011435Z ^ 2025-05-07T20:08:48.5011589Z 2025-05-07T20:08:48.5011834Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:48.5012216Z 2025-05-07T20:08:48.5013069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:48.5014458Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:48.5015124Z ^ 2025-05-07T20:08:48.5015345Z 2025-05-07T20:08:48.5016182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:48.5017560Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:48.5018181Z ^ 2025-05-07T20:08:48.5018356Z 2025-05-07T20:08:48.5018598Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:48.5018949Z 2025-05-07T20:08:48.5019904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:48.5021485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:48.5022352Z ^ 2025-05-07T20:08:48.5022556Z 2025-05-07T20:08:48.5023425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:48.5024890Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:48.5025545Z ^ 2025-05-07T20:08:48.5025691Z 2025-05-07T20:08:48.5025934Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:48.5026316Z 2025-05-07T20:08:48.5027233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:48.5028654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:48.5029285Z ^ 2025-05-07T20:08:48.5029506Z 2025-05-07T20:08:48.5030369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:48.5031771Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:48.5032398Z ^ 2025-05-07T20:08:48.5032542Z 2025-05-07T20:08:48.5032812Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:48.5033214Z 2025-05-07T20:08:48.5034178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:48.5035551Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:48.5036187Z ^ 2025-05-07T20:08:48.5036384Z 2025-05-07T20:08:48.5037217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:48.5038587Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:48.5039221Z ^ 2025-05-07T20:08:48.5039362Z 2025-05-07T20:08:48.5039603Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:48.5039950Z 2025-05-07T20:08:48.5040814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:48.5042166Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:48.5042811Z ^ 2025-05-07T20:08:48.5043004Z 2025-05-07T20:08:49.3323055Z [598/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:08:49.3335610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:49.3337025Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:49.3337671Z ^ 2025-05-07T20:08:49.3337817Z 2025-05-07T20:08:49.3338067Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:49.3338424Z 2025-05-07T20:08:49.3339302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:49.3340935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:49.3341618Z ^ 2025-05-07T20:08:49.3341825Z 2025-05-07T20:08:49.3342757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:49.3344189Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:49.3344820Z ^ 2025-05-07T20:08:49.3344971Z 2025-05-07T20:08:49.3345256Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:49.3345617Z 2025-05-07T20:08:49.3346485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:49.3347958Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:49.3348674Z ^ 2025-05-07T20:08:49.3348879Z 2025-05-07T20:08:49.3349741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:49.3351158Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:49.3351986Z ^ 2025-05-07T20:08:49.3352129Z 2025-05-07T20:08:49.3352369Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:49.3352707Z 2025-05-07T20:08:49.3353541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:49.3354843Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:49.3355477Z ^ 2025-05-07T20:08:49.3355675Z 2025-05-07T20:08:49.3356502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:49.3357818Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:49.3358446Z ^ 2025-05-07T20:08:49.3358587Z 2025-05-07T20:08:49.3358853Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:49.3359190Z 2025-05-07T20:08:49.3359995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:49.3361317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:49.3361922Z ^ 2025-05-07T20:08:49.3362143Z 2025-05-07T20:08:49.3362935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:49.3364253Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:49.3364840Z ^ 2025-05-07T20:08:49.3365010Z 2025-05-07T20:08:49.3365248Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:49.3365586Z 2025-05-07T20:08:49.3366426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:49.3367725Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:49.3368356Z ^ 2025-05-07T20:08:49.3368552Z 2025-05-07T20:08:51.6589621Z [599/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o 2025-05-07T20:08:51.6601991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:51.6603314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:51.6603952Z ^ 2025-05-07T20:08:51.6604099Z 2025-05-07T20:08:51.6604364Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:51.6604708Z 2025-05-07T20:08:51.6605526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:51.6606857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:51.6607488Z ^ 2025-05-07T20:08:51.6607687Z 2025-05-07T20:08:51.6608485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:51.6609808Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:51.6610441Z ^ 2025-05-07T20:08:51.6610617Z 2025-05-07T20:08:51.6610849Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:51.6611187Z 2025-05-07T20:08:51.6612052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:51.6613362Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:51.6614002Z ^ 2025-05-07T20:08:51.6614200Z 2025-05-07T20:08:51.6615050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:51.6616348Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:51.6616969Z ^ 2025-05-07T20:08:51.6617110Z 2025-05-07T20:08:51.6617342Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:51.6617708Z 2025-05-07T20:08:51.6618514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:51.6619909Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:51.6620755Z ^ 2025-05-07T20:08:51.6621001Z 2025-05-07T20:08:51.6621863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:51.6624409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:51.6625521Z ^ 2025-05-07T20:08:51.6625804Z 2025-05-07T20:08:51.6626233Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:51.6626859Z 2025-05-07T20:08:51.6628436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:51.6630003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:51.6630680Z ^ 2025-05-07T20:08:51.6630887Z 2025-05-07T20:08:51.6631747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:51.6633169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:51.6633936Z ^ 2025-05-07T20:08:51.6634077Z 2025-05-07T20:08:51.6634312Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:51.6634673Z 2025-05-07T20:08:51.6635479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:51.6636889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:51.6637502Z ^ 2025-05-07T20:08:51.6637724Z 2025-05-07T20:08:53.2682349Z [600/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward.so -o fbgemm_gpu_tbe_training_backward.so CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T20:08:53.8761760Z [601/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_dense.so -o fbgemm_gpu_tbe_training_backward_dense.so CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T20:08:53.8955832Z [602/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_gwd.so -o fbgemm_gpu_tbe_training_backward_gwd.so CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs" -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib" && : 2025-05-07T20:08:53.9324333Z [603/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 1 2025-05-07T20:08:53.9325777Z ################################################################################ 2025-05-07T20:08:53.9326181Z [CMAKE] Running post-build script ... 2025-05-07T20:08:53.9326850Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:08:53.9327518Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:08:53.9327948Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:53.9328379Z ################################################################################ 2025-05-07T20:08:54.0223928Z [604/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 1 2025-05-07T20:08:54.0225518Z ################################################################################ 2025-05-07T20:08:54.0225913Z [CMAKE] Running post-build script ... 2025-05-07T20:08:54.0226567Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:08:54.0227212Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:08:54.0227617Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:54.0228150Z ################################################################################ 2025-05-07T20:08:54.0393548Z [605/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 1 2025-05-07T20:08:54.0397288Z ################################################################################ 2025-05-07T20:08:54.0398326Z [CMAKE] Running post-build script ... 2025-05-07T20:08:54.0399630Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:08:54.0400227Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:08:54.0400574Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:54.0400979Z ################################################################################ 2025-05-07T20:08:54.1484795Z [606/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_vbe.so -o fbgemm_gpu_tbe_training_backward_vbe.so CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T20:08:54.4777020Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 1 2025-05-07T20:08:54.4778880Z ################################################################################ 2025-05-07T20:08:54.4779266Z [CMAKE] Running post-build script ... 2025-05-07T20:08:54.4780029Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:08:54.4780695Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:08:54.4781109Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:54.4781543Z ################################################################################ 2025-05-07T20:08:54.4782547Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/cmake/data/bin/cmake -P cmake_install.cmake 2025-05-07T20:08:54.4817905Z -- Install configuration: "Release" 2025-05-07T20:08:54.4818600Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/asmjit.so 2025-05-07T20:08:54.4852060Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm.so 2025-05-07T20:08:54.4852989Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so 2025-05-07T20:08:54.4878394Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so 2025-05-07T20:08:54.4881219Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so 2025-05-07T20:08:54.4903138Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so 2025-05-07T20:08:54.4926967Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:08:54.4929887Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so 2025-05-07T20:08:54.4932200Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:08:54.4962192Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:08:54.4963463Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:08:54.4964562Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:09:00.7206413Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:09:01.8519302Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:09:04.4674972Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:09:04.9332058Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:09:04.9333198Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:09:04.9334324Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py 2025-05-07T20:09:04.9335586Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py 2025-05-07T20:09:04.9336824Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py 2025-05-07T20:09:04.9338342Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py 2025-05-07T20:09:04.9339578Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py 2025-05-07T20:09:04.9340936Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py 2025-05-07T20:09:04.9342260Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py 2025-05-07T20:09:04.9343630Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py 2025-05-07T20:09:04.9344895Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py 2025-05-07T20:09:04.9346251Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py 2025-05-07T20:09:04.9347659Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py 2025-05-07T20:09:04.9348887Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py 2025-05-07T20:09:04.9350066Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py 2025-05-07T20:09:04.9351255Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py 2025-05-07T20:09:04.9352580Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T20:09:04.9354003Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py 2025-05-07T20:09:04.9355101Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:09:04.9380974Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so 2025-05-07T20:09:04.9427672Z 2025-05-07T20:09:04.9482357Z 2025-05-07T20:09:04.9482931Z copying fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/__init__.py 2025-05-07T20:09:04.9483877Z copying fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py 2025-05-07T20:09:04.9485035Z copying fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/enums.py 2025-05-07T20:09:04.9485817Z copying fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/metrics.py 2025-05-07T20:09:04.9486785Z copying fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py 2025-05-07T20:09:04.9488024Z copying fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py 2025-05-07T20:09:04.9489082Z copying fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_comm.py 2025-05-07T20:09:04.9489931Z copying fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_utils.py 2025-05-07T20:09:04.9490815Z copying fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/runtime_monitor.py 2025-05-07T20:09:04.9491729Z copying fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sparse_ops.py 2025-05-07T20:09:04.9492628Z copying fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_configs.py 2025-05-07T20:09:04.9493747Z copying fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py 2025-05-07T20:09:04.9494885Z copying fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py 2025-05-07T20:09:04.9495931Z copying fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_utils.py 2025-05-07T20:09:04.9497021Z copying fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py 2025-05-07T20:09:04.9498245Z copying fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py 2025-05-07T20:09:04.9499584Z copying fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py 2025-05-07T20:09:04.9501090Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py 2025-05-07T20:09:04.9502467Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py 2025-05-07T20:09:04.9503822Z copying fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py 2025-05-07T20:09:04.9504946Z copying fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py 2025-05-07T20:09:04.9505770Z copying fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/uvm.py 2025-05-07T20:09:04.9506530Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config 2025-05-07T20:09:04.9507260Z copying fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/__init__.py 2025-05-07T20:09:04.9508193Z copying fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/feature_list.py 2025-05-07T20:09:04.9509145Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs 2025-05-07T20:09:04.9509847Z copying fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/__init__.py 2025-05-07T20:09:04.9510687Z copying fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/common.py 2025-05-07T20:09:04.9511502Z copying fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/examples.py 2025-05-07T20:09:04.9512478Z copying fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py 2025-05-07T20:09:04.9513557Z copying fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py 2025-05-07T20:09:04.9514694Z copying fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py 2025-05-07T20:09:04.9515760Z copying fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/quantize_ops.py 2025-05-07T20:09:04.9516673Z copying fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/sparse_ops.py 2025-05-07T20:09:04.9517516Z copying fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/version.py 2025-05-07T20:09:04.9518311Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize 2025-05-07T20:09:04.9519068Z copying fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/__init__.py 2025-05-07T20:09:04.9520057Z copying fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/quantize_ops.py 2025-05-07T20:09:04.9520877Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll 2025-05-07T20:09:04.9521574Z copying fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/__init__.py 2025-05-07T20:09:04.9522509Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe 2025-05-07T20:09:04.9523197Z copying fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/__init__.py 2025-05-07T20:09:04.9523945Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton 2025-05-07T20:09:04.9524715Z copying fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/__init__.py 2025-05-07T20:09:04.9525561Z copying fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/common.py 2025-05-07T20:09:04.9526456Z copying fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize.py 2025-05-07T20:09:04.9527402Z copying fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize_ref.py 2025-05-07T20:09:04.9528182Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils 2025-05-07T20:09:04.9528931Z copying fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/__init__.py 2025-05-07T20:09:04.9529781Z copying fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/filestore.py 2025-05-07T20:09:04.9530659Z copying fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/loader.py 2025-05-07T20:09:04.9531579Z copying fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/torch_library.py 2025-05-07T20:09:04.9532356Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu 2025-05-07T20:09:04.9533201Z copying fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/__init__.py 2025-05-07T20:09:04.9534066Z copying fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py 2025-05-07T20:09:04.9534910Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta 2025-05-07T20:09:04.9535712Z copying fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/__init__.py 2025-05-07T20:09:04.9536606Z copying fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py 2025-05-07T20:09:04.9537426Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton 2025-05-07T20:09:04.9538265Z copying fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/__init__.py 2025-05-07T20:09:04.9539219Z copying fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/common.py 2025-05-07T20:09:04.9540478Z copying fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py 2025-05-07T20:09:04.9541803Z copying fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py 2025-05-07T20:09:04.9543028Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py 2025-05-07T20:09:04.9544249Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py 2025-05-07T20:09:04.9545624Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py 2025-05-07T20:09:04.9547125Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py 2025-05-07T20:09:04.9548636Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py 2025-05-07T20:09:04.9550033Z copying fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py 2025-05-07T20:09:04.9551509Z copying fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py 2025-05-07T20:09:04.9552871Z copying fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py 2025-05-07T20:09:04.9554188Z copying fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py 2025-05-07T20:09:04.9555278Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench 2025-05-07T20:09:04.9556066Z copying fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/__init__.py 2025-05-07T20:09:04.9557051Z copying fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py 2025-05-07T20:09:04.9558062Z copying fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py 2025-05-07T20:09:04.9558986Z copying fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py 2025-05-07T20:09:04.9560083Z copying fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py 2025-05-07T20:09:04.9561323Z copying fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py 2025-05-07T20:09:04.9562345Z copying fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/reporter.py 2025-05-07T20:09:04.9563349Z copying fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py 2025-05-07T20:09:04.9564410Z copying fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py 2025-05-07T20:09:04.9565636Z copying fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py 2025-05-07T20:09:04.9566716Z copying fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/utils.py 2025-05-07T20:09:04.9567652Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache 2025-05-07T20:09:04.9568452Z copying fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/__init__.py 2025-05-07T20:09:04.9569529Z copying fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py 2025-05-07T20:09:04.9570443Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:04.9571219Z copying fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py 2025-05-07T20:09:04.9572105Z copying fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/common.py 2025-05-07T20:09:04.9573022Z copying fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/inference.py 2025-05-07T20:09:04.9573961Z copying fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/training.py 2025-05-07T20:09:04.9574733Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats 2025-05-07T20:09:04.9575533Z copying fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/__init__.py 2025-05-07T20:09:04.9576551Z copying fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py 2025-05-07T20:09:04.9577482Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils 2025-05-07T20:09:04.9578271Z copying fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/__init__.py 2025-05-07T20:09:04.9579164Z copying fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/common.py 2025-05-07T20:09:04.9580181Z copying fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/offsets.py 2025-05-07T20:09:04.9581115Z copying fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/quantize.py 2025-05-07T20:09:04.9582091Z copying fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/requests.py 2025-05-07T20:09:04.9582933Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:04.9583746Z copying fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py 2025-05-07T20:09:04.9584955Z copying fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py 2025-05-07T20:09:04.9585984Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged 2025-05-07T20:09:04.9586867Z copying fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/__init__.py 2025-05-07T20:09:04.9588001Z copying fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py 2025-05-07T20:09:04.9588683Z 2025-05-07T20:09:04.9678778Z INFO:root:running bdist_wheel 2025-05-07T20:09:04.9714681Z INFO:root:running build 2025-05-07T20:09:04.9715123Z INFO:root:running build_py 2025-05-07T20:09:04.9719619Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9721620Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9723862Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9725283Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9726883Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9728625Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9730213Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9731716Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9733545Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9735090Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9736616Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9738763Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9740368Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9742045Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9743567Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9745118Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9746689Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9748251Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9751178Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9754093Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9755658Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9757614Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9758979Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9760849Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config 2025-05-07T20:09:04.9762129Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config 2025-05-07T20:09:04.9763776Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config 2025-05-07T20:09:04.9766558Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:04.9767814Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:04.9769411Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:04.9771171Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:04.9772716Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:04.9774221Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:04.9775755Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:04.9777235Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:04.9778766Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:04.9780709Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:04.9783098Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize 2025-05-07T20:09:04.9784368Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize 2025-05-07T20:09:04.9786071Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize 2025-05-07T20:09:04.9788082Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll 2025-05-07T20:09:04.9789321Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll 2025-05-07T20:09:04.9791453Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe 2025-05-07T20:09:04.9792653Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe 2025-05-07T20:09:04.9795151Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:04.9796324Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:04.9797952Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:04.9799554Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:04.9801254Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:04.9803615Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:04.9804883Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:04.9806454Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:04.9807977Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:04.9809506Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:04.9811647Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu 2025-05-07T20:09:04.9812860Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu 2025-05-07T20:09:04.9815034Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu 2025-05-07T20:09:04.9817384Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta 2025-05-07T20:09:04.9818583Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta 2025-05-07T20:09:04.9820282Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta 2025-05-07T20:09:04.9823928Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:04.9825078Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:04.9826816Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:04.9828606Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:04.9830235Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:04.9831879Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:04.9833469Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:04.9835108Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:04.9836833Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:04.9838567Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:04.9840317Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:04.9842011Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:04.9843675Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:04.9845344Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:04.9848068Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:04.9849340Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:04.9851060Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:04.9853232Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:04.9854944Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:04.9856606Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:04.9858218Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:04.9859726Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:04.9861304Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:04.9863325Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:04.9864894Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:04.9866428Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:04.9868769Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache 2025-05-07T20:09:04.9870150Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache 2025-05-07T20:09:04.9871833Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache 2025-05-07T20:09:04.9874047Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:04.9875235Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:04.9876821Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:04.9878405Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:04.9880064Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:04.9883383Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats 2025-05-07T20:09:04.9884625Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats 2025-05-07T20:09:04.9886344Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats 2025-05-07T20:09:04.9888642Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:04.9889947Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:04.9891545Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:04.9893063Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:04.9894579Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:04.9896227Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:04.9898486Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:04.9900055Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:04.9901681Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:04.9903726Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged 2025-05-07T20:09:04.9904920Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged 2025-05-07T20:09:04.9906627Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged 2025-05-07T20:09:04.9961220Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:04.9989729Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:05.0317708Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:05.1399374Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:08.5621591Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:08.5627422Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:08.6939694Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:08.7029875Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:08.7249262Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:08.7945301Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:11.5008517Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:11.5823285Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:18.7991357Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.9288721Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:22.7206324Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:23.1863667Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:23.2228023Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:23.4963623Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:23.4965265Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:23.4968851Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:23.4974894Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:23.4987143Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:23.4991782Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:23.4998912Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:23.5012106Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:23.5018945Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:23.5025084Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:23.5036372Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:23.5042691Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:23.5048177Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:23.5053109Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:23.5066353Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:23.5070051Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:23.5071653Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:23.5081595Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:23.5088563Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:23.5125120Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0864477Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0865933Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0867277Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0868564Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0870259Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0871927Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0873523Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0875422Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0877044Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0878648Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0880962Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0882564Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0884439Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0885959Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0887705Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0889230Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0890910Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0893919Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0897013Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0898707Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0900362Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0902206Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:24.0903790Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config 2025-05-07T20:09:24.0905514Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config 2025-05-07T20:09:24.0907126Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:24.0908856Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:24.0910489Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:24.0912168Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:24.0913857Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:24.0916084Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:24.0917909Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:24.0919591Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:24.0921383Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:24.0923282Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize 2025-05-07T20:09:24.0927034Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize 2025-05-07T20:09:24.0928501Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll 2025-05-07T20:09:24.0930351Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe 2025-05-07T20:09:24.0932040Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:24.0933691Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:24.0935295Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:24.0937117Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:24.0938742Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:24.0940542Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:24.0942334Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:24.0943903Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:24.0945547Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu 2025-05-07T20:09:24.0947240Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu 2025-05-07T20:09:24.0949478Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta 2025-05-07T20:09:24.0951172Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta 2025-05-07T20:09:24.0953062Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.0954702Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.0956464Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.0958140Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.0960034Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.0961675Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.0963338Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.0965165Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.0966912Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.0968613Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.0970303Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.0971976Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.0973638Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.0975257Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.0977019Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.0978645Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.0981019Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.0982781Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.0984409Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.0988257Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.0989817Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.0991487Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.0993187Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.0994754Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.0996324Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache 2025-05-07T20:09:24.0998115Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache 2025-05-07T20:09:24.0999652Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.1001283Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.1002885Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.1004653Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.1007059Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats 2025-05-07T20:09:24.1008811Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats 2025-05-07T20:09:24.1010700Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.1012496Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.1014023Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.1015582Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.1017243Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.1019242Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:24.1021259Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:24.1023092Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged 2025-05-07T20:09:24.1024837Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged 2025-05-07T20:09:24.1046395Z INFO:skbuild:copied 90 files 2025-05-07T20:09:24.1047976Z INFO:root:running build_ext 2025-05-07T20:09:24.1048471Z INFO:root:installing to _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:24.1049013Z INFO:root:running install 2025-05-07T20:09:24.1099657Z INFO:root:running install_lib 2025-05-07T20:09:24.1100422Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:24.1101231Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu 2025-05-07T20:09:24.1102144Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/config 2025-05-07T20:09:24.1103639Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:24.1105268Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:24.1106499Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/docs 2025-05-07T20:09:24.1107673Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.1109192Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.1110754Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.1112361Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.1114019Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.1115709Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.1117373Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.1119084Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.1120672Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.1121868Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/quantize 2025-05-07T20:09:24.1123267Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:24.1124927Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:24.1126168Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll 2025-05-07T20:09:24.1126937Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/cpu 2025-05-07T20:09:24.1128151Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:24.1129748Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:24.1130991Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/meta 2025-05-07T20:09:24.1132214Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:24.1133842Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:24.1135042Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.1136287Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.1137948Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.1139673Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.1141628Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.1143411Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.1145164Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.1147060Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.1149045Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.1150979Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.1152874Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.1154793Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.1156658Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.1158475Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.1160223Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll 2025-05-07T20:09:24.1161384Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe 2025-05-07T20:09:24.1162161Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.1163400Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.1165043Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.1166689Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.1168337Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.1170057Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.1171795Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.1173496Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.1175163Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.1176910Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.1178680Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.1180492Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.1181736Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/cache 2025-05-07T20:09:24.1182948Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:24.1184651Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:24.1185941Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.1186752Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:24.1188062Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:24.1189844Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:24.1191585Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.1193182Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.1194777Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.1196392Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.1197612Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/stats 2025-05-07T20:09:24.1198810Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:24.1200500Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:24.1201782Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.1202970Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.1204617Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.1206276Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.1207887Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.1209576Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.1211159Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe 2025-05-07T20:09:24.1212273Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton 2025-05-07T20:09:24.1213094Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton/jagged 2025-05-07T20:09:24.1214371Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:24.1216130Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:24.1217831Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:24.1219391Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:24.1220999Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:24.1222754Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:24.1223960Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/utils 2025-05-07T20:09:24.1225106Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:24.1226674Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:24.1228247Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:24.1229800Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:24.1231328Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:24.1232834Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:24.1248887Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:24.1383038Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:24.4151777Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:24.4153536Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:24.4262189Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:24.4271621Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:24.4295802Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:24.4352400Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:24.6531723Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:24.6598366Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.2247187Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.3131590Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.5173352Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.5541486Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.5572904Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.5787169Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:25.5789080Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:25.5791418Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:25.5793636Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:25.5795863Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:25.5797988Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:25.5800174Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:25.5802840Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:25.5805119Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:25.5807294Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:25.5809520Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:25.5811834Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:25.5814029Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:25.5816115Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:25.5818276Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:25.5819994Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:25.5821706Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:25.5824097Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:25.5825976Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.5827526Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6271837Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6273468Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6275238Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6276689Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6278260Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6279969Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6281559Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6283068Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6284599Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6286079Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6287611Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6289273Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6291008Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6292697Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6294319Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6296049Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6297802Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6299557Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6301376Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6303177Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6304854Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6306350Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.6307221Z INFO:skbuild:copied 125 files 2025-05-07T20:09:25.6307560Z INFO:root:running install_egg_info 2025-05-07T20:09:25.6337233Z INFO:root:running egg_info 2025-05-07T20:09:25.6364606Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T20:09:25.6366877Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T20:09:25.6369201Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T20:09:25.6370435Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T20:09:25.6466242Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:25.6506514Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:25.6507709Z INFO:root:Copying fbgemm_gpu_nightly.egg-info to _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu_nightly-2025.5.7-py3.10.egg-info 2025-05-07T20:09:25.6515267Z INFO:root:running install_scripts 2025-05-07T20:09:25.6516082Z INFO:skbuild:copied 0 files 2025-05-07T20:09:28.4504416Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL 2025-05-07T20:09:28.4506761Z INFO:wheel:creating '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-gh9ovt89/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl' and adding '_skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel' to it 2025-05-07T20:09:28.4510114Z INFO:wheel:adding 'fbgemm_gpu/__init__.py' 2025-05-07T20:09:28.4771259Z INFO:wheel:adding 'fbgemm_gpu/asmjit.so' 2025-05-07T20:09:28.4781359Z INFO:wheel:adding 'fbgemm_gpu/batched_unary_embeddings_ops.py' 2025-05-07T20:09:28.4782782Z INFO:wheel:adding 'fbgemm_gpu/enums.py' 2025-05-07T20:09:28.6806742Z INFO:wheel:adding 'fbgemm_gpu/fbgemm.so' 2025-05-07T20:09:28.6941714Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_config.so' 2025-05-07T20:09:28.7071719Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so' 2025-05-07T20:09:30.4179635Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_py.so' 2025-05-07T20:09:30.6202298Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so' 2025-05-07T20:09:31.3302922Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_cache.so' 2025-05-07T20:09:31.4379148Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_common.so' 2025-05-07T20:09:32.0315893Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_index_select.so' 2025-05-07T20:09:49.8726176Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_inference.so' 2025-05-07T20:09:51.1091181Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so' 2025-05-07T20:10:18.2560036Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so' 2025-05-07T20:10:21.0629796Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so' 2025-05-07T20:10:24.6730683Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so' 2025-05-07T20:10:25.3644924Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so' 2025-05-07T20:10:25.5830385Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so' 2025-05-07T20:10:34.1885113Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so' 2025-05-07T20:10:45.0983898Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so' 2025-05-07T20:10:46.5618765Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_utils.so' 2025-05-07T20:10:46.5975671Z INFO:wheel:adding 'fbgemm_gpu/metrics.py' 2025-05-07T20:10:46.5978744Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules.py' 2025-05-07T20:10:46.5979338Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules_split.py' 2025-05-07T20:10:46.5980297Z INFO:wheel:adding 'fbgemm_gpu/quantize_comm.py' 2025-05-07T20:10:46.5983621Z INFO:wheel:adding 'fbgemm_gpu/quantize_utils.py' 2025-05-07T20:10:46.5986739Z INFO:wheel:adding 'fbgemm_gpu/runtime_monitor.py' 2025-05-07T20:10:46.5997804Z INFO:wheel:adding 'fbgemm_gpu/sparse_ops.py' 2025-05-07T20:10:46.6001586Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_configs.py' 2025-05-07T20:10:46.6004569Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_inference_converter.py' 2025-05-07T20:10:46.6006231Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_ops.py' 2025-05-07T20:10:46.6007832Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_utils.py' 2025-05-07T20:10:46.6009837Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops.py' 2025-05-07T20:10:46.6013180Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_common.py' 2025-05-07T20:10:46.6038456Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_inference.py' 2025-05-07T20:10:46.6080884Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training.py' 2025-05-07T20:10:46.6083540Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py' 2025-05-07T20:10:46.6085218Z INFO:wheel:adding 'fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py' 2025-05-07T20:10:46.6087236Z INFO:wheel:adding 'fbgemm_gpu/tbe_input_multiplexer.py' 2025-05-07T20:10:46.6088853Z INFO:wheel:adding 'fbgemm_gpu/uvm.py' 2025-05-07T20:10:46.6090872Z INFO:wheel:adding 'fbgemm_gpu/config/__init__.py' 2025-05-07T20:10:46.6092793Z INFO:wheel:adding 'fbgemm_gpu/config/feature_list.py' 2025-05-07T20:10:46.6094840Z INFO:wheel:adding 'fbgemm_gpu/docs/__init__.py' 2025-05-07T20:10:46.6096199Z INFO:wheel:adding 'fbgemm_gpu/docs/common.py' 2025-05-07T20:10:46.6098130Z INFO:wheel:adding 'fbgemm_gpu/docs/examples.py' 2025-05-07T20:10:46.6100931Z INFO:wheel:adding 'fbgemm_gpu/docs/jagged_tensor_ops.py' 2025-05-07T20:10:46.6102823Z INFO:wheel:adding 'fbgemm_gpu/docs/merge_pooled_embedding_ops.py' 2025-05-07T20:10:46.6105142Z INFO:wheel:adding 'fbgemm_gpu/docs/permute_pooled_embedding_ops.py' 2025-05-07T20:10:46.6106853Z INFO:wheel:adding 'fbgemm_gpu/docs/quantize_ops.py' 2025-05-07T20:10:46.6112774Z INFO:wheel:adding 'fbgemm_gpu/docs/sparse_ops.py' 2025-05-07T20:10:46.6114854Z INFO:wheel:adding 'fbgemm_gpu/docs/version.py' 2025-05-07T20:10:46.6116735Z INFO:wheel:adding 'fbgemm_gpu/quantize/__init__.py' 2025-05-07T20:10:46.6118790Z INFO:wheel:adding 'fbgemm_gpu/quantize/quantize_ops.py' 2025-05-07T20:10:46.6120868Z INFO:wheel:adding 'fbgemm_gpu/sll/__init__.py' 2025-05-07T20:10:46.6123321Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/__init__.py' 2025-05-07T20:10:46.6129747Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/cpu_sll.py' 2025-05-07T20:10:46.6132420Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/__init__.py' 2025-05-07T20:10:46.6135082Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/meta_sll.py' 2025-05-07T20:10:46.6137707Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/__init__.py' 2025-05-07T20:10:46.6139300Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/common.py' 2025-05-07T20:10:46.6141467Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py' 2025-05-07T20:10:46.6143963Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py' 2025-05-07T20:10:46.6147688Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm.py' 2025-05-07T20:10:46.6151760Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py' 2025-05-07T20:10:46.6153906Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py' 2025-05-07T20:10:46.6156296Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py' 2025-05-07T20:10:46.6161949Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py' 2025-05-07T20:10:46.6167425Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py' 2025-05-07T20:10:46.6169729Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py' 2025-05-07T20:10:46.6173594Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_softmax.py' 2025-05-07T20:10:46.6179136Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py' 2025-05-07T20:10:46.6182161Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py' 2025-05-07T20:10:46.6185249Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py' 2025-05-07T20:10:46.6189009Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py' 2025-05-07T20:10:46.6191252Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py' 2025-05-07T20:10:46.6193290Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py' 2025-05-07T20:10:46.6196333Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py' 2025-05-07T20:10:46.6199614Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py' 2025-05-07T20:10:46.6202623Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py' 2025-05-07T20:10:46.6205827Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py' 2025-05-07T20:10:46.6209018Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py' 2025-05-07T20:10:46.6212149Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py' 2025-05-07T20:10:46.6215461Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py' 2025-05-07T20:10:46.6219030Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py' 2025-05-07T20:10:46.6222176Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py' 2025-05-07T20:10:46.6224666Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py' 2025-05-07T20:10:46.6227302Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py' 2025-05-07T20:10:46.6228915Z INFO:wheel:adding 'fbgemm_gpu/tbe/__init__.py' 2025-05-07T20:10:46.6230991Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/__init__.py' 2025-05-07T20:10:46.6233249Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_config.py' 2025-05-07T20:10:46.6238296Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_runs.py' 2025-05-07T20:10:46.6240985Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eeg_cli.py' 2025-05-07T20:10:46.6243415Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/embedding_ops_common_config.py' 2025-05-07T20:10:46.6245429Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eval_compression.py' 2025-05-07T20:10:46.6247064Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/reporter.py' 2025-05-07T20:10:46.6250345Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config.py' 2025-05-07T20:10:46.6253244Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_loader.py' 2025-05-07T20:10:46.6255722Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py' 2025-05-07T20:10:46.6257541Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/utils.py' 2025-05-07T20:10:46.6259332Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/__init__.py' 2025-05-07T20:10:46.6261236Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py' 2025-05-07T20:10:46.6262963Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/__init__.py' 2025-05-07T20:10:46.6264384Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/common.py' 2025-05-07T20:10:46.6270346Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/inference.py' 2025-05-07T20:10:46.6296779Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/training.py' 2025-05-07T20:10:46.6299485Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/__init__.py' 2025-05-07T20:10:46.6302643Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py' 2025-05-07T20:10:46.6304475Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/__init__.py' 2025-05-07T20:10:46.6307275Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/bench_params_reporter.py' 2025-05-07T20:10:46.6309190Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/__init__.py' 2025-05-07T20:10:46.6310869Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/common.py' 2025-05-07T20:10:46.6312744Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/offsets.py' 2025-05-07T20:10:46.6315309Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/quantize.py' 2025-05-07T20:10:46.6320991Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/requests.py' 2025-05-07T20:10:46.6323556Z INFO:wheel:adding 'fbgemm_gpu/triton/__init__.py' 2025-05-07T20:10:46.6325486Z INFO:wheel:adding 'fbgemm_gpu/triton/common.py' 2025-05-07T20:10:46.6333135Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize.py' 2025-05-07T20:10:46.6337787Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize_ref.py' 2025-05-07T20:10:46.6339878Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/__init__.py' 2025-05-07T20:10:46.6348019Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py' 2025-05-07T20:10:46.6350407Z INFO:wheel:adding 'fbgemm_gpu/utils/__init__.py' 2025-05-07T20:10:46.6352727Z INFO:wheel:adding 'fbgemm_gpu/utils/filestore.py' 2025-05-07T20:10:46.6354411Z INFO:wheel:adding 'fbgemm_gpu/utils/loader.py' 2025-05-07T20:10:46.6356711Z INFO:wheel:adding 'fbgemm_gpu/utils/torch_library.py' 2025-05-07T20:10:46.6359617Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/METADATA' 2025-05-07T20:10:46.6360671Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL' 2025-05-07T20:10:46.6361590Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/top_level.txt' 2025-05-07T20:10:46.6368282Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/RECORD' 2025-05-07T20:10:46.6372183Z INFO:root:removing _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:10:46.8104608Z ╒════════════════════════════╤════════════════════════════════════════════════╕ 2025-05-07T20:10:46.8105167Z │ │ Version │ 2025-05-07T20:10:46.8105754Z ╞════════════════════════════╪════════════════════════════════════════════════╡ 2025-05-07T20:10:46.8106309Z │ PyTorch │ 2.8.0.dev20250507+cu126 │ 2025-05-07T20:10:46.8107039Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:10:46.8107619Z │ CUDA (Declared by PyTorch) │ 12.6 │ 2025-05-07T20:10:46.8108202Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:10:46.8108847Z │ CUDA (Actual) │ nvcc: NVIDIA (R) Cuda compiler driver │ 2025-05-07T20:10:46.8109423Z │ │ Copyright (c) 2005-2024 NVIDIA Corporation │ 2025-05-07T20:10:46.8109921Z │ │ Built on Tue_Oct_29_23:50:19_PDT_2024 │ 2025-05-07T20:10:46.8110435Z │ │ Cuda compilation tools, release 12.6, V12.6.85 │ 2025-05-07T20:10:46.8111005Z │ │ Build cuda_12.6.r12.6/compiler.35059454_0 │ 2025-05-07T20:10:46.8111587Z ╘════════════════════════════╧════════════════════════════════════════════════╛ 2025-05-07T20:10:47.1093195Z Successfully built fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:10:47.1964257Z 2025-05-07T20:10:47.2118148Z ################################################################################ 2025-05-07T20:10:47.2119543Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:10:47.2120848Z [CHECK] Listing out library size: 2025-05-07T20:10:47.2122416Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:10:47.2123426Z 2025-05-07T20:10:47.2141731Z 1 ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:10:47.2142015Z 2025-05-07T20:10:47.2142369Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:10:47.2143475Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.2144023Z 2025-05-07T20:10:47.2213813Z GLIBC_2.2.5 2025-05-07T20:10:47.2214169Z GLIBC_2.14 2025-05-07T20:10:47.2214413Z 2025-05-07T20:10:47.2214425Z 2025-05-07T20:10:47.2215193Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:10:47.2216156Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.2216749Z 2025-05-07T20:10:47.2293172Z GLIBCXX_3.4 2025-05-07T20:10:47.2295800Z 2025-05-07T20:10:47.2295816Z 2025-05-07T20:10:47.2321062Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so > /tmp/tmp.tSiQvbpxit.symbols.txt 2025-05-07T20:10:47.2322858Z 2025-05-07T20:10:47.2353288Z 2025-05-07T20:10:47.2386586Z [CHECK] Total Number of symbols: 841 2025-05-07T20:10:47.2404022Z [CHECK] Number of fbgemm symbols: 0 2025-05-07T20:10:47.2423503Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so > /tmp/tmp.fCdGDigcRg.usymbols.txt 2025-05-07T20:10:47.2425183Z 2025-05-07T20:10:47.2445060Z 2025-05-07T20:10:47.2471252Z [CHECK] Listing out undefined symbols (51 total): 2025-05-07T20:10:47.2489952Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:47.2490557Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:47.2490974Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:47.2491335Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:47.2491692Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:10:47.2492031Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:47.2492381Z U abort@GLIBC_2.2.5 2025-05-07T20:10:47.2492681Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:47.2493104Z U close@GLIBC_2.2.5 2025-05-07T20:10:47.2493495Z U fputs@GLIBC_2.2.5 2025-05-07T20:10:47.2493793Z U free@GLIBC_2.2.5 2025-05-07T20:10:47.2494092Z U ftruncate64@GLIBC_2.2.5 2025-05-07T20:10:47.2494387Z U fwrite@GLIBC_2.2.5 2025-05-07T20:10:47.2494690Z U getenv@GLIBC_2.2.5 2025-05-07T20:10:47.2494977Z U getpagesize@GLIBC_2.2.5 2025-05-07T20:10:47.2495292Z U madvise@GLIBC_2.2.5 2025-05-07T20:10:47.2495579Z U malloc@GLIBC_2.2.5 2025-05-07T20:10:47.2495877Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:47.2496301Z U memcpy@GLIBC_2.14 2025-05-07T20:10:47.2496607Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:47.2496883Z U memset@GLIBC_2.2.5 2025-05-07T20:10:47.2497184Z U mmap@GLIBC_2.2.5 2025-05-07T20:10:47.2497485Z U mprotect@GLIBC_2.2.5 2025-05-07T20:10:47.2497769Z U munmap@GLIBC_2.2.5 2025-05-07T20:10:47.2498069Z U open64@GLIBC_2.2.5 2025-05-07T20:10:47.2498430Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:47.2498788Z U pthread_mutex_destroy@GLIBC_2.2.5 2025-05-07T20:10:47.2499113Z U pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:47.2499460Z U pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:47.2499858Z U read@GLIBC_2.2.5 2025-05-07T20:10:47.2500333Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:47.2500670Z U shm_open@GLIBC_2.2.5 2025-05-07T20:10:47.2501021Z U shm_unlink@GLIBC_2.2.5 2025-05-07T20:10:47.2501367Z U snprintf@GLIBC_2.2.5 2025-05-07T20:10:47.2501712Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:47.2502056Z U stderr@GLIBC_2.2.5 2025-05-07T20:10:47.2502359Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:47.2502683Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:47.2502982Z U strtol@GLIBC_2.2.5 2025-05-07T20:10:47.2503374Z U syscall@GLIBC_2.2.5 2025-05-07T20:10:47.2503707Z U sysconf@GLIBC_2.2.5 2025-05-07T20:10:47.2504009Z U uname@GLIBC_2.2.5 2025-05-07T20:10:47.2504348Z U unlink@GLIBC_2.2.5 2025-05-07T20:10:47.2504657Z U vsnprintf@GLIBC_2.2.5 2025-05-07T20:10:47.2505062Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:47.2505501Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:47.2505977Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:10:47.2506504Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:47.2506822Z w _ITM_registerTMCloneTable 2025-05-07T20:10:47.2507173Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:47.2507466Z w __gmon_start__ 2025-05-07T20:10:47.2507812Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:47.2508213Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:10:47.2508485Z 2025-05-07T20:10:47.2540384Z linux-vdso.so.1 (0x00007ffd187e4000) 2025-05-07T20:10:47.2541521Z libtorch.so => not found 2025-05-07T20:10:47.2542073Z libc10.so => not found 2025-05-07T20:10:47.2542528Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.2542990Z libc10_cuda.so => not found 2025-05-07T20:10:47.2543444Z libnccl.so.2 => not found 2025-05-07T20:10:47.2543851Z libcuda.so.1 => not found 2025-05-07T20:10:47.2544283Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.2544776Z libtorch_cpu.so => not found 2025-05-07T20:10:47.2545276Z libtorch_cuda.so => not found 2025-05-07T20:10:47.2545726Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f83d6a6b000) 2025-05-07T20:10:47.2546191Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f83d6a15000) 2025-05-07T20:10:47.2546619Z librt.so.1 => /lib64/librt.so.1 (0x00007f83d6a0e000) 2025-05-07T20:10:47.2547054Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f83d69e0000) 2025-05-07T20:10:47.2547519Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f83d69db000) 2025-05-07T20:10:47.2547944Z libc.so.6 => /lib64/libc.so.6 (0x00007f83d67d3000) 2025-05-07T20:10:47.2548311Z libm.so.6 => /lib64/libm.so.6 (0x00007f83d66f8000) 2025-05-07T20:10:47.2548695Z /lib64/ld-linux-x86-64.so.2 (0x00007f83d6d4c000) 2025-05-07T20:10:47.2548935Z 2025-05-07T20:10:47.2549058Z [CHECK] Displaying ELF information: 2025-05-07T20:10:47.2549452Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:10:47.2549734Z 2025-05-07T20:10:47.2580320Z 2025-05-07T20:10:47.2581048Z Dynamic section at offset 0x75898 contains 39 entries: 2025-05-07T20:10:47.2581691Z Tag Type Name/Value 2025-05-07T20:10:47.2582426Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:47.2583178Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:47.2583715Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:47.2584382Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:47.2584910Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:47.2585454Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:47.2585990Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:47.2586554Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:47.2587117Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:47.2587650Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:47.2588195Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:47.2588882Z 0x0000000000000001 (NEEDED) Shared library: [librt.so.1] 2025-05-07T20:10:47.2589476Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:47.2589987Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:10:47.2590505Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:47.2591011Z 0x000000000000000e (SONAME) Library soname: [asmjit.so] 2025-05-07T20:10:47.2591417Z 0x000000000000000c (INIT) 0x19000 2025-05-07T20:10:47.2591758Z 0x000000000000000d (FINI) 0x56a1c 2025-05-07T20:10:47.2592094Z 0x0000000000000019 (INIT_ARRAY) 0x74ac0 2025-05-07T20:10:47.2592468Z 0x000000000000001b (INIT_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:47.2592827Z 0x000000000000001a (FINI_ARRAY) 0x74ac8 2025-05-07T20:10:47.2593203Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:47.2593552Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:47.2593986Z 0x0000000000000005 (STRTAB) 0x6980 2025-05-07T20:10:47.2594602Z 0x0000000000000006 (SYMTAB) 0x1a90 2025-05-07T20:10:47.2595207Z 0x000000000000000a (STRSZ) 48829 (bytes) 2025-05-07T20:10:47.2595882Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:47.2596592Z 0x0000000000000003 (PLTGOT) 0x75fe8 2025-05-07T20:10:47.2597252Z 0x0000000000000002 (PLTRELSZ) 8472 (bytes) 2025-05-07T20:10:47.2597838Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:47.2598504Z 0x0000000000000017 (JMPREL) 0x162e0 2025-05-07T20:10:47.2599044Z 0x0000000000000007 (RELA) 0x12f98 2025-05-07T20:10:47.2599675Z 0x0000000000000008 (RELASZ) 13128 (bytes) 2025-05-07T20:10:47.2600382Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:47.2600954Z 0x000000006ffffffe (VERNEED) 0x12ed8 2025-05-07T20:10:47.2601556Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:47.2602181Z 0x000000006ffffff0 (VERSYM) 0x1283e 2025-05-07T20:10:47.2602733Z 0x000000006ffffff9 (RELACOUNT) 3 2025-05-07T20:10:47.2603146Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:47.2603473Z 2025-05-07T20:10:47.2603659Z ################################################################################ 2025-05-07T20:10:47.2604022Z 2025-05-07T20:10:47.2604028Z 2025-05-07T20:10:47.2604253Z ################################################################################ 2025-05-07T20:10:47.2605093Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:47.2605796Z [CHECK] Listing out library size: 2025-05-07T20:10:47.2606528Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:47.2607155Z 2025-05-07T20:10:47.2607402Z 1 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:47.2607698Z 2025-05-07T20:10:47.2608102Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:47.2609115Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.2609717Z 2025-05-07T20:10:47.2650266Z GLIBC_2.2.5 2025-05-07T20:10:47.2650515Z GLIBC_2.14 2025-05-07T20:10:47.2650668Z 2025-05-07T20:10:47.2650672Z 2025-05-07T20:10:47.2651072Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:47.2652113Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.2652717Z 2025-05-07T20:10:47.2712109Z GLIBCXX_3.4 2025-05-07T20:10:47.2712686Z GLIBCXX_3.4.9 2025-05-07T20:10:47.2712946Z GLIBCXX_3.4.21 2025-05-07T20:10:47.2716966Z 2025-05-07T20:10:47.2716971Z 2025-05-07T20:10:47.2736460Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.qb9rRhHunR.symbols.txt 2025-05-07T20:10:47.2737157Z 2025-05-07T20:10:47.2764658Z 2025-05-07T20:10:47.2796148Z [CHECK] Total Number of symbols: 116 2025-05-07T20:10:47.2810655Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:10:47.2831054Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.aYJbfthEBR.usymbols.txt 2025-05-07T20:10:47.2831632Z 2025-05-07T20:10:47.2850615Z 2025-05-07T20:10:47.2878507Z [CHECK] Listing out undefined symbols (55 total): 2025-05-07T20:10:47.2897665Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.2898270Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:47.2898623Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:47.2898978Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:47.2899308Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:47.2899657Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:47.2900110Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:47.2900464Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:47.2900791Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:10:47.2901296Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:47.2901632Z U c10::BoolType::get() 2025-05-07T20:10:47.2901977Z U c10::StringType::get() 2025-05-07T20:10:47.2902341Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:47.2903118Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:47.2904381Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:47.2905218Z U getenv@GLIBC_2.2.5 2025-05-07T20:10:47.2905528Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:47.2905861Z U memcpy@GLIBC_2.14 2025-05-07T20:10:47.2906158Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:47.2906494Z U memset@GLIBC_2.2.5 2025-05-07T20:10:47.2906819Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:47.2907208Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:47.2907673Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:47.2908420Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:10:47.2909296Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:47.2910176Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:47.2910986Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:47.2911399Z U std::__throw_invalid_argument(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.2911834Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.2912376Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.2912795Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.2913291Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:47.2914240Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.2915038Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:47.2915428Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:47.2915849Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:47.2916199Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:47.2916566Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:47.2916883Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:47.2917192Z U strtol@GLIBC_2.2.5 2025-05-07T20:10:47.2917507Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:47.2918342Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:47.2919561Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:10:47.2920734Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:47.2921424Z U typeinfo for std::invalid_argument@GLIBCXX_3.4 2025-05-07T20:10:47.2921861Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:47.2922499Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:47.2922963Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:47.2923582Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.2924289Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:47.2924788Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:47.2925131Z w _ITM_registerTMCloneTable 2025-05-07T20:10:47.2925478Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:47.2925794Z w __gmon_start__ 2025-05-07T20:10:47.2926118Z w __pthread_key_create@GLIBC_2.2.5 2025-05-07T20:10:47.2926493Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:47.2926974Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:47.2927284Z 2025-05-07T20:10:47.2941280Z linux-vdso.so.1 (0x00007ffdcedf3000) 2025-05-07T20:10:47.2941628Z libtorch.so => not found 2025-05-07T20:10:47.2941933Z libc10.so => not found 2025-05-07T20:10:47.2942197Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.2942506Z libc10_cuda.so => not found 2025-05-07T20:10:47.2942850Z libnccl.so.2 => not found 2025-05-07T20:10:47.2943158Z libcuda.so.1 => not found 2025-05-07T20:10:47.2943435Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.2943756Z libtorch_cpu.so => not found 2025-05-07T20:10:47.2944073Z libtorch_cuda.so => not found 2025-05-07T20:10:47.2944425Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f274a362000) 2025-05-07T20:10:47.2944888Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f274a30c000) 2025-05-07T20:10:47.2945340Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f274a2dc000) 2025-05-07T20:10:47.2945821Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f274a2d7000) 2025-05-07T20:10:47.2946241Z libc.so.6 => /lib64/libc.so.6 (0x00007f274a0cf000) 2025-05-07T20:10:47.2946627Z libm.so.6 => /lib64/libm.so.6 (0x00007f2749ff4000) 2025-05-07T20:10:47.2947027Z /lib64/ld-linux-x86-64.so.2 (0x00007f274a5d7000) 2025-05-07T20:10:47.2947276Z 2025-05-07T20:10:47.2947395Z [CHECK] Displaying ELF information: 2025-05-07T20:10:47.2947843Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:47.2948174Z 2025-05-07T20:10:47.2979318Z 2025-05-07T20:10:47.2980013Z Dynamic section at offset 0x8c98 contains 38 entries: 2025-05-07T20:10:47.2980434Z Tag Type Name/Value 2025-05-07T20:10:47.2980896Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:47.2981597Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:47.2982144Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:47.2982698Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:47.2983242Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:47.2983748Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:47.2984307Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:47.2984844Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:47.2985399Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:47.2985929Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:47.2986478Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:47.2987012Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:47.2987537Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:10:47.2988140Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:47.2988666Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_config.so] 2025-05-07T20:10:47.2989137Z 0x000000000000000c (INIT) 0x4000 2025-05-07T20:10:47.2989473Z 0x000000000000000d (FINI) 0x6f80 2025-05-07T20:10:47.2989837Z 0x0000000000000019 (INIT_ARRAY) 0x9bb0 2025-05-07T20:10:47.2990217Z 0x000000000000001b (INIT_ARRAYSZ) 16 (bytes) 2025-05-07T20:10:47.2990573Z 0x000000000000001a (FINI_ARRAY) 0x9bc0 2025-05-07T20:10:47.2990956Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:47.2991311Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:47.2991685Z 0x0000000000000005 (STRTAB) 0xed0 2025-05-07T20:10:47.2992023Z 0x0000000000000006 (SYMTAB) 0x3d8 2025-05-07T20:10:47.2992407Z 0x000000000000000a (STRSZ) 7795 (bytes) 2025-05-07T20:10:47.2992771Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:47.2993142Z 0x0000000000000003 (PLTGOT) 0x9fe8 2025-05-07T20:10:47.2993642Z 0x0000000000000002 (PLTRELSZ) 1632 (bytes) 2025-05-07T20:10:47.2993990Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:47.2994333Z 0x0000000000000017 (JMPREL) 0x33a0 2025-05-07T20:10:47.2994693Z 0x0000000000000007 (RELA) 0x2ef0 2025-05-07T20:10:47.2995062Z 0x0000000000000008 (RELASZ) 1200 (bytes) 2025-05-07T20:10:47.2995436Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:47.2995822Z 0x000000006ffffffe (VERNEED) 0x2e30 2025-05-07T20:10:47.2996162Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:47.2996525Z 0x000000006ffffff0 (VERSYM) 0x2d44 2025-05-07T20:10:47.2996885Z 0x000000006ffffff9 (RELACOUNT) 4 2025-05-07T20:10:47.2997420Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:47.2997636Z 2025-05-07T20:10:47.2997783Z ################################################################################ 2025-05-07T20:10:47.2998021Z 2025-05-07T20:10:47.2998025Z 2025-05-07T20:10:47.2998148Z ################################################################################ 2025-05-07T20:10:47.2998632Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:10:47.2999105Z [CHECK] Listing out library size: 2025-05-07T20:10:47.2999524Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:10:47.2999845Z 2025-05-07T20:10:47.3000119Z 6 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:10:47.3000509Z 2025-05-07T20:10:47.3001129Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:10:47.3002163Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.3002732Z 2025-05-07T20:10:47.3271500Z GLIBC_2.2.5 2025-05-07T20:10:47.3271756Z GLIBC_2.3 2025-05-07T20:10:47.3272004Z GLIBC_2.14 2025-05-07T20:10:47.3272865Z 2025-05-07T20:10:47.3273037Z 2025-05-07T20:10:47.3273671Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:10:47.3274626Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.3275214Z 2025-05-07T20:10:47.3543397Z GLIBCXX_3.4 2025-05-07T20:10:47.3543656Z GLIBCXX_3.4.9 2025-05-07T20:10:47.3543932Z GLIBCXX_3.4.11 2025-05-07T20:10:47.3544162Z GLIBCXX_3.4.14 2025-05-07T20:10:47.3544415Z GLIBCXX_3.4.15 2025-05-07T20:10:47.3544673Z GLIBCXX_3.4.18 2025-05-07T20:10:47.3544888Z GLIBCXX_3.4.21 2025-05-07T20:10:47.3545153Z 2025-05-07T20:10:47.3545363Z 2025-05-07T20:10:47.3567849Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so > /tmp/tmp.JSccDwuEiH.symbols.txt 2025-05-07T20:10:47.3568299Z 2025-05-07T20:10:47.3791264Z 2025-05-07T20:10:47.3820425Z [CHECK] Total Number of symbols: 4951 2025-05-07T20:10:47.3851886Z [CHECK] Number of fbgemm symbols: 3554 2025-05-07T20:10:47.3869010Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so > /tmp/tmp.xMpHdxlt27.usymbols.txt 2025-05-07T20:10:47.3870293Z 2025-05-07T20:10:47.3900990Z 2025-05-07T20:10:47.3927979Z [CHECK] Listing out undefined symbols (133 total): 2025-05-07T20:10:47.3944908Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:47.3945968Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:47.3946956Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:47.3947842Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:47.3948792Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:47.3949702Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:47.3950664Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:47.3951567Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:47.3952512Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:47.3953358Z U __cxa_init_primary_exception@CXXABI_1.3.11 2025-05-07T20:10:47.3953716Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:47.3954070Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:47.3954397Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:47.3954866Z U __extendhfsf2 2025-05-07T20:10:47.3955179Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:47.3955549Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:10:47.3955906Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:47.3956220Z U __truncsfhf2 2025-05-07T20:10:47.3956532Z U abort@GLIBC_2.2.5 2025-05-07T20:10:47.3957095Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:47.3957913Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:47.3958931Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:47.3960136Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:47.3961322Z U asmjit::_abi_1_13::BaseEmitter::emitArgsAssignment(asmjit::_abi_1_13::FuncFrame const&, asmjit::_abi_1_13::FuncArgsAssignment const&) 2025-05-07T20:10:47.3962158Z U asmjit::_abi_1_13::BaseEmitter::emitEpilog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:10:47.3962902Z U asmjit::_abi_1_13::BaseEmitter::emitProlog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:10:47.3963536Z U asmjit::_abi_1_13::CodeHolder::CodeHolder(asmjit::_abi_1_13::Support::Temporary const*) 2025-05-07T20:10:47.3964174Z U asmjit::_abi_1_13::CodeHolder::init(asmjit::_abi_1_13::Environment const&, unsigned long) 2025-05-07T20:10:47.3964720Z U asmjit::_abi_1_13::CodeHolder::~CodeHolder() 2025-05-07T20:10:47.3965291Z U asmjit::_abi_1_13::FuncArgsAssignment::updateFuncFrame(asmjit::_abi_1_13::FuncFrame&) const 2025-05-07T20:10:47.3966043Z U asmjit::_abi_1_13::FuncDetail::init(asmjit::_abi_1_13::FuncSignature const&, asmjit::_abi_1_13::Environment const&) 2025-05-07T20:10:47.3966655Z U asmjit::_abi_1_13::FuncFrame::finalize() 2025-05-07T20:10:47.3967121Z U asmjit::_abi_1_13::FuncFrame::init(asmjit::_abi_1_13::FuncDetail const&) 2025-05-07T20:10:47.3967741Z U asmjit::_abi_1_13::JitRuntime::JitRuntime(asmjit::_abi_1_13::JitAllocator::CreateParams const*) 2025-05-07T20:10:47.3968307Z U asmjit::_abi_1_13::JitRuntime::~JitRuntime() 2025-05-07T20:10:47.3968804Z U asmjit::_abi_1_13::x86::Assembler::Assembler(asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:10:47.3969299Z U asmjit::_abi_1_13::x86::Assembler::~Assembler() 2025-05-07T20:10:47.3969645Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:47.3969962Z U ceilf@GLIBC_2.2.5 2025-05-07T20:10:47.3970286Z U cpuinfo_get_packages 2025-05-07T20:10:47.3970599Z U cpuinfo_get_packages_count 2025-05-07T20:10:47.3970937Z U cpuinfo_initialize 2025-05-07T20:10:47.3971228Z U cpuinfo_isa 2025-05-07T20:10:47.3971524Z U floor@GLIBC_2.2.5 2025-05-07T20:10:47.3971807Z U fma@GLIBC_2.2.5 2025-05-07T20:10:47.3972112Z U fmaf@GLIBC_2.2.5 2025-05-07T20:10:47.3972402Z U free@GLIBC_2.2.5 2025-05-07T20:10:47.3972711Z U fwrite@GLIBC_2.2.5 2025-05-07T20:10:47.3973025Z U getenv@GLIBC_2.2.5 2025-05-07T20:10:47.3973314Z U ldexp@GLIBC_2.2.5 2025-05-07T20:10:47.3973622Z U log2@GLIBC_2.2.5 2025-05-07T20:10:47.3973903Z U log2f@GLIBC_2.2.5 2025-05-07T20:10:47.3974214Z U lrintf@GLIBC_2.2.5 2025-05-07T20:10:47.3974509Z U memcpy@GLIBC_2.14 2025-05-07T20:10:47.3974856Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:47.3975158Z U memset@GLIBC_2.2.5 2025-05-07T20:10:47.3975484Z U nearbyint@GLIBC_2.2.5 2025-05-07T20:10:47.3975796Z U nearbyintf@GLIBC_2.2.5 2025-05-07T20:10:47.3976246Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:47.3976600Z U operator delete[](void*)@GLIBCXX_3.4 2025-05-07T20:10:47.3976938Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:47.3978784Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:47.3979166Z U posix_memalign@GLIBC_2.2.5 2025-05-07T20:10:47.3979473Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:10:47.3979967Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:47.3980642Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:47.3981215Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:47.3981927Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:10:47.3993537Z U std::__atomic_futex_unsigned_base::_M_futex_notify_all(unsigned int*)@GLIBCXX_3.4.21 2025-05-07T20:10:47.3994617Z U std::__atomic_futex_unsigned_base::_M_futex_wait_until(unsigned int*, unsigned int, bool, std::chrono::duration >, std::chrono::duration >)@GLIBCXX_3.4.21 2025-05-07T20:10:47.3995904Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:47.3996641Z U std::__detail::_Prime_rehash_policy::_M_next_bkt(unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:47.3997179Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:10:47.3997753Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:10:47.3998260Z U std::__exception_ptr::exception_ptr::exception_ptr(void*)@CXXABI_1.3.11 2025-05-07T20:10:47.3998948Z U std::__future_base::_Result_base::_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:10:47.3999479Z U std::__future_base::_Result_base::~_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:10:47.3999933Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:10:47.4000290Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:10:47.4000716Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:47.4001074Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:47.4001457Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:10:47.4001836Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:47.4002264Z U std::__throw_future_error(int)@GLIBCXX_3.4.14 2025-05-07T20:10:47.4002697Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.4003097Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:47.4003509Z U std::bad_alloc::~bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:47.4004347Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.4005169Z U std::cerr@GLIBCXX_3.4 2025-05-07T20:10:47.4005524Z U std::cout@GLIBCXX_3.4 2025-05-07T20:10:47.4005898Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:10:47.4006330Z U std::future_category()@GLIBCXX_3.4.15 2025-05-07T20:10:47.4006716Z U std::future_error::~future_error()@GLIBCXX_3.4.14 2025-05-07T20:10:47.4007164Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:47.4007558Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:47.4008219Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:47.4008979Z U std::logic_error::logic_error(std::logic_error const&)@GLIBCXX_3.4.21 2025-05-07T20:10:47.4009538Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:10:47.4010075Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.4010651Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.4011132Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:10:47.4011525Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:47.4011895Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:10:47.4012389Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:10:47.4012953Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:47.4013402Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:47.4013809Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:47.4014174Z U stderr@GLIBC_2.2.5 2025-05-07T20:10:47.4014510Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:47.4014813Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:47.4015138Z U strstr@GLIBC_2.2.5 2025-05-07T20:10:47.4015466Z U tolower@GLIBC_2.2.5 2025-05-07T20:10:47.4015776Z U toupper@GLIBC_2.2.5 2025-05-07T20:10:47.4016186Z U typeinfo for std::__future_base::_Result_base@GLIBCXX_3.4.15 2025-05-07T20:10:47.4016720Z U typeinfo for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:10:47.4017128Z U typeinfo for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:10:47.4017612Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:47.4018015Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:47.4018451Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:47.4018831Z U vtable for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:10:47.4019220Z U vtable for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:10:47.4019571Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:47.4020039Z w _ITM_registerTMCloneTable 2025-05-07T20:10:47.4020534Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:47.4020886Z w __gmon_start__ 2025-05-07T20:10:47.4021212Z w __pthread_key_create 2025-05-07T20:10:47.4021543Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:47.4021922Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:47.4022447Z w pthread_once 2025-05-07T20:10:47.4022776Z w pthread_rwlock_rdlock 2025-05-07T20:10:47.4023095Z w pthread_rwlock_unlock 2025-05-07T20:10:47.4023444Z w pthread_rwlock_wrlock 2025-05-07T20:10:47.4023765Z w pthread_self@GLIBC_2.2.5 2025-05-07T20:10:47.4024167Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:47.4024629Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:10:47.4024893Z 2025-05-07T20:10:47.4025046Z linux-vdso.so.1 (0x00007ffc2a17a000) 2025-05-07T20:10:47.4025380Z libc10.so => not found 2025-05-07T20:10:47.4025644Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.4025966Z libc10_cuda.so => not found 2025-05-07T20:10:47.4026249Z libnccl.so.2 => not found 2025-05-07T20:10:47.4026551Z libcuda.so.1 => not found 2025-05-07T20:10:47.4027177Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007fe39f3da000) 2025-05-07T20:10:47.4027799Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.4028124Z libtorch.so => not found 2025-05-07T20:10:47.4028398Z libtorch_cpu.so => not found 2025-05-07T20:10:47.4028710Z libtorch_cuda.so => not found 2025-05-07T20:10:47.4029060Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fe39eb9c000) 2025-05-07T20:10:47.4029502Z libm.so.6 => /lib64/libm.so.6 (0x00007fe39eac1000) 2025-05-07T20:10:47.4029943Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fe39f3aa000) 2025-05-07T20:10:47.4030367Z libc.so.6 => /lib64/libc.so.6 (0x00007fe39e8b9000) 2025-05-07T20:10:47.4030773Z /lib64/ld-linux-x86-64.so.2 (0x00007fe39f457000) 2025-05-07T20:10:47.4031121Z libtorch.so => not found 2025-05-07T20:10:47.4031420Z libc10.so => not found 2025-05-07T20:10:47.4031685Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.4031997Z libc10_cuda.so => not found 2025-05-07T20:10:47.4032274Z libnccl.so.2 => not found 2025-05-07T20:10:47.4032585Z libcuda.so.1 => not found 2025-05-07T20:10:47.4032864Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.4033180Z libtorch_cpu.so => not found 2025-05-07T20:10:47.4033472Z libtorch_cuda.so => not found 2025-05-07T20:10:47.4033847Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fe39e863000) 2025-05-07T20:10:47.4034484Z librt.so.1 => /lib64/librt.so.1 (0x00007fe39f3a1000) 2025-05-07T20:10:47.4034924Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fe39f39c000) 2025-05-07T20:10:47.4035200Z 2025-05-07T20:10:47.4035347Z [CHECK] Displaying ELF information: 2025-05-07T20:10:47.4035717Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:10:47.4036022Z 2025-05-07T20:10:47.4047438Z 2025-05-07T20:10:47.4048082Z Dynamic section at offset 0x54d6c8 contains 40 entries: 2025-05-07T20:10:47.4049224Z Tag Type Name/Value 2025-05-07T20:10:47.4050476Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:47.4052005Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:47.4053502Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:47.4054236Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:47.4054787Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:47.4055309Z 0x0000000000000001 (NEEDED) Shared library: [asmjit.so] 2025-05-07T20:10:47.4055868Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:47.4056406Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:47.4057009Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:47.4057548Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:47.4058107Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:47.4058658Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:47.4059177Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:47.4059717Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:47.4060325Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:47.4060895Z 0x000000000000000e (SONAME) Library soname: [fbgemm.so] 2025-05-07T20:10:47.4061400Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:47.4061854Z 0x000000000000000c (INIT) 0xff000 2025-05-07T20:10:47.4062233Z 0x000000000000000d (FINI) 0x4c1c58 2025-05-07T20:10:47.4062589Z 0x0000000000000019 (INIT_ARRAY) 0x54a1c0 2025-05-07T20:10:47.4062991Z 0x000000000000001b (INIT_ARRAYSZ) 1224 (bytes) 2025-05-07T20:10:47.4063361Z 0x000000000000001a (FINI_ARRAY) 0x54a688 2025-05-07T20:10:47.4063790Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:47.4064148Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:47.4064523Z 0x0000000000000005 (STRTAB) 0x26de0 2025-05-07T20:10:47.4064893Z 0x0000000000000006 (SYMTAB) 0x9da0 2025-05-07T20:10:47.4065262Z 0x000000000000000a (STRSZ) 754246 (bytes) 2025-05-07T20:10:47.4065664Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:47.4066022Z 0x0000000000000003 (PLTGOT) 0x551fe8 2025-05-07T20:10:47.4066451Z 0x0000000000000002 (PLTRELSZ) 25992 (bytes) 2025-05-07T20:10:47.4066813Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:47.4067185Z 0x0000000000000017 (JMPREL) 0xf8458 2025-05-07T20:10:47.4067526Z 0x0000000000000007 (RELA) 0xe1838 2025-05-07T20:10:47.4067917Z 0x0000000000000008 (RELASZ) 93216 (bytes) 2025-05-07T20:10:47.4068318Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:47.4068683Z 0x000000006ffffffe (VERNEED) 0xe16d8 2025-05-07T20:10:47.4069059Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:47.4069401Z 0x000000006ffffff0 (VERSYM) 0xdf026 2025-05-07T20:10:47.4069775Z 0x000000006ffffff9 (RELACOUNT) 155 2025-05-07T20:10:47.4070104Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:47.4070343Z 2025-05-07T20:10:47.4070466Z ################################################################################ 2025-05-07T20:10:47.4070729Z 2025-05-07T20:10:47.4070735Z 2025-05-07T20:10:47.4070885Z ################################################################################ 2025-05-07T20:10:47.4071395Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:47.4071922Z [CHECK] Listing out library size: 2025-05-07T20:10:47.4072393Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:47.4072802Z 2025-05-07T20:10:47.4073021Z 3 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:47.4073333Z 2025-05-07T20:10:47.4073752Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:47.4074743Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.4075461Z 2025-05-07T20:10:47.4130047Z GLIBC_2.2.5 2025-05-07T20:10:47.4130853Z GLIBC_2.14 2025-05-07T20:10:47.4131253Z 2025-05-07T20:10:47.4131267Z 2025-05-07T20:10:47.4132476Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:47.4135793Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.4137396Z 2025-05-07T20:10:47.4198410Z GLIBCXX_3.4 2025-05-07T20:10:47.4199128Z GLIBCXX_3.4.9 2025-05-07T20:10:47.4199765Z GLIBCXX_3.4.14 2025-05-07T20:10:47.4200354Z GLIBCXX_3.4.20 2025-05-07T20:10:47.4200960Z GLIBCXX_3.4.21 2025-05-07T20:10:47.4201424Z 2025-05-07T20:10:47.4201429Z 2025-05-07T20:10:47.4220653Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.05KcrLnQAJ.symbols.txt 2025-05-07T20:10:47.4256662Z 2025-05-07T20:10:47.4256671Z 2025-05-07T20:10:47.4285983Z [CHECK] Total Number of symbols: 550 2025-05-07T20:10:47.4300686Z [CHECK] Number of fbgemm symbols: 48 2025-05-07T20:10:47.4318813Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.hCtChHu933.usymbols.txt 2025-05-07T20:10:47.4320289Z 2025-05-07T20:10:47.4340875Z 2025-05-07T20:10:47.4370323Z [CHECK] Listing out undefined symbols (179 total): 2025-05-07T20:10:47.4394122Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.4394912Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:47.4395321Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:47.4395744Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:47.4396172Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:47.4396572Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:47.4397014Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:47.4397464Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:47.4397843Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:47.4398258Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:47.4398808Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:47.4399144Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:47.4399453Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:47.4399785Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:47.4400106Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:47.4400447Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:47.4400782Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:47.4401090Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:47.4401413Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:47.4401771Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:47.4402284Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:10:47.4402843Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:47.4403321Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:47.4404224Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.4405341Z U at::_ops::is_nonzero::call(at::Tensor const&) 2025-05-07T20:10:47.4405751Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:47.4406256Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:47.4406888Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:10:47.4407959Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:47.4408834Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:47.4409605Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.4410424Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:47.4410798Z U at::get_num_threads() 2025-05-07T20:10:47.4411096Z U at::get_thread_num() 2025-05-07T20:10:47.4411429Z U at::internal::set_thread_num(int) 2025-05-07T20:10:47.4411780Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:10:47.4412151Z U c10::BoolType::get() 2025-05-07T20:10:47.4412502Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:47.4413138Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:47.4413723Z U c10::Error::what() const 2025-05-07T20:10:47.4414069Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.4414547Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.4414967Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:47.4415333Z U c10::IntType::get() 2025-05-07T20:10:47.4415717Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:47.4416107Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:47.4416598Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:47.4417054Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:47.4417432Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:47.4417816Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:47.4418199Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:47.4418853Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:47.4419469Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:47.4419965Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:47.4420522Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:47.4420887Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:10:47.4421524Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:47.4421859Z U c10::SymIntType::get() 2025-05-07T20:10:47.4422455Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:47.4422833Z U c10::TensorType::get() 2025-05-07T20:10:47.4423194Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:47.4424183Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:47.4425168Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:47.4425627Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:10:47.4426213Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:10:47.4426953Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:10:47.4427561Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:47.4429090Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:47.4429456Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:47.4429821Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:47.4430155Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:47.4430634Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:47.4431085Z U c10::cuda::device_count() 2025-05-07T20:10:47.4431448Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:47.4431823Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:47.4432223Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:47.4432627Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:47.4433018Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:47.4433414Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:47.4434118Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:47.4435016Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:47.4435858Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:47.4436753Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:47.4437798Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:47.4438603Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:47.4438930Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:47.4439277Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:47.4439612Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:47.4439981Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:10:47.4440368Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:47.4440743Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:47.4441139Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:47.4441496Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:10:47.4441890Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:10:47.4442222Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:47.4442644Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:47.4443093Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:47.4443456Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:47.4443837Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:47.4444183Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:47.4444547Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:47.4444876Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:47.4445238Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:47.4445608Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:47.4445939Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:47.4446296Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:47.4446629Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:47.4446991Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:47.4447370Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:47.4448412Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.4449983Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.4451987Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.4453683Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.4455482Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.4457315Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.4459176Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.4461106Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.4462956Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.4464804Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.4466786Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.4468570Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.4469764Z U fbgemm::fbgemmAlignedAlloc(unsigned long, unsigned long, bool) 2025-05-07T20:10:47.4470228Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:10:47.4470713Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:10:47.4471259Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.4471675Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.4472107Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.4472519Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.4472985Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:47.4473446Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.4473845Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.4474240Z U memcpy@GLIBC_2.14 2025-05-07T20:10:47.4474565Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:47.4474867Z U memset@GLIBC_2.2.5 2025-05-07T20:10:47.4475190Z U omp_get_max_threads@OMP_1.0 2025-05-07T20:10:47.4475516Z U omp_get_thread_num@OMP_1.0 2025-05-07T20:10:47.4475875Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:47.4476228Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:47.4476837Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:47.4477707Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:47.4478319Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:47.4478704Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:47.4479123Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.4479547Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.4479994Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:47.4480516Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:47.4481482Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.4482384Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:47.4482752Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:47.4483111Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:47.4483449Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:47.4483804Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:47.4484194Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.4484728Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.4485213Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:47.4485726Z U std::pair fbgemm::radix_sort_parallel(int*, int*, int*, int*, long, long, bool) 2025-05-07T20:10:47.4486658Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:10:47.4487738Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:10:47.4488613Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:47.4488954Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:47.4489277Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:47.4490300Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:47.4491521Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:47.4492376Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:47.4493185Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:47.4493807Z U typeinfo for c10::Error 2025-05-07T20:10:47.4494160Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:47.4494618Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.4495089Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:47.4495546Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:47.4496009Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:47.4496402Z U vtable for c10::Error 2025-05-07T20:10:47.4496973Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.4497664Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:47.4498160Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:47.4498502Z w _ITM_registerTMCloneTable 2025-05-07T20:10:47.4498856Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:47.4499236Z w __gmon_start__ 2025-05-07T20:10:47.4499530Z w __pthread_key_create 2025-05-07T20:10:47.4500034Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:47.4500540Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:47.4500866Z 2025-05-07T20:10:47.4500989Z linux-vdso.so.1 (0x00007ffe1e7f6000) 2025-05-07T20:10:47.4501327Z libc10.so => not found 2025-05-07T20:10:47.4501638Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.4501944Z libc10_cuda.so => not found 2025-05-07T20:10:47.4502216Z libnccl.so.2 => not found 2025-05-07T20:10:47.4502510Z libcuda.so.1 => not found 2025-05-07T20:10:47.4503054Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f5eab600000) 2025-05-07T20:10:47.4504012Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f5eabf6f000) 2025-05-07T20:10:47.4504712Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.4505002Z libtorch.so => not found 2025-05-07T20:10:47.4505292Z libtorch_cpu.so => not found 2025-05-07T20:10:47.4505572Z libtorch_cuda.so => not found 2025-05-07T20:10:47.4505872Z libcudart.so.12 => not found 2025-05-07T20:10:47.4506224Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f5eab39c000) 2025-05-07T20:10:47.4506682Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f5eabbaa000) 2025-05-07T20:10:47.4507152Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f5eabf3f000) 2025-05-07T20:10:47.4507547Z libc.so.6 => /lib64/libc.so.6 (0x00007f5eab194000) 2025-05-07T20:10:47.4507911Z libc10.so => not found 2025-05-07T20:10:47.4508167Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.4508471Z libc10_cuda.so => not found 2025-05-07T20:10:47.4508743Z libnccl.so.2 => not found 2025-05-07T20:10:47.4509032Z libcuda.so.1 => not found 2025-05-07T20:10:47.4509568Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f5eab11d000) 2025-05-07T20:10:47.4510178Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.4510467Z libtorch.so => not found 2025-05-07T20:10:47.4510757Z libtorch_cpu.so => not found 2025-05-07T20:10:47.4511059Z libtorch_cuda.so => not found 2025-05-07T20:10:47.4511367Z libm.so.6 => /lib64/libm.so.6 (0x00007f5eab042000) 2025-05-07T20:10:47.4511770Z /lib64/ld-linux-x86-64.so.2 (0x00007f5eabf80000) 2025-05-07T20:10:47.4512116Z libtorch.so => not found 2025-05-07T20:10:47.4512402Z libc10.so => not found 2025-05-07T20:10:47.4512699Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.4512975Z libc10_cuda.so => not found 2025-05-07T20:10:47.4513306Z libnccl.so.2 => not found 2025-05-07T20:10:47.4513569Z libcuda.so.1 => not found 2025-05-07T20:10:47.4513867Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.4514181Z libtorch_cpu.so => not found 2025-05-07T20:10:47.4514466Z libtorch_cuda.so => not found 2025-05-07T20:10:47.4514845Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f5eabf32000) 2025-05-07T20:10:47.4515265Z libtorch.so => not found 2025-05-07T20:10:47.4515520Z libc10.so => not found 2025-05-07T20:10:47.4515799Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.4516069Z libc10_cuda.so => not found 2025-05-07T20:10:47.4516346Z libnccl.so.2 => not found 2025-05-07T20:10:47.4516615Z libcuda.so.1 => not found 2025-05-07T20:10:47.4516911Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.4517204Z libtorch_cpu.so => not found 2025-05-07T20:10:47.4517515Z libtorch_cuda.so => not found 2025-05-07T20:10:47.4517827Z librt.so.1 => /lib64/librt.so.1 (0x00007f5eabf29000) 2025-05-07T20:10:47.4518097Z 2025-05-07T20:10:47.4518219Z [CHECK] Displaying ELF information: 2025-05-07T20:10:47.4518677Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:47.4519028Z 2025-05-07T20:10:47.4519033Z 2025-05-07T20:10:47.4519191Z Dynamic section at offset 0x2b5a90 contains 41 entries: 2025-05-07T20:10:47.4519626Z Tag Type Name/Value 2025-05-07T20:10:47.4520046Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:47.4520593Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:47.4521123Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:47.4521632Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:47.4522307Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:47.4522808Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:47.4523415Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:47.4523953Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:47.4524488Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:47.4524997Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:47.4525508Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:47.4526030Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:47.4526540Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:47.4527051Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:47.4527607Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:47.4528103Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:47.4528628Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:47.4529146Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:47.4529562Z 0x000000000000000c (INIT) 0x16000 2025-05-07T20:10:47.4529878Z 0x000000000000000d (FINI) 0x6243c 2025-05-07T20:10:47.4530222Z 0x0000000000000019 (INIT_ARRAY) 0x2b5a40 2025-05-07T20:10:47.4530565Z 0x000000000000001b (INIT_ARRAYSZ) 72 (bytes) 2025-05-07T20:10:47.4530920Z 0x000000000000001a (FINI_ARRAY) 0x2b5a88 2025-05-07T20:10:47.4531272Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:47.4531609Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:47.4531942Z 0x0000000000000005 (STRTAB) 0x40a0 2025-05-07T20:10:47.4532262Z 0x0000000000000006 (SYMTAB) 0xcf8 2025-05-07T20:10:47.4532613Z 0x000000000000000a (STRSZ) 48233 (bytes) 2025-05-07T20:10:47.4532962Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:47.4533345Z 0x0000000000000003 (PLTGOT) 0x2b6fe8 2025-05-07T20:10:47.4533696Z 0x0000000000000002 (PLTRELSZ) 9240 (bytes) 2025-05-07T20:10:47.4534037Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:47.4534371Z 0x0000000000000017 (JMPREL) 0x13a68 2025-05-07T20:10:47.4534811Z 0x0000000000000007 (RELA) 0x10258 2025-05-07T20:10:47.4535172Z 0x0000000000000008 (RELASZ) 14352 (bytes) 2025-05-07T20:10:47.4535518Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:47.4535859Z 0x000000006ffffffe (VERNEED) 0x10158 2025-05-07T20:10:47.4536177Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:47.4536497Z 0x000000006ffffff0 (VERSYM) 0xfd0a 2025-05-07T20:10:47.4536816Z 0x000000006ffffff9 (RELACOUNT) 337 2025-05-07T20:10:47.4537135Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:47.4537323Z 2025-05-07T20:10:47.4537452Z ################################################################################ 2025-05-07T20:10:47.4537669Z 2025-05-07T20:10:47.4537673Z 2025-05-07T20:10:47.4537776Z ################################################################################ 2025-05-07T20:10:47.4538259Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:47.4538751Z [CHECK] Listing out library size: 2025-05-07T20:10:47.4539190Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:47.4539540Z 2025-05-07T20:10:47.4539804Z 21 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:47.4540098Z 2025-05-07T20:10:47.4540637Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:47.4541655Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.4542230Z 2025-05-07T20:10:47.4584304Z GLIBC_2.2.5 2025-05-07T20:10:47.4584583Z GLIBC_2.14 2025-05-07T20:10:47.4586291Z 2025-05-07T20:10:47.4586305Z 2025-05-07T20:10:47.4586792Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:47.4587802Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.4588422Z 2025-05-07T20:10:47.4662612Z GLIBCXX_3.4 2025-05-07T20:10:47.4663655Z GLIBCXX_3.4.9 2025-05-07T20:10:47.4664363Z GLIBCXX_3.4.11 2025-05-07T20:10:47.4664944Z GLIBCXX_3.4.20 2025-05-07T20:10:47.4665534Z GLIBCXX_3.4.21 2025-05-07T20:10:47.4666254Z 2025-05-07T20:10:47.4666269Z 2025-05-07T20:10:47.4687825Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.WqcNKg7xZl.symbols.txt 2025-05-07T20:10:47.4688296Z 2025-05-07T20:10:47.4730212Z 2025-05-07T20:10:47.4758745Z [CHECK] Total Number of symbols: 783 2025-05-07T20:10:47.4776302Z [CHECK] Number of fbgemm symbols: 73 2025-05-07T20:10:47.4796255Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.cj1mu4jFYS.usymbols.txt 2025-05-07T20:10:47.4796765Z 2025-05-07T20:10:47.4819408Z 2025-05-07T20:10:47.4848638Z [CHECK] Listing out undefined symbols (147 total): 2025-05-07T20:10:47.4870306Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.4870977Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:47.4871357Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:47.4871760Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:47.4872183Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:47.4872570Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:47.4873161Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:47.4873526Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:47.4873918Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:47.4874284Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:47.4874600Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:47.4874934Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:47.4875251Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:47.4875589Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:47.4875910Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:47.4876244Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:47.4876561Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:47.4876934Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:47.4877358Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:47.4878109Z U at::_ops::arange::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.4879357Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.4880716Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.4881762Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:47.4882778Z U at::_ops::full_like::call(at::Tensor const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.4883947Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:10:47.4884588Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:10:47.4885489Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.4886583Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.4887433Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:47.4887824Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:10:47.4888334Z U c10::BoolType::get() 2025-05-07T20:10:47.4888687Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:47.4889090Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:47.4889483Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.4889922Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:47.4890284Z U c10::IntType::get() 2025-05-07T20:10:47.4890697Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:47.4891195Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:47.4891600Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:47.4892267Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:47.4892927Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:47.4893316Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:47.4893966Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:47.4894322Z U c10::TensorType::get() 2025-05-07T20:10:47.4894651Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:47.4895586Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:47.4896553Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:47.4896919Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:47.4897259Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:47.4897603Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:47.4897948Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:47.4898278Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:47.4898748Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:47.4899206Z U c10::cuda::current_device() 2025-05-07T20:10:47.4899889Z U c10::cuda::device_count() 2025-05-07T20:10:47.4900250Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:47.4900655Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:47.4901097Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:47.4901491Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:47.4901943Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:47.4902327Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:47.4903083Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:47.4903973Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:47.4904835Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:47.4905793Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:47.4906941Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:47.4907763Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:47.4908104Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:47.4908463Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:47.4908894Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:47.4909400Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:47.4909741Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:47.4910115Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:47.4910448Z U c10::throwNullDataPtrError() 2025-05-07T20:10:47.4910928Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:47.4911255Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:47.4911656Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:47.4912131Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:47.4912476Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:47.4912878Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:47.4913239Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:47.4913607Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:47.4913963Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:47.4914300Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:47.4914642Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:47.4914978Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:47.4915517Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:47.4915862Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:47.4916215Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:47.4916568Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:47.4916926Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:47.4917284Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:47.4917623Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:10:47.4917985Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:10:47.4918781Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:47.4919489Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:47.4919837Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:47.4920182Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:47.4920546Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:47.4920902Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:47.4921399Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.4921879Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.4922461Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.4922937Z U log2f@GLIBC_2.2.5 2025-05-07T20:10:47.4923331Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:47.4923768Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.4924160Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.4924530Z U memcpy@GLIBC_2.14 2025-05-07T20:10:47.4924814Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:47.4925117Z U memset@GLIBC_2.2.5 2025-05-07T20:10:47.4925421Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:47.4925777Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:47.4926368Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:47.4927326Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:47.4927967Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:47.4928374Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.4929023Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.4929455Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:47.4929856Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:47.4930335Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:47.4931212Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.4931996Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:47.4932412Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:47.4932754Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:47.4933109Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:47.4933506Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.4934054Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.4934529Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:47.4934842Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:47.4935182Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:47.4935968Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:47.4937082Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:47.4937898Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:47.4938642Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:47.4939329Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.4939885Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:47.4940481Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:47.4940960Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:47.4941623Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.4942333Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:47.4942824Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:47.4943159Z w _ITM_registerTMCloneTable 2025-05-07T20:10:47.4943498Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:47.4943804Z w __gmon_start__ 2025-05-07T20:10:47.4944092Z w __pthread_key_create 2025-05-07T20:10:47.4944399Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:47.4944744Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:47.4945122Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:47.4945574Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:47.4945917Z 2025-05-07T20:10:47.4946073Z linux-vdso.so.1 (0x00007fff4af32000) 2025-05-07T20:10:47.4946476Z libtorch.so => not found 2025-05-07T20:10:47.4946723Z libc10.so => not found 2025-05-07T20:10:47.4946953Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.4947225Z libc10_cuda.so => not found 2025-05-07T20:10:47.4947510Z libnccl.so.2 => not found 2025-05-07T20:10:47.4947755Z libcuda.so.1 => not found 2025-05-07T20:10:47.4948027Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.4948451Z libtorch_cpu.so => not found 2025-05-07T20:10:47.4948732Z libtorch_cuda.so => not found 2025-05-07T20:10:47.4948984Z libcudart.so.12 => not found 2025-05-07T20:10:47.4949305Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f21db59c000) 2025-05-07T20:10:47.4949675Z libm.so.6 => /lib64/libm.so.6 (0x00007f21dcf05000) 2025-05-07T20:10:47.4950041Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f21db546000) 2025-05-07T20:10:47.4950414Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f21dced7000) 2025-05-07T20:10:47.4950785Z libc.so.6 => /lib64/libc.so.6 (0x00007f21db33e000) 2025-05-07T20:10:47.4951141Z /lib64/ld-linux-x86-64.so.2 (0x00007f21dcfe8000) 2025-05-07T20:10:47.4951361Z 2025-05-07T20:10:47.4951466Z [CHECK] Displaying ELF information: 2025-05-07T20:10:47.4951913Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:47.4952229Z 2025-05-07T20:10:47.4956159Z 2025-05-07T20:10:47.4956437Z Dynamic section at offset 0x14b76f0 contains 39 entries: 2025-05-07T20:10:47.4956987Z Tag Type Name/Value 2025-05-07T20:10:47.4957440Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:47.4958137Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:47.4958727Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:47.4959255Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:47.4959758Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:47.4960282Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:47.4960797Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:47.4961334Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:47.4961850Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:47.4962382Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:47.4962957Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:47.4963462Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:47.4963971Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:47.4964476Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:47.4964989Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:47.4965537Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:47.4965996Z 0x000000000000000c (INIT) 0x2d000 2025-05-07T20:10:47.4966342Z 0x000000000000000d (FINI) 0xd6d2c 2025-05-07T20:10:47.4966672Z 0x0000000000000019 (INIT_ARRAY) 0x14b5318 2025-05-07T20:10:47.4967037Z 0x000000000000001b (INIT_ARRAYSZ) 208 (bytes) 2025-05-07T20:10:47.4967385Z 0x000000000000001a (FINI_ARRAY) 0x14b53e8 2025-05-07T20:10:47.4967748Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:47.4968090Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:47.4968430Z 0x0000000000000005 (STRTAB) 0x5fa8 2025-05-07T20:10:47.4968752Z 0x0000000000000006 (SYMTAB) 0x1628 2025-05-07T20:10:47.4969110Z 0x000000000000000a (STRSZ) 113302 (bytes) 2025-05-07T20:10:47.4969483Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:47.4969858Z 0x0000000000000003 (PLTGOT) 0x14b7fe8 2025-05-07T20:10:47.4970229Z 0x0000000000000002 (PLTRELSZ) 10368 (bytes) 2025-05-07T20:10:47.4970685Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:47.4971022Z 0x0000000000000017 (JMPREL) 0x29e58 2025-05-07T20:10:47.4971340Z 0x0000000000000007 (RELA) 0x22160 2025-05-07T20:10:47.4971698Z 0x0000000000000008 (RELASZ) 31992 (bytes) 2025-05-07T20:10:47.4972047Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:47.4972406Z 0x000000006ffffffe (VERNEED) 0x22060 2025-05-07T20:10:47.4972919Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:47.4973239Z 0x000000006ffffff0 (VERSYM) 0x21a3e 2025-05-07T20:10:47.4973353Z 0x000000006ffffff9 (RELACOUNT) 498 2025-05-07T20:10:47.4973468Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:47.4973474Z 2025-05-07T20:10:47.4973590Z ################################################################################ 2025-05-07T20:10:47.4973597Z 2025-05-07T20:10:47.4973601Z 2025-05-07T20:10:47.4973711Z ################################################################################ 2025-05-07T20:10:47.4975808Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:47.4975914Z [CHECK] Listing out library size: 2025-05-07T20:10:47.4976193Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:47.4976198Z 2025-05-07T20:10:47.4976440Z 9 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:47.4976445Z 2025-05-07T20:10:47.4976843Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:47.4977364Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.4977371Z 2025-05-07T20:10:47.5038186Z GLIBC_2.2.5 2025-05-07T20:10:47.5038293Z GLIBC_2.3 2025-05-07T20:10:47.5038378Z GLIBC_2.14 2025-05-07T20:10:47.5038403Z 2025-05-07T20:10:47.5038412Z 2025-05-07T20:10:47.5038846Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:47.5039384Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.5039389Z 2025-05-07T20:10:47.5103688Z GLIBCXX_3.4 2025-05-07T20:10:47.5103948Z GLIBCXX_3.4.9 2025-05-07T20:10:47.5104201Z GLIBCXX_3.4.11 2025-05-07T20:10:47.5104434Z GLIBCXX_3.4.18 2025-05-07T20:10:47.5104679Z GLIBCXX_3.4.21 2025-05-07T20:10:47.5105977Z 2025-05-07T20:10:47.5105997Z 2025-05-07T20:10:47.5131937Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.o2QwXSxKPs.symbols.txt 2025-05-07T20:10:47.5131992Z 2025-05-07T20:10:47.5162301Z 2025-05-07T20:10:47.5189959Z [CHECK] Total Number of symbols: 347 2025-05-07T20:10:47.5203994Z [CHECK] Number of fbgemm symbols: 16 2025-05-07T20:10:47.5223100Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.KeNZN7UnL6.usymbols.txt 2025-05-07T20:10:47.5239946Z 2025-05-07T20:10:47.5239953Z 2025-05-07T20:10:47.5264113Z [CHECK] Listing out undefined symbols (124 total): 2025-05-07T20:10:47.5280995Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.5281748Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.5281971Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:47.5282138Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:47.5282318Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:47.5282642Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:47.5282785Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:47.5282921Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:47.5283067Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:47.5283207Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:47.5283314Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:47.5283446Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:47.5283555Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:47.5283663Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:47.5283776Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:47.5283910Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:47.5284012Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:47.5284151Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:47.5284348Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:47.5284542Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:47.5284766Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:47.5284890Z U c10::BoolType::get() 2025-05-07T20:10:47.5285055Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:47.5285160Z U c10::FloatType::get() 2025-05-07T20:10:47.5285405Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:47.5285578Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.5285723Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:47.5285843Z U c10::IntType::get() 2025-05-07T20:10:47.5286011Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:47.5286137Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:47.5286280Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:47.5286448Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:47.5286853Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:47.5287005Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:47.5287164Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:47.5287268Z U c10::TensorType::get() 2025-05-07T20:10:47.5287387Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:47.5288109Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:47.5288278Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:47.5288514Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:47.5288632Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:47.5288742Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:47.5288854Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:47.5288977Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:47.5289214Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:47.5289312Z U c10::cuda::device_count() 2025-05-07T20:10:47.5289455Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:47.5289583Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:47.5289767Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:47.5289912Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:47.5290060Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:47.5290169Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:47.5290666Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:47.5290907Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:47.5291371Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:47.5291705Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:47.5292255Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:47.5292408Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:47.5292512Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:47.5292632Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:47.5292770Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:47.5292918Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:47.5293021Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:47.5293208Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:47.5293349Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:47.5293478Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:47.5293598Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:47.5293734Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:47.5293840Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:47.5293950Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:47.5294082Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:47.5294240Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:47.5294358Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:47.5294501Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:47.5294610Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:47.5294717Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:47.5294838Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:47.5294969Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:47.5295084Z U float at::Tensor::item() const 2025-05-07T20:10:47.5295255Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.5295409Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.5295549Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.5295641Z U memcpy@GLIBC_2.14 2025-05-07T20:10:47.5295748Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:47.5295837Z U memset@GLIBC_2.2.5 2025-05-07T20:10:47.5295949Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:47.5296086Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:47.5296409Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:47.5296776Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:47.5297122Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:47.5297469Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:47.5297578Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:47.5297700Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:47.5297836Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.5297965Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.5298103Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:47.5298325Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:47.5298863Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.5299017Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:47.5299132Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:47.5299244Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:47.5299365Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:47.5299537Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.5299859Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.5300011Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:47.5300116Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:47.5300376Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:47.5300504Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:47.5301164Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:47.5301637Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:47.5301945Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:47.5302309Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:47.5302467Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:47.5302646Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:47.5302812Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:47.5303166Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.5303414Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:47.5303531Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:47.5303642Z w _ITM_registerTMCloneTable 2025-05-07T20:10:47.5303765Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:47.5303862Z w __gmon_start__ 2025-05-07T20:10:47.5303962Z w __pthread_key_create 2025-05-07T20:10:47.5304074Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:47.5304208Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:47.5304356Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:47.5304579Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:47.5304612Z 2025-05-07T20:10:47.5323172Z linux-vdso.so.1 (0x00007ffc8cad5000) 2025-05-07T20:10:47.5323500Z libtorch.so => not found 2025-05-07T20:10:47.5323753Z libc10.so => not found 2025-05-07T20:10:47.5324046Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.5324300Z libc10_cuda.so => not found 2025-05-07T20:10:47.5324555Z libnccl.so.2 => not found 2025-05-07T20:10:47.5324808Z libcuda.so.1 => not found 2025-05-07T20:10:47.5325113Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.5325385Z libtorch_cpu.so => not found 2025-05-07T20:10:47.5325661Z libtorch_cuda.so => not found 2025-05-07T20:10:47.5325959Z libcudart.so.12 => not found 2025-05-07T20:10:47.5326454Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f474cb9c000) 2025-05-07T20:10:47.5326893Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f474d79d000) 2025-05-07T20:10:47.5327414Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f474d76f000) 2025-05-07T20:10:47.5327543Z libc.so.6 => /lib64/libc.so.6 (0x00007f474c994000) 2025-05-07T20:10:47.5327676Z /lib64/ld-linux-x86-64.so.2 (0x00007f474d7fb000) 2025-05-07T20:10:47.5327798Z libm.so.6 => /lib64/libm.so.6 (0x00007f474c8b9000) 2025-05-07T20:10:47.5327970Z 2025-05-07T20:10:47.5328084Z [CHECK] Displaying ELF information: 2025-05-07T20:10:47.5328337Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:47.5328342Z 2025-05-07T20:10:47.5357574Z 2025-05-07T20:10:47.5358460Z Dynamic section at offset 0x8a7a10 contains 39 entries: 2025-05-07T20:10:47.5358844Z Tag Type Name/Value 2025-05-07T20:10:47.5359296Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:47.5359504Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:47.5359815Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:47.5360010Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:47.5360213Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:47.5360417Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:47.5360633Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:47.5360830Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:47.5361047Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:47.5361388Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:47.5361588Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:47.5361796Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:47.5361988Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:47.5362172Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:47.5362444Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:47.5362687Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:10:47.5362802Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:10:47.5362908Z 0x000000000000000d (FINI) 0x333cc 2025-05-07T20:10:47.5363042Z 0x0000000000000019 (INIT_ARRAY) 0x8a71f8 2025-05-07T20:10:47.5363166Z 0x000000000000001b (INIT_ARRAYSZ) 48 (bytes) 2025-05-07T20:10:47.5363282Z 0x000000000000001a (FINI_ARRAY) 0x8a7228 2025-05-07T20:10:47.5363417Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:47.5363529Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:47.5363637Z 0x0000000000000005 (STRTAB) 0x2a78 2025-05-07T20:10:47.5363758Z 0x0000000000000006 (SYMTAB) 0x9d8 2025-05-07T20:10:47.5363886Z 0x000000000000000a (STRSZ) 38407 (bytes) 2025-05-07T20:10:47.5364050Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:47.5364165Z 0x0000000000000003 (PLTGOT) 0x8a7fe8 2025-05-07T20:10:47.5364356Z 0x0000000000000002 (PLTRELSZ) 4728 (bytes) 2025-05-07T20:10:47.5364464Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:47.5364575Z 0x0000000000000017 (JMPREL) 0xe230 2025-05-07T20:10:47.5364701Z 0x0000000000000007 (RELA) 0xc448 2025-05-07T20:10:47.5364828Z 0x0000000000000008 (RELASZ) 7656 (bytes) 2025-05-07T20:10:47.5364947Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:47.5365086Z 0x000000006ffffffe (VERNEED) 0xc338 2025-05-07T20:10:47.5365193Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:47.5365304Z 0x000000006ffffff0 (VERSYM) 0xc080 2025-05-07T20:10:47.5365415Z 0x000000006ffffff9 (RELACOUNT) 136 2025-05-07T20:10:47.5365530Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:47.5365537Z 2025-05-07T20:10:47.5365658Z ################################################################################ 2025-05-07T20:10:47.5365664Z 2025-05-07T20:10:47.5365668Z 2025-05-07T20:10:47.5365810Z ################################################################################ 2025-05-07T20:10:47.5366091Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:47.5366199Z [CHECK] Listing out library size: 2025-05-07T20:10:47.5366456Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:47.5366462Z 2025-05-07T20:10:47.5371404Z 17 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:47.5371560Z 2025-05-07T20:10:47.5372097Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:47.5372594Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.5372602Z 2025-05-07T20:10:47.5428758Z GLIBC_2.2.5 2025-05-07T20:10:47.5428997Z GLIBC_2.14 2025-05-07T20:10:47.5430448Z 2025-05-07T20:10:47.5430751Z 2025-05-07T20:10:47.5432169Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:47.5433706Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.5433724Z 2025-05-07T20:10:47.5495279Z GLIBCXX_3.4 2025-05-07T20:10:47.5495565Z GLIBCXX_3.4.9 2025-05-07T20:10:47.5495830Z GLIBCXX_3.4.20 2025-05-07T20:10:47.5496135Z GLIBCXX_3.4.21 2025-05-07T20:10:47.5496142Z 2025-05-07T20:10:47.5496146Z 2025-05-07T20:10:47.5518108Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.Ww5UkPTtug.symbols.txt 2025-05-07T20:10:47.5518172Z 2025-05-07T20:10:47.5542415Z 2025-05-07T20:10:47.5569997Z [CHECK] Total Number of symbols: 452 2025-05-07T20:10:47.5583908Z [CHECK] Number of fbgemm symbols: 13 2025-05-07T20:10:47.5604970Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.ALjdSPmfC1.usymbols.txt 2025-05-07T20:10:47.5605005Z 2025-05-07T20:10:47.5622648Z 2025-05-07T20:10:47.5648543Z [CHECK] Listing out undefined symbols (149 total): 2025-05-07T20:10:47.5663628Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.5663763Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:47.5663945Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:47.5664115Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:47.5664272Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:47.5664417Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:47.5664706Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:47.5664831Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:47.5664967Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:47.5665105Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:47.5665210Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:47.5665317Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:47.5665422Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:47.5665541Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:47.5665645Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:47.5665748Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:47.5665861Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:47.5665956Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:47.5666064Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:47.5666317Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:47.5666493Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:47.5667138Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.5667799Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.5667963Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:47.5668137Z U at::_ops::mul_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:47.5668325Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:47.5668546Z U at::_ops::sub__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:47.5668660Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:47.5669155Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.5669771Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.5669874Z U c10::BoolType::get() 2025-05-07T20:10:47.5670050Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:47.5670194Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:47.5670293Z U c10::IntType::get() 2025-05-07T20:10:47.5670517Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:47.5670643Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:47.5670870Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:47.5671042Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:47.5671187Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:47.5671597Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:47.5671753Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:47.5671872Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:47.5671987Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:47.5672102Z U c10::SymIntType::get() 2025-05-07T20:10:47.5673352Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:47.5673512Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:47.5673632Z U c10::TensorType::get() 2025-05-07T20:10:47.5673791Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:47.5674519Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:47.5674654Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:47.5674775Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:47.5674910Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:47.5675024Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:47.5675143Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:47.5675259Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:47.5675527Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:47.5675666Z U c10::cuda::current_device() 2025-05-07T20:10:47.5675766Z U c10::cuda::device_count() 2025-05-07T20:10:47.5675922Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:47.5676058Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:47.5676204Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:47.5676359Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:47.5676518Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:47.5676630Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:47.5677166Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:47.5677533Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:47.5678020Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:47.5678369Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:47.5679244Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:47.5679377Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:47.5679481Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:47.5679625Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:47.5679821Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:47.5679936Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:47.5680078Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:47.5680205Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:47.5680328Z U c10::throwNullDataPtrError() 2025-05-07T20:10:47.5680429Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:47.5680534Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:47.5680730Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:47.5680841Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:47.5680963Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:47.5681095Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:47.5681246Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:47.5681353Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:47.5681486Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:47.5681592Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:47.5681699Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:47.5681815Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:47.5681948Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:47.5682051Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:47.5682163Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:47.5682306Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:47.5682418Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:47.5682525Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:47.5682645Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:47.5682753Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:10:47.5682857Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:10:47.5683151Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:47.5683281Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:47.5683385Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:47.5683490Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:47.5683624Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:47.5683737Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:47.5683854Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.5684002Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.5684090Z U log2@GLIBC_2.2.5 2025-05-07T20:10:47.5684255Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:47.5684379Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.5684532Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.5684621Z U memcpy@GLIBC_2.14 2025-05-07T20:10:47.5684711Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:47.5684811Z U memset@GLIBC_2.2.5 2025-05-07T20:10:47.5684917Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:47.5685050Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:47.5685384Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:47.5685748Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:47.5685860Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:47.5686023Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.5686154Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.5686315Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:47.5686552Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:47.5687084Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.5687204Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:47.5687328Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:47.5687439Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:47.5687571Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:47.5687692Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:47.5687863Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.5688086Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.5688217Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:47.5688321Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:47.5688412Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:47.5688542Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:47.5689089Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:47.5689524Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:47.5689784Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:47.5690146Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:47.5690267Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:47.5690428Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:47.5690580Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:47.5690727Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:47.5691045Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.5691258Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:47.5691366Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:47.5691479Z w _ITM_registerTMCloneTable 2025-05-07T20:10:47.5691577Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:47.5691662Z w __gmon_start__ 2025-05-07T20:10:47.5691763Z w __pthread_key_create 2025-05-07T20:10:47.5691900Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:47.5692120Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:47.5692127Z 2025-05-07T20:10:47.5714795Z linux-vdso.so.1 (0x00007ffca34d1000) 2025-05-07T20:10:47.5715151Z libtorch.so => not found 2025-05-07T20:10:47.5715400Z libc10.so => not found 2025-05-07T20:10:47.5715670Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.5715945Z libc10_cuda.so => not found 2025-05-07T20:10:47.5716199Z libnccl.so.2 => not found 2025-05-07T20:10:47.5716475Z libcuda.so.1 => not found 2025-05-07T20:10:47.5716774Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.5717241Z libtorch_cpu.so => not found 2025-05-07T20:10:47.5717518Z libtorch_cuda.so => not found 2025-05-07T20:10:47.5717791Z libcudart.so.12 => not found 2025-05-07T20:10:47.5718317Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fd069b9c000) 2025-05-07T20:10:47.5718698Z libm.so.6 => /lib64/libm.so.6 (0x00007fd06afd2000) 2025-05-07T20:10:47.5732700Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fd06af7c000) 2025-05-07T20:10:47.5732945Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fd06af4e000) 2025-05-07T20:10:47.5733083Z libc.so.6 => /lib64/libc.so.6 (0x00007fd069994000) 2025-05-07T20:10:47.5733217Z /lib64/ld-linux-x86-64.so.2 (0x00007fd06b0b5000) 2025-05-07T20:10:47.5733237Z 2025-05-07T20:10:47.5733367Z [CHECK] Displaying ELF information: 2025-05-07T20:10:47.5733607Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:47.5733729Z 2025-05-07T20:10:47.5752858Z 2025-05-07T20:10:47.5754132Z Dynamic section at offset 0x104fa28 contains 39 entries: 2025-05-07T20:10:47.5754643Z Tag Type Name/Value 2025-05-07T20:10:47.5755268Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:47.5755816Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:47.5756405Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:47.5756969Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:47.5757565Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:47.5758123Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:47.5758724Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:47.5759331Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:47.5759918Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:47.5760307Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:47.5761597Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:47.5761788Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:47.5761986Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:47.5762201Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:47.5762391Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:47.5762614Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:47.5762731Z 0x000000000000000c (INIT) 0x11000 2025-05-07T20:10:47.5762859Z 0x000000000000000d (FINI) 0x8746c 2025-05-07T20:10:47.5762981Z 0x0000000000000019 (INIT_ARRAY) 0x104ff20 2025-05-07T20:10:47.5763107Z 0x000000000000001b (INIT_ARRAYSZ) 96 (bytes) 2025-05-07T20:10:47.5763247Z 0x000000000000001a (FINI_ARRAY) 0x104ff80 2025-05-07T20:10:47.5763367Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:47.5763482Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:47.5763611Z 0x0000000000000005 (STRTAB) 0x3660 2025-05-07T20:10:47.5763718Z 0x0000000000000006 (SYMTAB) 0xbe8 2025-05-07T20:10:47.5763848Z 0x000000000000000a (STRSZ) 35790 (bytes) 2025-05-07T20:10:47.5764027Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:47.5764168Z 0x0000000000000003 (PLTGOT) 0x1050fe8 2025-05-07T20:10:47.5764304Z 0x0000000000000002 (PLTRELSZ) 6480 (bytes) 2025-05-07T20:10:47.5764414Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:47.5764543Z 0x0000000000000017 (JMPREL) 0xf060 2025-05-07T20:10:47.5764650Z 0x0000000000000007 (RELA) 0xc6a8 2025-05-07T20:10:47.5764822Z 0x0000000000000008 (RELASZ) 10680 (bytes) 2025-05-07T20:10:47.5764943Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:47.5765077Z 0x000000006ffffffe (VERNEED) 0xc5b8 2025-05-07T20:10:47.5765192Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:47.5765306Z 0x000000006ffffff0 (VERSYM) 0xc22e 2025-05-07T20:10:47.5765433Z 0x000000006ffffff9 (RELACOUNT) 116 2025-05-07T20:10:47.5765551Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:47.5765557Z 2025-05-07T20:10:47.5765676Z ################################################################################ 2025-05-07T20:10:47.5765681Z 2025-05-07T20:10:47.5765685Z 2025-05-07T20:10:47.5765810Z ################################################################################ 2025-05-07T20:10:47.5766123Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:47.5766260Z [CHECK] Listing out library size: 2025-05-07T20:10:47.5766783Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:47.5766788Z 2025-05-07T20:10:47.5768026Z 2 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:47.5768032Z 2025-05-07T20:10:47.5768470Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:47.5769033Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.5769038Z 2025-05-07T20:10:47.5821536Z GLIBC_2.2.5 2025-05-07T20:10:47.5821893Z GLIBC_2.14 2025-05-07T20:10:47.5822306Z 2025-05-07T20:10:47.5822327Z 2025-05-07T20:10:47.5823668Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:47.5825303Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.5825318Z 2025-05-07T20:10:47.5876623Z GLIBCXX_3.4 2025-05-07T20:10:47.5876873Z GLIBCXX_3.4.9 2025-05-07T20:10:47.5877108Z GLIBCXX_3.4.21 2025-05-07T20:10:47.5877142Z 2025-05-07T20:10:47.5877154Z 2025-05-07T20:10:47.5897641Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.AvNU5COrFh.symbols.txt 2025-05-07T20:10:47.5897648Z 2025-05-07T20:10:47.5918453Z 2025-05-07T20:10:47.5946790Z [CHECK] Total Number of symbols: 277 2025-05-07T20:10:47.5963252Z [CHECK] Number of fbgemm symbols: 44 2025-05-07T20:10:47.5981862Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.laxt0ej8xM.usymbols.txt 2025-05-07T20:10:47.5981875Z 2025-05-07T20:10:47.6002280Z 2025-05-07T20:10:47.6033534Z [CHECK] Listing out undefined symbols (127 total): 2025-05-07T20:10:47.6052250Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.6052361Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:47.6052521Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:47.6052688Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:47.6052823Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:47.6052968Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:47.6053234Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:47.6053398Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:47.6053575Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:47.6053679Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:47.6053804Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:47.6053917Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:47.6054076Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:47.6054259Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:47.6054376Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:47.6054572Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:10:47.6055182Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.6055830Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.6056114Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:47.6056286Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:47.6056795Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.6056913Z U at::get_thread_num() 2025-05-07T20:10:47.6057025Z U at::internal::set_thread_num(int) 2025-05-07T20:10:47.6057592Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.6057877Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:10:47.6058053Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.6058213Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:47.6058381Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.6058524Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:47.6058695Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:47.6058832Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:47.6058989Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:47.6059129Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:47.6059262Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:47.6059411Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:47.6059562Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:47.6059679Z U c10::TensorType::get() 2025-05-07T20:10:47.6059870Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:47.6060756Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:47.6060907Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:47.6061079Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:47.6061201Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:47.6061331Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:47.6061484Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:47.6061602Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:47.6061870Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:47.6061972Z U c10::cuda::device_count() 2025-05-07T20:10:47.6062116Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:47.6062280Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:47.6062442Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:47.6062587Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:47.6062747Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:47.6062875Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:47.6063401Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:47.6063656Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:47.6064168Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:47.6064548Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:47.6064683Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:47.6064795Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:47.6064951Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:47.6065117Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:47.6065256Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:47.6065401Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:47.6065537Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:47.6065671Z U c10::throwNullDataPtrError() 2025-05-07T20:10:47.6065780Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:47.6065893Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:47.6066157Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:47.6066279Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:47.6066439Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:47.6066583Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:47.6066721Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:47.6066837Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:47.6066982Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:47.6067097Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:47.6067212Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:47.6067337Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:47.6067480Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:47.6067591Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:47.6067717Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:47.6067853Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:47.6067970Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:47.6068083Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:47.6068380Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:47.6068519Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:47.6068656Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:47.6068773Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:47.6068918Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:47.6069039Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:47.6069182Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.6069327Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.6069529Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:47.6069667Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.6069785Z U memcpy@GLIBC_2.14 2025-05-07T20:10:47.6069886Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:47.6069982Z U memset@GLIBC_2.2.5 2025-05-07T20:10:47.6070098Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:47.6070239Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:47.6070586Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:47.6070981Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:47.6071115Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:47.6071288Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.6071429Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.6071687Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:47.6072266Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.6072515Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:47.6072629Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:47.6072740Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:47.6072848Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:47.6073034Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.6073142Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:47.6073236Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:47.6073363Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:47.6073976Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:47.6074408Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:47.6074668Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:47.6075005Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:47.6075165Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:47.6075317Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:47.6075464Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:47.6075782Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.6075995Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:47.6076126Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:47.6076231Z w _ITM_registerTMCloneTable 2025-05-07T20:10:47.6076344Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:47.6076424Z w __gmon_start__ 2025-05-07T20:10:47.6076510Z w __pthread_key_create 2025-05-07T20:10:47.6076663Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:47.6076884Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:47.6076893Z 2025-05-07T20:10:47.6102476Z linux-vdso.so.1 (0x00007ffd425fb000) 2025-05-07T20:10:47.6102806Z libc10.so => not found 2025-05-07T20:10:47.6103086Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.6103340Z libc10_cuda.so => not found 2025-05-07T20:10:47.6103618Z libnccl.so.2 => not found 2025-05-07T20:10:47.6103909Z libcuda.so.1 => not found 2025-05-07T20:10:47.6105569Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fc47d200000) 2025-05-07T20:10:47.6105855Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.6106129Z libtorch.so => not found 2025-05-07T20:10:47.6106395Z libtorch_cpu.so => not found 2025-05-07T20:10:47.6106663Z libtorch_cuda.so => not found 2025-05-07T20:10:47.6106949Z libcudart.so.12 => not found 2025-05-07T20:10:47.6107413Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fc47cf9c000) 2025-05-07T20:10:47.6107922Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fc47e484000) 2025-05-07T20:10:47.6108265Z libc.so.6 => /lib64/libc.so.6 (0x00007fc47cd94000) 2025-05-07T20:10:47.6108358Z libtorch.so => not found 2025-05-07T20:10:47.6108448Z libc10.so => not found 2025-05-07T20:10:47.6108541Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.6108651Z libc10_cuda.so => not found 2025-05-07T20:10:47.6108744Z libnccl.so.2 => not found 2025-05-07T20:10:47.6108836Z libcuda.so.1 => not found 2025-05-07T20:10:47.6108952Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.6109051Z libtorch_cpu.so => not found 2025-05-07T20:10:47.6109151Z libtorch_cuda.so => not found 2025-05-07T20:10:47.6109249Z libcudart.so.12 => not found 2025-05-07T20:10:47.6109391Z libm.so.6 => /lib64/libm.so.6 (0x00007fc47e3a5000) 2025-05-07T20:10:47.6109548Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fc47e34f000) 2025-05-07T20:10:47.6109679Z /lib64/ld-linux-x86-64.so.2 (0x00007fc47e661000) 2025-05-07T20:10:47.6109695Z 2025-05-07T20:10:47.6109815Z [CHECK] Displaying ELF information: 2025-05-07T20:10:47.6110084Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:47.6110090Z 2025-05-07T20:10:47.6138629Z 2025-05-07T20:10:47.6138939Z Dynamic section at offset 0x16eba8 contains 39 entries: 2025-05-07T20:10:47.6139160Z Tag Type Name/Value 2025-05-07T20:10:47.6140111Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:47.6140382Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:47.6140652Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:47.6140858Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:47.6141060Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:47.6141312Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:47.6141537Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:47.6141744Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:47.6141988Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:47.6142193Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:47.6142400Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:47.6142605Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:47.6143001Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:47.6143212Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:47.6143473Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:47.6143686Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:47.6143819Z 0x000000000000000c (INIT) 0xa000 2025-05-07T20:10:47.6144001Z 0x000000000000000d (FINI) 0x1a14c 2025-05-07T20:10:47.6144155Z 0x0000000000000019 (INIT_ARRAY) 0x16f890 2025-05-07T20:10:47.6144289Z 0x000000000000001b (INIT_ARRAYSZ) 32 (bytes) 2025-05-07T20:10:47.6144414Z 0x000000000000001a (FINI_ARRAY) 0x16f8b0 2025-05-07T20:10:47.6144542Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:47.6144685Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:47.6144804Z 0x0000000000000005 (STRTAB) 0x2108 2025-05-07T20:10:47.6144918Z 0x0000000000000006 (SYMTAB) 0x6f8 2025-05-07T20:10:47.6145079Z 0x000000000000000a (STRSZ) 20443 (bytes) 2025-05-07T20:10:47.6145206Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:47.6145330Z 0x0000000000000003 (PLTGOT) 0x16ffe8 2025-05-07T20:10:47.6145493Z 0x0000000000000002 (PLTRELSZ) 3936 (bytes) 2025-05-07T20:10:47.6145667Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:47.6145785Z 0x0000000000000017 (JMPREL) 0x8150 2025-05-07T20:10:47.6145898Z 0x0000000000000007 (RELA) 0x73d0 2025-05-07T20:10:47.6146059Z 0x0000000000000008 (RELASZ) 3456 (bytes) 2025-05-07T20:10:47.6146186Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:47.6146415Z 0x000000006ffffffe (VERNEED) 0x7310 2025-05-07T20:10:47.6146553Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:47.6146675Z 0x000000006ffffff0 (VERSYM) 0x70e4 2025-05-07T20:10:47.6146787Z 0x000000006ffffff9 (RELACOUNT) 7 2025-05-07T20:10:47.6146893Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:47.6146918Z 2025-05-07T20:10:47.6147036Z ################################################################################ 2025-05-07T20:10:47.6147041Z 2025-05-07T20:10:47.6147045Z 2025-05-07T20:10:47.6147161Z ################################################################################ 2025-05-07T20:10:47.6147523Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:47.6147691Z [CHECK] Listing out library size: 2025-05-07T20:10:47.6148017Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:47.6148021Z 2025-05-07T20:10:47.6153718Z 11 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:47.6153777Z 2025-05-07T20:10:47.6156008Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:47.6156575Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.6156603Z 2025-05-07T20:10:47.6618018Z GLIBC_2.2.5 2025-05-07T20:10:47.6618135Z GLIBC_2.3 2025-05-07T20:10:47.6618227Z GLIBC_2.14 2025-05-07T20:10:47.6618265Z 2025-05-07T20:10:47.6618288Z 2025-05-07T20:10:47.6618809Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:47.6619383Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.6619388Z 2025-05-07T20:10:47.7090371Z GLIBCXX_3.4 2025-05-07T20:10:47.7091002Z GLIBCXX_3.4.9 2025-05-07T20:10:47.7091933Z GLIBCXX_3.4.11 2025-05-07T20:10:47.7092535Z GLIBCXX_3.4.15 2025-05-07T20:10:47.7093089Z GLIBCXX_3.4.18 2025-05-07T20:10:47.7093655Z GLIBCXX_3.4.20 2025-05-07T20:10:47.7094191Z GLIBCXX_3.4.21 2025-05-07T20:10:47.7094551Z 2025-05-07T20:10:47.7094565Z 2025-05-07T20:10:47.7109276Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.LimCJCpDHs.symbols.txt 2025-05-07T20:10:47.7109843Z 2025-05-07T20:10:47.7514860Z 2025-05-07T20:10:47.7561459Z [CHECK] Total Number of symbols: 4395 2025-05-07T20:10:47.7601544Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:10:47.7622364Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.FlFnzgFRfw.usymbols.txt 2025-05-07T20:10:47.7622949Z 2025-05-07T20:10:47.7660715Z 2025-05-07T20:10:47.7690828Z [CHECK] Listing out undefined symbols (185 total): 2025-05-07T20:10:47.7712572Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.7715076Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.7716661Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:47.7717616Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:47.7718544Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:47.7719503Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:47.7719827Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:47.7720173Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:47.7720512Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:47.7720859Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:47.7721205Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:47.7721525Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:47.7721853Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:47.7722358Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:47.7722806Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:47.7723194Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:47.7723642Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:47.7723998Z U at::RecordFunction::end() 2025-05-07T20:10:47.7724357Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:47.7724756Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:47.7725408Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:47.7726132Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:47.7726865Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:10:47.7727519Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:10:47.7728510Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.7729543Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:47.7730016Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:47.7730444Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:47.7730842Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:47.7731257Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:47.7731589Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:47.7731950Z U c10::AnyType::get() 2025-05-07T20:10:47.7732266Z U c10::BoolType::get() 2025-05-07T20:10:47.7732626Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:47.7733103Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:47.7733516Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:47.7734339Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:47.7735730Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:47.7736819Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:47.7737424Z U c10::Error::what() const 2025-05-07T20:10:47.7737750Z U c10::FloatType::get() 2025-05-07T20:10:47.7738058Z U c10::GradMode::is_enabled() 2025-05-07T20:10:47.7738390Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:47.7738754Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:47.7739150Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:47.7739527Z U c10::IValue::isBoolList() const 2025-05-07T20:10:47.7739951Z U c10::IValue::isDoubleList() const 2025-05-07T20:10:47.7740482Z U c10::IValue::isIntList() const 2025-05-07T20:10:47.7740815Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:47.7741196Z U c10::IValue::isTensorList() const 2025-05-07T20:10:47.7741566Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:47.7741947Z U c10::IntType::get() 2025-05-07T20:10:47.7742637Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:47.7743413Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:47.7743835Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:47.7744195Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:47.7744579Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:47.7745032Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:47.7745697Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:47.7746209Z U c10::StringType::get() 2025-05-07T20:10:47.7746561Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:47.7746979Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:47.7747376Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:47.7747808Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:10:47.7748489Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:47.7749326Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:47.7749721Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:10:47.7750099Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:47.7750480Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:47.7750847Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:47.7751194Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:47.7751529Z U c10::SymIntType::get() 2025-05-07T20:10:47.7751893Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:47.7752244Z U c10::TensorType::get() 2025-05-07T20:10:47.7752680Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:47.7753323Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:47.7754357Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:47.7755179Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:47.7755995Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:47.7756880Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:47.7757840Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:47.7758801Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:47.7759432Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:47.7759820Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:47.7760200Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:47.7760801Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:47.7761384Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:47.7761778Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:47.7762175Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:10:47.7762571Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:47.7762989Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:47.7763409Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:47.7763878Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:10:47.7764575Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:47.7765055Z U free@GLIBC_2.2.5 2025-05-07T20:10:47.7765395Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:47.7765768Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:47.7766059Z U memcpy@GLIBC_2.14 2025-05-07T20:10:47.7766329Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:47.7766618Z U memset@GLIBC_2.2.5 2025-05-07T20:10:47.7766907Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:47.7767250Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:47.7767559Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:47.7767961Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:47.7768603Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:47.7769389Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:47.7770174Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:47.7770954Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:47.7772077Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:47.7772768Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:47.7773275Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:47.7773690Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.7774093Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.7774554Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:47.7774987Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:47.7775370Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:47.7775877Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:47.7776883Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.7777778Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:47.7778196Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:47.7778547Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:47.7778913Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:47.7779270Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:47.7779676Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.7780309Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.7780789Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:47.7781208Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:47.7781620Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:47.7782319Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:47.7783029Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:47.7783394Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:47.7783765Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:47.7784053Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:47.7784386Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:47.7785232Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:47.7786402Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:47.7787262Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:47.7787784Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:47.7788311Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:47.7788919Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:47.7789420Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:47.7789939Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:47.7790635Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:47.7791249Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:47.7791720Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:47.7792202Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:47.7792730Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:47.7793101Z U torch::autograd::Node::metadata() 2025-05-07T20:10:47.7793434Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:47.7793912Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:47.7794489Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:47.7794994Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:47.7795442Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:47.7795950Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:47.7798760Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:47.7801521Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:47.7801923Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:47.7802318Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:47.7803332Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:10:47.7804351Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:47.7804978Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:47.7805816Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:47.7806366Z U typeinfo for c10::Error 2025-05-07T20:10:47.7806690Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:47.7807059Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:47.7807399Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:47.7807767Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:47.7808119Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:47.7808471Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:47.7808878Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:47.7809278Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:47.7809712Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:47.7810075Z U vtable for c10::Error 2025-05-07T20:10:47.7810570Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.7811117Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:47.7811552Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:47.7812165Z U vtable for torch::autograd::Node 2025-05-07T20:10:47.7812559Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:47.7812937Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:47.7813262Z w _ITM_registerTMCloneTable 2025-05-07T20:10:47.7813560Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:47.7813859Z w __gmon_start__ 2025-05-07T20:10:47.7814121Z w __pthread_key_create 2025-05-07T20:10:47.7814426Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:47.7814735Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:47.7815098Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:47.7815775Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:47.7816138Z 2025-05-07T20:10:47.7816281Z linux-vdso.so.1 (0x00007ffd2cb50000) 2025-05-07T20:10:47.7816610Z libc10.so => not found 2025-05-07T20:10:47.7817053Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.7817395Z libc10_cuda.so => not found 2025-05-07T20:10:47.7817675Z libnccl.so.2 => not found 2025-05-07T20:10:47.7817928Z libcuda.so.1 => not found 2025-05-07T20:10:47.7818661Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007fedc1c00000) 2025-05-07T20:10:47.7819709Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fedc1800000) 2025-05-07T20:10:47.7820982Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fedc1659000) 2025-05-07T20:10:47.7821822Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.7822461Z libtorch.so => not found 2025-05-07T20:10:47.7823089Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007fedc3f5c000) 2025-05-07T20:10:47.7823747Z libtorch_cpu.so => not found 2025-05-07T20:10:47.7824037Z libtorch_cuda.so => not found 2025-05-07T20:10:47.7824478Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fedc13f5000) 2025-05-07T20:10:47.7824903Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fedc3f2e000) 2025-05-07T20:10:47.7825301Z libc.so.6 => /lib64/libc.so.6 (0x00007fedc11ed000) 2025-05-07T20:10:47.7825670Z /lib64/ld-linux-x86-64.so.2 (0x00007fedc3f6f000) 2025-05-07T20:10:47.7826016Z libtorch.so => not found 2025-05-07T20:10:47.7826259Z libc10.so => not found 2025-05-07T20:10:47.7826522Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.7826803Z libc10_cuda.so => not found 2025-05-07T20:10:47.7827054Z libnccl.so.2 => not found 2025-05-07T20:10:47.7827315Z libcuda.so.1 => not found 2025-05-07T20:10:47.7827570Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.7827856Z libtorch_cpu.so => not found 2025-05-07T20:10:47.7828130Z libtorch_cuda.so => not found 2025-05-07T20:10:47.7828418Z libcudart.so.12 => not found 2025-05-07T20:10:47.7828714Z libm.so.6 => /lib64/libm.so.6 (0x00007fedc3325000) 2025-05-07T20:10:47.7829115Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fedc3ed2000) 2025-05-07T20:10:47.7829471Z libc10.so => not found 2025-05-07T20:10:47.7829707Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.7829975Z libc10_cuda.so => not found 2025-05-07T20:10:47.7830232Z libnccl.so.2 => not found 2025-05-07T20:10:47.7830492Z libcuda.so.1 => not found 2025-05-07T20:10:47.7831067Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007fedc0c00000) 2025-05-07T20:10:47.7831642Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.7831916Z libtorch.so => not found 2025-05-07T20:10:47.7832171Z libtorch_cpu.so => not found 2025-05-07T20:10:47.7832436Z libtorch_cuda.so => not found 2025-05-07T20:10:47.7832704Z libcudart.so.12 => not found 2025-05-07T20:10:47.7833007Z libc10.so => not found 2025-05-07T20:10:47.7833282Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.7833586Z libc10_cuda.so => not found 2025-05-07T20:10:47.7833906Z libnccl.so.2 => not found 2025-05-07T20:10:47.7834164Z libcuda.so.1 => not found 2025-05-07T20:10:47.7834776Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fedbfa00000) 2025-05-07T20:10:47.7835455Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.7835791Z libtorch.so => not found 2025-05-07T20:10:47.7836050Z libtorch_cpu.so => not found 2025-05-07T20:10:47.7836322Z libtorch_cuda.so => not found 2025-05-07T20:10:47.7836593Z libcudart.so.12 => not found 2025-05-07T20:10:47.7836912Z libtorch.so => not found 2025-05-07T20:10:47.7837147Z libc10.so => not found 2025-05-07T20:10:47.7837390Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.7837697Z libc10_cuda.so => not found 2025-05-07T20:10:47.7837963Z libnccl.so.2 => not found 2025-05-07T20:10:47.7838284Z libcuda.so.1 => not found 2025-05-07T20:10:47.7838552Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.7838816Z libtorch_cpu.so => not found 2025-05-07T20:10:47.7839087Z libtorch_cuda.so => not found 2025-05-07T20:10:47.7839498Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fedc3ec1000) 2025-05-07T20:10:47.7839877Z libc10.so => not found 2025-05-07T20:10:47.7840127Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.7840382Z libc10_cuda.so => not found 2025-05-07T20:10:47.7840690Z libnccl.so.2 => not found 2025-05-07T20:10:47.7840937Z libcuda.so.1 => not found 2025-05-07T20:10:47.7841468Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007fedc1b89000) 2025-05-07T20:10:47.7842035Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.7842313Z libtorch.so => not found 2025-05-07T20:10:47.7842562Z libtorch_cpu.so => not found 2025-05-07T20:10:47.7842831Z libtorch_cuda.so => not found 2025-05-07T20:10:47.7843097Z libtorch.so => not found 2025-05-07T20:10:47.7843334Z libc10.so => not found 2025-05-07T20:10:47.7843579Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.7843834Z libc10_cuda.so => not found 2025-05-07T20:10:47.7844098Z libnccl.so.2 => not found 2025-05-07T20:10:47.7844376Z libcuda.so.1 => not found 2025-05-07T20:10:47.7844664Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.7844931Z libtorch_cpu.so => not found 2025-05-07T20:10:47.7845206Z libtorch_cuda.so => not found 2025-05-07T20:10:47.7845469Z libcudart.so.12 => not found 2025-05-07T20:10:47.7845739Z libtorch.so => not found 2025-05-07T20:10:47.7845994Z libc10.so => not found 2025-05-07T20:10:47.7846225Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.7846655Z libc10_cuda.so => not found 2025-05-07T20:10:47.7846904Z libnccl.so.2 => not found 2025-05-07T20:10:47.7847158Z libcuda.so.1 => not found 2025-05-07T20:10:47.7847406Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.7847681Z libtorch_cpu.so => not found 2025-05-07T20:10:47.7847938Z libtorch_cuda.so => not found 2025-05-07T20:10:47.7848258Z librt.so.1 => /lib64/librt.so.1 (0x00007fedc3eb2000) 2025-05-07T20:10:47.7848500Z 2025-05-07T20:10:47.7848610Z [CHECK] Displaying ELF information: 2025-05-07T20:10:47.7849108Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:47.7849507Z 2025-05-07T20:10:47.7849560Z 2025-05-07T20:10:47.7849714Z Dynamic section at offset 0xa44058 contains 42 entries: 2025-05-07T20:10:47.7850078Z Tag Type Name/Value 2025-05-07T20:10:47.7850530Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:47.7851049Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:47.7851560Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:47.7852073Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:47.7852570Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:47.7854984Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:47.7855545Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:47.7856250Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:47.7856811Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:47.7857315Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:47.7857832Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:47.7858341Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:47.7858853Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:47.7859366Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:47.7860012Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:47.7860695Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:47.7861202Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:47.7861822Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_pt2.so] 2025-05-07T20:10:47.7862389Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:47.7862801Z 0x000000000000000c (INIT) 0x190000 2025-05-07T20:10:47.7863151Z 0x000000000000000d (FINI) 0x8ac368 2025-05-07T20:10:47.7863494Z 0x0000000000000019 (INIT_ARRAY) 0xa37c40 2025-05-07T20:10:47.7863884Z 0x000000000000001b (INIT_ARRAYSZ) 256 (bytes) 2025-05-07T20:10:47.7864249Z 0x000000000000001a (FINI_ARRAY) 0xa37d40 2025-05-07T20:10:47.7864632Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:47.7864994Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:47.7865368Z 0x0000000000000005 (STRTAB) 0x23008 2025-05-07T20:10:47.7865715Z 0x0000000000000006 (SYMTAB) 0x93e8 2025-05-07T20:10:47.7866145Z 0x000000000000000a (STRSZ) 1248185 (bytes) 2025-05-07T20:10:47.7866557Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:47.7866918Z 0x0000000000000003 (PLTGOT) 0xa47fe8 2025-05-07T20:10:47.7867311Z 0x0000000000000002 (PLTRELSZ) 42648 (bytes) 2025-05-07T20:10:47.7867676Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:47.7868048Z 0x0000000000000017 (JMPREL) 0x184d90 2025-05-07T20:10:47.7868399Z 0x0000000000000007 (RELA) 0x155f30 2025-05-07T20:10:47.7868802Z 0x0000000000000008 (RELASZ) 192096 (bytes) 2025-05-07T20:10:47.7869166Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:47.7869529Z 0x000000006ffffffe (VERNEED) 0x155e20 2025-05-07T20:10:47.7869890Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:47.7870224Z 0x000000006ffffff0 (VERSYM) 0x153bc2 2025-05-07T20:10:47.7870579Z 0x000000006ffffff9 (RELACOUNT) 34 2025-05-07T20:10:47.7870894Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:47.7871120Z 2025-05-07T20:10:47.7871236Z ################################################################################ 2025-05-07T20:10:47.7871464Z 2025-05-07T20:10:47.7871468Z 2025-05-07T20:10:47.7871600Z ################################################################################ 2025-05-07T20:10:47.7872126Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:47.7872728Z [CHECK] Listing out library size: 2025-05-07T20:10:47.7873152Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:47.7873514Z 2025-05-07T20:10:47.7873712Z 429 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:47.7874016Z 2025-05-07T20:10:47.7874432Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:47.7875361Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.7875943Z 2025-05-07T20:10:47.8231039Z GLIBC_2.2.5 2025-05-07T20:10:47.8231799Z GLIBC_2.14 2025-05-07T20:10:47.8237112Z 2025-05-07T20:10:47.8237126Z 2025-05-07T20:10:47.8238905Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:47.8241548Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.8242193Z 2025-05-07T20:10:47.8630700Z GLIBCXX_3.4 2025-05-07T20:10:47.8631444Z GLIBCXX_3.4.9 2025-05-07T20:10:47.8631956Z GLIBCXX_3.4.11 2025-05-07T20:10:47.8632268Z GLIBCXX_3.4.14 2025-05-07T20:10:47.8632501Z GLIBCXX_3.4.18 2025-05-07T20:10:47.8632749Z GLIBCXX_3.4.20 2025-05-07T20:10:47.8632980Z GLIBCXX_3.4.21 2025-05-07T20:10:47.8634493Z 2025-05-07T20:10:47.8634497Z 2025-05-07T20:10:47.8662389Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.DCcb0LU8Wd.symbols.txt 2025-05-07T20:10:47.8662918Z 2025-05-07T20:10:47.9045409Z 2025-05-07T20:10:47.9083717Z [CHECK] Total Number of symbols: 5083 2025-05-07T20:10:47.9112740Z [CHECK] Number of fbgemm symbols: 3788 2025-05-07T20:10:47.9129218Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.XWIsjDWxkh.usymbols.txt 2025-05-07T20:10:47.9129776Z 2025-05-07T20:10:47.9164277Z 2025-05-07T20:10:47.9199892Z [CHECK] Listing out undefined symbols (246 total): 2025-05-07T20:10:47.9221786Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.9224531Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.9225101Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:47.9225654Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:47.9226078Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:47.9226462Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:47.9226856Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:47.9227251Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:47.9227601Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:47.9227972Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:47.9228330Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:47.9228664Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:47.9228972Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:47.9229300Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:47.9229613Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:47.9229947Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:47.9230272Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:47.9230587Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:47.9230910Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:47.9231209Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:47.9231716Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:47.9232111Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:47.9232961Z U at::_ops::arange_start::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.9234240Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.9235536Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.9236549Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:47.9237310Z U at::_ops::scalar_tensor::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.9238070Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:47.9238645Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:47.9239752Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:47.9240939Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.9241610Z U at::detail::getCUDAHooks() 2025-05-07T20:10:47.9241935Z U at::detail::getHIPHooks() 2025-05-07T20:10:47.9242229Z U at::get_thread_num() 2025-05-07T20:10:47.9242529Z U at::globalContext() 2025-05-07T20:10:47.9242845Z U at::internal::set_thread_num(int) 2025-05-07T20:10:47.9243224Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:10:47.9243686Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.9244166Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.9244657Z U c10::ClassType::addMethod(torch::jit::Function*) 2025-05-07T20:10:47.9245277Z U c10::ClassType::getMethod(std::__cxx11::basic_string, std::allocator > const&) const 2025-05-07T20:10:47.9245896Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:47.9246825Z U c10::DictType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:47.9247946Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:47.9248514Z U c10::Error::what() const 2025-05-07T20:10:47.9248828Z U c10::GradMode::is_enabled() 2025-05-07T20:10:47.9249155Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:47.9249530Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.9249954Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.9250402Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:47.9250798Z U c10::IValue::is(c10::IValue const&) const 2025-05-07T20:10:47.9251168Z U c10::IValue::isTensorList() const 2025-05-07T20:10:47.9251541Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:47.9251878Z U c10::IntType::get() 2025-05-07T20:10:47.9252558Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:47.9253347Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:47.9253738Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:47.9254075Z U c10::NoneType::get() 2025-05-07T20:10:47.9254482Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:47.9254952Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:47.9255323Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:47.9255705Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:47.9256088Z U c10::StringType::get() 2025-05-07T20:10:47.9256431Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:47.9256839Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:47.9257486Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:47.9258173Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:47.9258543Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:47.9258907Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:47.9259613Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:10:47.9260513Z U c10::TensorType::get() 2025-05-07T20:10:47.9261503Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:10:47.9262536Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:47.9263494Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:47.9264597Z U c10::_fastEqualsForContainer(c10::IValue const&, c10::IValue const&) 2025-05-07T20:10:47.9265101Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:47.9265476Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:47.9265856Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:47.9266247Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:47.9266602Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:47.9266978Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:47.9267467Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:47.9267969Z U c10::cuda::device_count() 2025-05-07T20:10:47.9268332Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:47.9268737Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:47.9269135Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:47.9269526Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:47.9269943Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:47.9270322Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:47.9271026Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:47.9272118Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:47.9273823Z U c10::detail::infer_schema::make_function_schema(std::__cxx11::basic_string, std::allocator >&&, std::__cxx11::basic_string, std::allocator >&&, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:47.9275150Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:47.9275989Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:47.9276878Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:47.9277877Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:47.9278945Z U c10::getCustomClassTypeImpl(std::type_index const&) 2025-05-07T20:10:47.9279330Z U c10::get_default_dtype() 2025-05-07T20:10:47.9279831Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:47.9280451Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:10:47.9280890Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:47.9281232Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:47.9281563Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:47.9282182Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:47.9282850Z U c10::ivalue::Future::extractStorages(c10::IValue const&) 2025-05-07T20:10:47.9283273Z U c10::ivalue::Object::resizeObject(unsigned long) 2025-05-07T20:10:47.9283791Z U c10::ivalue::checkCustomClassType(c10::ClassType const*, c10::Type const*) 2025-05-07T20:10:47.9284290Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:47.9284745Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:47.9285187Z U c10::operator<<(std::ostream&, c10::FunctionSchema const&) 2025-05-07T20:10:47.9285573Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:47.9286008Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:47.9286435Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:47.9286820Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:47.9287197Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:47.9287584Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:47.9287958Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:47.9288312Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:47.9288672Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:47.9289023Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:47.9289408Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:47.9289758Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:47.9290127Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:47.9290475Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:47.9290883Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:47.9291269Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:47.9292264Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9294005Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9295781Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9297535Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9299155Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9301221Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9302955Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:47.9304674Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:47.9306466Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9308325Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:47.9310248Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9312079Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:47.9313903Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:47.9315685Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9317354Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:47.9318153Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:47.9318981Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9319838Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:47.9320688Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9321482Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:47.9322694Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:47.9323665Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9324529Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9325478Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9326460Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9327330Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9328290Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9329236Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:47.9329428Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.9329599Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.9329784Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.9329956Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.9330397Z U linearize_cache_indices_cuda(at::Tensor const&, at::Tensor const&, at::Tensor const&, std::optional const&, long, long) 2025-05-07T20:10:47.9330587Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:47.9330801Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.9330965Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.9331564Z U lru_cache_populate_byte_cuda(at::Tensor, at::Tensor, long, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long, bool, std::optional) 2025-05-07T20:10:47.9332027Z U lxu_cache_lookup_cuda(at::Tensor, at::Tensor, long, bool, std::optional, std::optional, std::optional) 2025-05-07T20:10:47.9332136Z U memchr@GLIBC_2.2.5 2025-05-07T20:10:47.9332244Z U memcpy@GLIBC_2.14 2025-05-07T20:10:47.9332379Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:47.9332489Z U memset@GLIBC_2.2.5 2025-05-07T20:10:47.9332616Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:47.9332826Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:47.9333054Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:47.9333409Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:47.9333833Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:47.9334173Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:47.9334920Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream(std::__cxx11::basic_string, std::allocator > const&, std::_Ios_Openmode)@GLIBCXX_3.4.21 2025-05-07T20:10:47.9335336Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:47.9335722Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:47.9335928Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:10:47.9336084Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:10:47.9336215Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:47.9336374Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:47.9336527Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:47.9336678Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.9336822Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.9337031Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:47.9337174Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:47.9337427Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:47.9338043Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.9338240Z U std::condition_variable::condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:10:47.9338456Z U std::condition_variable::notify_all()@GLIBCXX_3.4.11 2025-05-07T20:10:47.9338803Z U std::condition_variable::~condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:10:47.9338933Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:10:47.9339091Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:47.9339217Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:47.9339379Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:47.9339505Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:47.9339724Z U std::istream& std::istream::_M_extract(long&)@GLIBCXX_3.4.9 2025-05-07T20:10:47.9339916Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:47.9340335Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:47.9340505Z U std::logic_error::~logic_error()@GLIBCXX_3.4 2025-05-07T20:10:47.9340695Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.9340981Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.9341136Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:47.9341340Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:47.9341487Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:47.9341741Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:10:47.9341931Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:47.9342077Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:47.9342227Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:47.9342337Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:47.9342440Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:47.9342595Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:47.9343196Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:47.9343672Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:47.9343991Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:47.9345083Z U torch::detail::class_base::class_base(std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator >, std::type_info const&, std::type_info const&) 2025-05-07T20:10:47.9345487Z U torch::detail::class_base::withNewArguments(c10::FunctionSchema const&, std::initializer_list) 2025-05-07T20:10:47.9345859Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:47.9346273Z U torch::registerCustomClassMethod(std::unique_ptr >) 2025-05-07T20:10:47.9346452Z U torch::serialize::InputArchive::InputArchive() 2025-05-07T20:10:47.9346776Z U torch::serialize::InputArchive::load_from(char const*, unsigned long, std::optional) 2025-05-07T20:10:47.9347267Z U torch::serialize::InputArchive::read(std::__cxx11::basic_string, std::allocator > const&, at::Tensor&, bool) 2025-05-07T20:10:47.9347622Z U torch::serialize::OutputArchive::OutputArchive(std::shared_ptr) 2025-05-07T20:10:47.9347802Z U torch::serialize::OutputArchive::save_to(std::ostream&) 2025-05-07T20:10:47.9348332Z U torch::serialize::OutputArchive::write(std::__cxx11::basic_string, std::allocator > const&, at::Tensor const&, bool) 2025-05-07T20:10:47.9348481Z U typeinfo for c10::Error 2025-05-07T20:10:47.9348620Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:47.9348759Z U typeinfo for std::logic_error@GLIBCXX_3.4 2025-05-07T20:10:47.9348923Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:47.9349064Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:47.9349262Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:10:47.9349506Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:47.9349666Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:47.9349835Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:47.9350027Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:47.9350162Z U vtable for c10::Error 2025-05-07T20:10:47.9350509Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.9350772Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:47.9350913Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:10:47.9351036Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:47.9351182Z w _ITM_registerTMCloneTable 2025-05-07T20:10:47.9351299Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:47.9351399Z w __gmon_start__ 2025-05-07T20:10:47.9351525Z w __pthread_key_create 2025-05-07T20:10:47.9351648Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:47.9351769Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:47.9351923Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:47.9352174Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:47.9352181Z 2025-05-07T20:10:47.9352403Z linux-vdso.so.1 (0x00007ffcba1b9000) 2025-05-07T20:10:47.9352527Z libc10.so => not found 2025-05-07T20:10:47.9352655Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.9352758Z libc10_cuda.so => not found 2025-05-07T20:10:47.9352863Z libnccl.so.2 => not found 2025-05-07T20:10:47.9352958Z libcuda.so.1 => not found 2025-05-07T20:10:47.9353347Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f7176400000) 2025-05-07T20:10:47.9353812Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f7174c00000) 2025-05-07T20:10:47.9354249Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f7191bd4000) 2025-05-07T20:10:47.9354383Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.9354485Z libtorch.so => not found 2025-05-07T20:10:47.9354591Z libtorch_cpu.so => not found 2025-05-07T20:10:47.9354717Z libtorch_cuda.so => not found 2025-05-07T20:10:47.9354818Z libcudart.so.12 => not found 2025-05-07T20:10:47.9354986Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f717499c000) 2025-05-07T20:10:47.9355162Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f7191ba4000) 2025-05-07T20:10:47.9355290Z libc.so.6 => /lib64/libc.so.6 (0x00007f7174794000) 2025-05-07T20:10:47.9355385Z libc10.so => not found 2025-05-07T20:10:47.9355485Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.9355630Z libc10_cuda.so => not found 2025-05-07T20:10:47.9355734Z libnccl.so.2 => not found 2025-05-07T20:10:47.9355833Z libcuda.so.1 => not found 2025-05-07T20:10:47.9356208Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f7191b2b000) 2025-05-07T20:10:47.9356315Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.9356416Z libtorch.so => not found 2025-05-07T20:10:47.9356517Z libtorch_cpu.so => not found 2025-05-07T20:10:47.9356669Z libtorch_cuda.so => not found 2025-05-07T20:10:47.9356797Z libm.so.6 => /lib64/libm.so.6 (0x00007f7176325000) 2025-05-07T20:10:47.9356934Z /lib64/ld-linux-x86-64.so.2 (0x00007f7191be5000) 2025-05-07T20:10:47.9357057Z libtorch.so => not found 2025-05-07T20:10:47.9357151Z libc10.so => not found 2025-05-07T20:10:47.9357252Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.9357351Z libc10_cuda.so => not found 2025-05-07T20:10:47.9357461Z libnccl.so.2 => not found 2025-05-07T20:10:47.9357560Z libcuda.so.1 => not found 2025-05-07T20:10:47.9357669Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.9357792Z libtorch_cpu.so => not found 2025-05-07T20:10:47.9357893Z libtorch_cuda.so => not found 2025-05-07T20:10:47.9357994Z libcudart.so.12 => not found 2025-05-07T20:10:47.9358148Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f71769aa000) 2025-05-07T20:10:47.9358266Z libtorch.so => not found 2025-05-07T20:10:47.9358384Z libc10.so => not found 2025-05-07T20:10:47.9358487Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.9358608Z libc10_cuda.so => not found 2025-05-07T20:10:47.9358705Z libnccl.so.2 => not found 2025-05-07T20:10:47.9358802Z libcuda.so.1 => not found 2025-05-07T20:10:47.9358906Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.9359027Z libtorch_cpu.so => not found 2025-05-07T20:10:47.9359128Z libtorch_cuda.so => not found 2025-05-07T20:10:47.9359305Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f7191b1c000) 2025-05-07T20:10:47.9359426Z libtorch.so => not found 2025-05-07T20:10:47.9359523Z libc10.so => not found 2025-05-07T20:10:47.9359626Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.9359724Z libc10_cuda.so => not found 2025-05-07T20:10:47.9359850Z libnccl.so.2 => not found 2025-05-07T20:10:47.9359949Z libcuda.so.1 => not found 2025-05-07T20:10:47.9360054Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.9360176Z libtorch_cpu.so => not found 2025-05-07T20:10:47.9360278Z libtorch_cuda.so => not found 2025-05-07T20:10:47.9360417Z librt.so.1 => /lib64/librt.so.1 (0x00007f71769a1000) 2025-05-07T20:10:47.9360423Z 2025-05-07T20:10:47.9360555Z [CHECK] Displaying ELF information: 2025-05-07T20:10:47.9360830Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:47.9360835Z 2025-05-07T20:10:47.9360865Z 2025-05-07T20:10:47.9361035Z Dynamic section at offset 0x1ac7bfc8 contains 41 entries: 2025-05-07T20:10:47.9361183Z Tag Type Name/Value 2025-05-07T20:10:47.9361380Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:47.9361590Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:47.9361817Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:47.9362016Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:47.9362215Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:47.9362441Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:47.9362662Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:47.9362881Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:47.9363097Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:47.9363315Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:47.9363542Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:47.9363749Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:47.9363974Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:47.9364176Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:47.9364372Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:47.9366002Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:47.9366252Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_inference.so] 2025-05-07T20:10:47.9366449Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:47.9366594Z 0x000000000000000c (INIT) 0x1a0000 2025-05-07T20:10:47.9366713Z 0x000000000000000d (FINI) 0x74838c 2025-05-07T20:10:47.9366838Z 0x0000000000000019 (INIT_ARRAY) 0x1ac7aca0 2025-05-07T20:10:47.9366975Z 0x000000000000001b (INIT_ARRAYSZ) 392 (bytes) 2025-05-07T20:10:47.9367128Z 0x000000000000001a (FINI_ARRAY) 0x1ac7ae28 2025-05-07T20:10:47.9367270Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:47.9367391Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:47.9367535Z 0x0000000000000005 (STRTAB) 0x27a50 2025-05-07T20:10:47.9367695Z 0x0000000000000006 (SYMTAB) 0x9db0 2025-05-07T20:10:47.9367847Z 0x000000000000000a (STRSZ) 1387089 (bytes) 2025-05-07T20:10:47.9367974Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:47.9368137Z 0x0000000000000003 (PLTGOT) 0x1ac84fe8 2025-05-07T20:10:47.9368279Z 0x0000000000000002 (PLTRELSZ) 20568 (bytes) 2025-05-07T20:10:47.9368398Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:47.9368550Z 0x0000000000000017 (JMPREL) 0x19af18 2025-05-07T20:10:47.9368670Z 0x0000000000000007 (RELA) 0x17cd80 2025-05-07T20:10:47.9368813Z 0x0000000000000008 (RELASZ) 123288 (bytes) 2025-05-07T20:10:47.9368967Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:47.9369092Z 0x000000006ffffffe (VERNEED) 0x17cc60 2025-05-07T20:10:47.9369214Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:47.9369332Z 0x000000006ffffff0 (VERSYM) 0x17a4a2 2025-05-07T20:10:47.9369478Z 0x000000006ffffff9 (RELACOUNT) 539 2025-05-07T20:10:47.9369590Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:47.9369594Z 2025-05-07T20:10:47.9369721Z ################################################################################ 2025-05-07T20:10:47.9369762Z 2025-05-07T20:10:47.9369766Z 2025-05-07T20:10:47.9369915Z ################################################################################ 2025-05-07T20:10:47.9370280Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:47.9370401Z [CHECK] Listing out library size: 2025-05-07T20:10:47.9370776Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:47.9370780Z 2025-05-07T20:10:47.9371052Z 5 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:47.9371056Z 2025-05-07T20:10:47.9371524Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:47.9372105Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.9372112Z 2025-05-07T20:10:47.9615928Z GLIBC_2.2.5 2025-05-07T20:10:47.9616382Z GLIBC_2.3 2025-05-07T20:10:47.9616639Z GLIBC_2.14 2025-05-07T20:10:47.9616658Z 2025-05-07T20:10:47.9616671Z 2025-05-07T20:10:47.9618436Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:47.9620465Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.9620484Z 2025-05-07T20:10:47.9880385Z GLIBCXX_3.4 2025-05-07T20:10:47.9881283Z GLIBCXX_3.4.9 2025-05-07T20:10:47.9881579Z GLIBCXX_3.4.11 2025-05-07T20:10:47.9881858Z GLIBCXX_3.4.15 2025-05-07T20:10:47.9882386Z GLIBCXX_3.4.18 2025-05-07T20:10:47.9882630Z GLIBCXX_3.4.20 2025-05-07T20:10:47.9882875Z GLIBCXX_3.4.21 2025-05-07T20:10:47.9882925Z 2025-05-07T20:10:47.9882939Z 2025-05-07T20:10:47.9902891Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.RnwfFuqAmC.symbols.txt 2025-05-07T20:10:47.9902932Z 2025-05-07T20:10:48.0120651Z 2025-05-07T20:10:48.0145521Z [CHECK] Total Number of symbols: 2987 2025-05-07T20:10:48.0168289Z [CHECK] Number of fbgemm symbols: 1 2025-05-07T20:10:48.0183789Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.DyGvNahgHR.usymbols.txt 2025-05-07T20:10:48.0183830Z 2025-05-07T20:10:48.0217635Z 2025-05-07T20:10:48.0246796Z [CHECK] Listing out undefined symbols (189 total): 2025-05-07T20:10:48.0263097Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.0263642Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.0263791Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:48.0263929Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:48.0264043Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:48.0264192Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:48.0264308Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:48.0264496Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:48.0264610Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:48.0264724Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:48.0264854Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:48.0264960Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:48.0265066Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:48.0265190Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:48.0265319Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:48.0265503Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:48.0265688Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:48.0265852Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:48.0265968Z U at::RecordFunction::end() 2025-05-07T20:10:48.0266104Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:48.0266286Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:48.0267045Z U at::_ops::_sparse_coo_tensor_unsafe::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.0267381Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:10:48.0267999Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.0268710Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.0268908Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:48.0269103Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:48.0269306Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:48.0269487Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:10:48.0269704Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:48.0269848Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:48.0270049Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:48.0270302Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:48.0270519Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:48.0270621Z U c10::AnyType::get() 2025-05-07T20:10:48.0270749Z U c10::BoolType::get() 2025-05-07T20:10:48.0270927Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:48.0271045Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:48.0271558Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:48.0272200Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:48.0272584Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:48.0272695Z U c10::Error::what() const 2025-05-07T20:10:48.0272798Z U c10::FloatType::get() 2025-05-07T20:10:48.0272908Z U c10::GradMode::is_enabled() 2025-05-07T20:10:48.0273045Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:48.0273199Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:48.0273317Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:48.0273448Z U c10::IValue::isBoolList() const 2025-05-07T20:10:48.0273560Z U c10::IValue::isIntList() const 2025-05-07T20:10:48.0273680Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:48.0273816Z U c10::IValue::isTensorList() const 2025-05-07T20:10:48.0273985Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:48.0274085Z U c10::IntType::get() 2025-05-07T20:10:48.0274565Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.0274739Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:48.0274862Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:48.0275013Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:48.0275141Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:48.0275360Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.0275663Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:48.0275769Z U c10::StringType::get() 2025-05-07T20:10:48.0275918Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:48.0276081Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:48.0276248Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:10:48.0276424Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:48.0276600Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:10:48.0276985Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:48.0277123Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:48.0277273Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:10:48.0277436Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:10:48.0277572Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:48.0277713Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:48.0277844Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:48.0277971Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:48.0278101Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:48.0278234Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:48.0278338Z U c10::SymIntType::get() 2025-05-07T20:10:48.0278458Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:48.0278580Z U c10::TensorType::get() 2025-05-07T20:10:48.0278703Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:48.0279144Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.0279655Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.0279903Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:48.0280371Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.0280719Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:48.0281270Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.0281611Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:48.0281819Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:48.0281940Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:48.0282121Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:48.0282478Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:48.0282602Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:48.0282786Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:48.0282935Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:10:48.0283073Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:48.0283290Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:48.0283413Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:48.0283661Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:10:48.0283783Z U free@GLIBC_2.2.5 2025-05-07T20:10:48.0283952Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:48.0284052Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:48.0284195Z U memcpy@GLIBC_2.14 2025-05-07T20:10:48.0284319Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:48.0284416Z U memset@GLIBC_2.2.5 2025-05-07T20:10:48.0284533Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:48.0284679Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.0284776Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:48.0285009Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:48.0285361Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:48.0285737Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.0286051Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:48.0286437Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.0286788Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:48.0286904Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.0287074Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:48.0287218Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.0287358Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.0287548Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:48.0287680Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:48.0287819Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:48.0288081Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:48.0288630Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.0288761Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:48.0288911Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:48.0289036Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:48.0289182Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:48.0289326Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:48.0289511Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.0289747Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.0289895Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:48.0290056Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:48.0290192Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:48.0290617Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:48.0290762Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:48.0290874Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:48.0290997Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:48.0291093Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:48.0291218Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:48.0291881Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:48.0292322Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.0292575Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.0292754Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:48.0293043Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:48.0293232Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:48.0293462Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:48.0293650Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:48.0293987Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:48.0294171Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:48.0294358Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:48.0294535Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:48.0294720Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:48.0294836Z U torch::autograd::Node::metadata() 2025-05-07T20:10:48.0294978Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:48.0295248Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:48.0295507Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:48.0295649Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:48.0295881Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:48.0296096Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:48.0298603Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:48.0298794Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:48.0298950Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:48.0299140Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:48.0299980Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:10:48.0300151Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:48.0315800Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:48.0316309Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:48.0316432Z U typeinfo for c10::Error 2025-05-07T20:10:48.0316621Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:48.0316936Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:48.0317085Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:48.0317411Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:48.0317545Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:48.0317712Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:48.0317908Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:48.0318076Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:48.0318244Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.0318385Z U vtable for c10::Error 2025-05-07T20:10:48.0318734Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.0318878Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:48.0319141Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:48.0319303Z U vtable for torch::autograd::Node 2025-05-07T20:10:48.0319490Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:48.0319646Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:48.0319765Z w _ITM_registerTMCloneTable 2025-05-07T20:10:48.0319874Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:48.0319977Z w __gmon_start__ 2025-05-07T20:10:48.0320110Z w __pthread_key_create 2025-05-07T20:10:48.0320230Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:48.0320351Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:48.0320535Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:48.0320834Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:48.0320842Z 2025-05-07T20:10:48.0320993Z linux-vdso.so.1 (0x00007fffac769000) 2025-05-07T20:10:48.0321128Z libc10.so => not found 2025-05-07T20:10:48.0321232Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.0321336Z libc10_cuda.so => not found 2025-05-07T20:10:48.0321499Z libnccl.so.2 => not found 2025-05-07T20:10:48.0321603Z libcuda.so.1 => not found 2025-05-07T20:10:48.0322268Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f94ff167000) 2025-05-07T20:10:48.0322746Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f94fda00000) 2025-05-07T20:10:48.0322887Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.0323077Z libtorch.so => not found 2025-05-07T20:10:48.0323186Z libtorch_cpu.so => not found 2025-05-07T20:10:48.0323323Z libtorch_cuda.so => not found 2025-05-07T20:10:48.0323501Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f94fd79c000) 2025-05-07T20:10:48.0323664Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f94ff137000) 2025-05-07T20:10:48.0323825Z libc.so.6 => /lib64/libc.so.6 (0x00007f94fd594000) 2025-05-07T20:10:48.0323970Z /lib64/ld-linux-x86-64.so.2 (0x00007f94ff178000) 2025-05-07T20:10:48.0324071Z libtorch.so => not found 2025-05-07T20:10:48.0324169Z libc10.so => not found 2025-05-07T20:10:48.0324300Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.0324399Z libc10_cuda.so => not found 2025-05-07T20:10:48.0324499Z libnccl.so.2 => not found 2025-05-07T20:10:48.0324626Z libcuda.so.1 => not found 2025-05-07T20:10:48.0324802Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.0324907Z libtorch_cpu.so => not found 2025-05-07T20:10:48.0325011Z libtorch_cuda.so => not found 2025-05-07T20:10:48.0325200Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f94ff0dd000) 2025-05-07T20:10:48.0325389Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f94ff0d8000) 2025-05-07T20:10:48.0325490Z libtorch.so => not found 2025-05-07T20:10:48.0325608Z libc10.so => not found 2025-05-07T20:10:48.0325714Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.0325846Z libc10_cuda.so => not found 2025-05-07T20:10:48.0325945Z libnccl.so.2 => not found 2025-05-07T20:10:48.0326074Z libcuda.so.1 => not found 2025-05-07T20:10:48.0326184Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.0326287Z libtorch_cpu.so => not found 2025-05-07T20:10:48.0326419Z libtorch_cuda.so => not found 2025-05-07T20:10:48.0326525Z libcudart.so.12 => not found 2025-05-07T20:10:48.0326658Z libm.so.6 => /lib64/libm.so.6 (0x00007f94feb25000) 2025-05-07T20:10:48.0326663Z 2025-05-07T20:10:48.0326784Z [CHECK] Displaying ELF information: 2025-05-07T20:10:48.0327148Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:48.0327153Z 2025-05-07T20:10:48.0350354Z 2025-05-07T20:10:48.0351024Z Dynamic section at offset 0x4b5fc8 contains 40 entries: 2025-05-07T20:10:48.0351389Z Tag Type Name/Value 2025-05-07T20:10:48.0352177Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:48.0352792Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:48.0353410Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:48.0353983Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:48.0354387Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:48.0354636Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:48.0354868Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:48.0355087Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:48.0355294Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:48.0355528Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:48.0355746Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:48.0355955Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:48.0356223Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:48.0356419Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:48.0356639Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:48.0356971Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_split_host.so] 2025-05-07T20:10:48.0357162Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:48.0357290Z 0x000000000000000c (INIT) 0xd6000 2025-05-07T20:10:48.0357436Z 0x000000000000000d (FINI) 0x3f64b8 2025-05-07T20:10:48.0357562Z 0x0000000000000019 (INIT_ARRAY) 0x4add80 2025-05-07T20:10:48.0357702Z 0x000000000000001b (INIT_ARRAYSZ) 304 (bytes) 2025-05-07T20:10:48.0357836Z 0x000000000000001a (FINI_ARRAY) 0x4adeb0 2025-05-07T20:10:48.0357997Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:48.0358127Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:48.0358250Z 0x0000000000000005 (STRTAB) 0x16e00 2025-05-07T20:10:48.0358395Z 0x0000000000000006 (SYMTAB) 0x55e0 2025-05-07T20:10:48.0358537Z 0x000000000000000a (STRSZ) 609767 (bytes) 2025-05-07T20:10:48.0358660Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:48.0358815Z 0x0000000000000003 (PLTGOT) 0x4b8fe8 2025-05-07T20:10:48.0358980Z 0x0000000000000002 (PLTRELSZ) 31704 (bytes) 2025-05-07T20:10:48.0359109Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:48.0359233Z 0x0000000000000017 (JMPREL) 0xcdaf0 2025-05-07T20:10:48.0359375Z 0x0000000000000007 (RELA) 0xad450 2025-05-07T20:10:48.0359520Z 0x0000000000000008 (RELASZ) 132768 (bytes) 2025-05-07T20:10:48.0359675Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:48.0359830Z 0x000000006ffffffe (VERNEED) 0xad340 2025-05-07T20:10:48.0359949Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:48.0360226Z 0x000000006ffffff0 (VERSYM) 0xabbe8 2025-05-07T20:10:48.0360371Z 0x000000006ffffff9 (RELACOUNT) 40 2025-05-07T20:10:48.0360479Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:48.0360483Z 2025-05-07T20:10:48.0360606Z ################################################################################ 2025-05-07T20:10:48.0360610Z 2025-05-07T20:10:48.0360614Z 2025-05-07T20:10:48.0360759Z ################################################################################ 2025-05-07T20:10:48.0361077Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:48.0361187Z [CHECK] Listing out library size: 2025-05-07T20:10:48.0361525Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:48.0361552Z 2025-05-07T20:10:48.0367523Z 339 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:48.0367531Z 2025-05-07T20:10:48.0368047Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:48.0368612Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.0368624Z 2025-05-07T20:10:48.1340370Z GLIBC_2.2.5 2025-05-07T20:10:48.1340847Z GLIBC_2.3 2025-05-07T20:10:48.1341749Z GLIBC_2.14 2025-05-07T20:10:48.1341775Z 2025-05-07T20:10:48.1341789Z 2025-05-07T20:10:48.1343158Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:48.1344872Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.1344889Z 2025-05-07T20:10:48.2302595Z GLIBCXX_3.4 2025-05-07T20:10:48.2303285Z GLIBCXX_3.4.9 2025-05-07T20:10:48.2303892Z GLIBCXX_3.4.20 2025-05-07T20:10:48.2304659Z GLIBCXX_3.4.21 2025-05-07T20:10:48.2304797Z 2025-05-07T20:10:48.2304802Z 2025-05-07T20:10:48.2324112Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.LWFNYPMJJT.symbols.txt 2025-05-07T20:10:48.2325099Z 2025-05-07T20:10:48.3253237Z 2025-05-07T20:10:48.3297169Z [CHECK] Total Number of symbols: 12626 2025-05-07T20:10:48.3344608Z [CHECK] Number of fbgemm symbols: 5267 2025-05-07T20:10:48.3364278Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.g62BYgZmTk.usymbols.txt 2025-05-07T20:10:48.3365860Z 2025-05-07T20:10:48.3415043Z 2025-05-07T20:10:48.3441207Z [CHECK] Listing out undefined symbols (171 total): 2025-05-07T20:10:48.3461926Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.3462664Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:48.3463084Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.3463533Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.3463986Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.3464577Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:48.3465015Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:48.3465401Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:48.3465822Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.3466239Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:48.3466596Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:48.3467022Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:48.3467362Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:48.3467740Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:48.3468096Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:48.3468478Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:48.3468854Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:48.3469191Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:48.3469547Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:48.3470064Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:48.3470385Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:48.3470795Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:48.3471195Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:48.3471786Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:48.3472491Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:48.3473097Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:10:48.3473708Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:10:48.3474723Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.3475630Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:48.3476129Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:48.3476598Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:48.3477057Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:48.3477569Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.3478048Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.3478481Z U c10::BoolType::get() 2025-05-07T20:10:48.3478834Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:48.3479299Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:48.3479732Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:48.3480444Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:48.3481663Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:48.3482746Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:48.3483312Z U c10::Error::what() const 2025-05-07T20:10:48.3483647Z U c10::FloatType::get() 2025-05-07T20:10:48.3483997Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.3484479Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.3484933Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:48.3485282Z U c10::IntType::get() 2025-05-07T20:10:48.3485667Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:48.3486062Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:48.3486468Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:48.3486828Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:48.3487231Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:48.3487648Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:48.3488037Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:48.3488710Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:48.3489345Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:48.3489746Z U c10::SymInt::operator+=(c10::SymInt const&) 2025-05-07T20:10:48.3490113Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:48.3490471Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:48.3490880Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:48.3491250Z U c10::SymInt::sym_ge(c10::SymInt const&) const 2025-05-07T20:10:48.3491643Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:48.3492033Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:48.3492393Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:10:48.3492783Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:48.3493113Z U c10::SymIntType::get() 2025-05-07T20:10:48.3493448Z U c10::TensorType::get() 2025-05-07T20:10:48.3493783Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:48.3494727Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:48.3495678Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:48.3496035Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:48.3496435Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:48.3496781Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:48.3497147Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:48.3497512Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:48.3497971Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:48.3498451Z U c10::cuda::device_count() 2025-05-07T20:10:48.3498793Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:48.3499198Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:48.3499610Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:48.3500110Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:48.3500734Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:48.3501164Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:48.3501939Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.3502895Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:48.3503783Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.3504783Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:48.3505911Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.3506742Z U c10::get_default_dtype() 2025-05-07T20:10:48.3507119Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:48.3507483Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:48.3508076Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:48.3508755Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:48.3509194Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:48.3509582Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:48.3509987Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:48.3510413Z U c10::operator%(c10::SymInt const&, int) 2025-05-07T20:10:48.3510849Z U c10::operator*(c10::SymInt const&, long) 2025-05-07T20:10:48.3511255Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:48.3511653Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:10:48.3512050Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:48.3512475Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:48.3513017Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:48.3513455Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:48.3513802Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:48.3514237Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:48.3514660Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:48.3515059Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:48.3515444Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:48.3515786Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:48.3516149Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:48.3516508Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:48.3516871Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:48.3517220Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:48.3517656Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:48.3518050Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:48.3518420Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:48.3518926Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:48.3519291Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:48.3519882Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:48.3520388Z U float at::Tensor::item() const 2025-05-07T20:10:48.3520766Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.3521372Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.3522195Z U free@GLIBC_2.2.5 2025-05-07T20:10:48.3522549Z U int at::Tensor::item() const 2025-05-07T20:10:48.3522981Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.3523559Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.3524022Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:48.3524499Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.3524945Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.3525329Z U memcpy@GLIBC_2.14 2025-05-07T20:10:48.3525673Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:48.3526031Z U memset@GLIBC_2.2.5 2025-05-07T20:10:48.3526383Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:48.3526749Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.3527373Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:48.3528272Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.3528916Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.3529324Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.3529729Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.3530193Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:48.3530814Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:48.3531756Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.3532610Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:48.3533009Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:48.3533381Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:48.3533780Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:48.3534136Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:48.3534585Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.3535254Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.3535865Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:48.3536227Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:48.3536575Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:48.3536904Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:48.3537682Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:48.3538802Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.3539609Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.3540599Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:48.3541294Z U typeinfo for c10::Error 2025-05-07T20:10:48.3541696Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:48.3542134Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:48.3542604Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.3542989Z U vtable for c10::Error 2025-05-07T20:10:48.3543605Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.3544322Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:48.3544856Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:48.3545298Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:48.3545646Z w _ITM_registerTMCloneTable 2025-05-07T20:10:48.3546014Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:48.3546369Z w __gmon_start__ 2025-05-07T20:10:48.3546690Z w __pthread_key_create 2025-05-07T20:10:48.3547072Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:48.3547584Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:48.3547944Z 2025-05-07T20:10:48.3548130Z linux-vdso.so.1 (0x00007ffd2c42d000) 2025-05-07T20:10:48.3548443Z libc10.so => not found 2025-05-07T20:10:48.3548741Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.3549057Z libc10_cuda.so => not found 2025-05-07T20:10:48.3549343Z libnccl.so.2 => not found 2025-05-07T20:10:48.3549648Z libcuda.so.1 => not found 2025-05-07T20:10:48.3550312Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f4456400000) 2025-05-07T20:10:48.3551047Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.3551382Z libtorch.so => not found 2025-05-07T20:10:48.3551688Z libtorch_cpu.so => not found 2025-05-07T20:10:48.3551981Z libtorch_cuda.so => not found 2025-05-07T20:10:48.3552299Z libcudart.so.12 => not found 2025-05-07T20:10:48.3552793Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f445619c000) 2025-05-07T20:10:48.3553204Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f446c4c7000) 2025-05-07T20:10:48.3553609Z libc.so.6 => /lib64/libc.so.6 (0x00007f4455f94000) 2025-05-07T20:10:48.3553967Z /lib64/ld-linux-x86-64.so.2 (0x00007f446c4fd000) 2025-05-07T20:10:48.3554304Z libc10.so => not found 2025-05-07T20:10:48.3554538Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.3554829Z libc10_cuda.so => not found 2025-05-07T20:10:48.3555097Z libnccl.so.2 => not found 2025-05-07T20:10:48.3555384Z libcuda.so.1 => not found 2025-05-07T20:10:48.3555890Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f4455a00000) 2025-05-07T20:10:48.3556792Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f446c4ba000) 2025-05-07T20:10:48.3557451Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.3557760Z libtorch.so => not found 2025-05-07T20:10:48.3558062Z libtorch_cpu.so => not found 2025-05-07T20:10:48.3558334Z libtorch_cuda.so => not found 2025-05-07T20:10:48.3558632Z libcudart.so.12 => not found 2025-05-07T20:10:48.3558952Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f446c462000) 2025-05-07T20:10:48.3559345Z libm.so.6 => /lib64/libm.so.6 (0x00007f446c387000) 2025-05-07T20:10:48.3559689Z libc10.so => not found 2025-05-07T20:10:48.3559936Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.3560220Z libc10_cuda.so => not found 2025-05-07T20:10:48.3560480Z libnccl.so.2 => not found 2025-05-07T20:10:48.3560754Z libcuda.so.1 => not found 2025-05-07T20:10:48.3561252Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f446c30c000) 2025-05-07T20:10:48.3561826Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.3562096Z libtorch.so => not found 2025-05-07T20:10:48.3562373Z libtorch_cpu.so => not found 2025-05-07T20:10:48.3562643Z libtorch_cuda.so => not found 2025-05-07T20:10:48.3562929Z libtorch.so => not found 2025-05-07T20:10:48.3563198Z libc10.so => not found 2025-05-07T20:10:48.3563445Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.3563746Z libc10_cuda.so => not found 2025-05-07T20:10:48.3564015Z libnccl.so.2 => not found 2025-05-07T20:10:48.3564312Z libcuda.so.1 => not found 2025-05-07T20:10:48.3564613Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.3564908Z libtorch_cpu.so => not found 2025-05-07T20:10:48.3565179Z libtorch_cuda.so => not found 2025-05-07T20:10:48.3565542Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f446c303000) 2025-05-07T20:10:48.3565913Z libtorch.so => not found 2025-05-07T20:10:48.3566188Z libc10.so => not found 2025-05-07T20:10:48.3566465Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.3566730Z libc10_cuda.so => not found 2025-05-07T20:10:48.3567047Z libnccl.so.2 => not found 2025-05-07T20:10:48.3567302Z libcuda.so.1 => not found 2025-05-07T20:10:48.3567749Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.3568022Z libtorch_cpu.so => not found 2025-05-07T20:10:48.3568311Z libtorch_cuda.so => not found 2025-05-07T20:10:48.3568613Z librt.so.1 => /lib64/librt.so.1 (0x00007f44567fb000) 2025-05-07T20:10:48.3568867Z 2025-05-07T20:10:48.3568983Z [CHECK] Displaying ELF information: 2025-05-07T20:10:48.3569459Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:48.3569814Z 2025-05-07T20:10:48.3581378Z 2025-05-07T20:10:48.3581989Z Dynamic section at offset 0x15292018 contains 40 entries: 2025-05-07T20:10:48.3583314Z Tag Type Name/Value 2025-05-07T20:10:48.3583970Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:48.3584491Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:48.3586275Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:48.3586810Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:48.3587365Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:48.3587909Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:48.3588500Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:48.3589056Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:48.3589582Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:48.3590141Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:48.3590673Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:48.3591235Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:48.3591768Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:48.3592304Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:48.3592913Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:48.3593517Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_forward.so] 2025-05-07T20:10:48.3594102Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:48.3594635Z 0x000000000000000c (INIT) 0x453000 2025-05-07T20:10:48.3595021Z 0x000000000000000d (FINI) 0x1fe941c 2025-05-07T20:10:48.3595428Z 0x0000000000000019 (INIT_ARRAY) 0x152889a8 2025-05-07T20:10:48.3595987Z 0x000000000000001b (INIT_ARRAYSZ) 752 (bytes) 2025-05-07T20:10:48.3596414Z 0x000000000000001a (FINI_ARRAY) 0x15288c98 2025-05-07T20:10:48.3597006Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:48.3597388Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:48.3597728Z 0x0000000000000005 (STRTAB) 0x624b8 2025-05-07T20:10:48.3598090Z 0x0000000000000006 (SYMTAB) 0x184f0 2025-05-07T20:10:48.3598453Z 0x000000000000000a (STRSZ) 3694099 (bytes) 2025-05-07T20:10:48.3598852Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:48.3599208Z 0x0000000000000003 (PLTGOT) 0x152a8fe8 2025-05-07T20:10:48.3599612Z 0x0000000000000002 (PLTRELSZ) 14520 (bytes) 2025-05-07T20:10:48.3600053Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:48.3600394Z 0x0000000000000017 (JMPREL) 0x44ece0 2025-05-07T20:10:48.3600762Z 0x0000000000000007 (RELA) 0x3ee668 2025-05-07T20:10:48.3601119Z 0x0000000000000008 (RELASZ) 394872 (bytes) 2025-05-07T20:10:48.3601500Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:48.3602033Z 0x000000006ffffffe (VERNEED) 0x3ee578 2025-05-07T20:10:48.3602462Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:48.3602818Z 0x000000006ffffff0 (VERSYM) 0x3e82cc 2025-05-07T20:10:48.3603208Z 0x000000006ffffff9 (RELACOUNT) 1976 2025-05-07T20:10:48.3603547Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:48.3603762Z 2025-05-07T20:10:48.3603910Z ################################################################################ 2025-05-07T20:10:48.3604144Z 2025-05-07T20:10:48.3604148Z 2025-05-07T20:10:48.3604272Z ################################################################################ 2025-05-07T20:10:48.3604846Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:48.3605396Z [CHECK] Listing out library size: 2025-05-07T20:10:48.3605920Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:48.3606360Z 2025-05-07T20:10:48.3606618Z 1 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:48.3606979Z 2025-05-07T20:10:48.3607410Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:48.3608502Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.3609345Z 2025-05-07T20:10:48.3659155Z GLIBC_2.2.5 2025-05-07T20:10:48.3659972Z GLIBC_2.3 2025-05-07T20:10:48.3660574Z GLIBC_2.14 2025-05-07T20:10:48.3660923Z 2025-05-07T20:10:48.3660935Z 2025-05-07T20:10:48.3662288Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:48.3665145Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.3665815Z 2025-05-07T20:10:48.3720565Z GLIBCXX_3.4 2025-05-07T20:10:48.3721357Z GLIBCXX_3.4.9 2025-05-07T20:10:48.3721577Z GLIBCXX_3.4.18 2025-05-07T20:10:48.3722185Z GLIBCXX_3.4.20 2025-05-07T20:10:48.3722572Z GLIBCXX_3.4.21 2025-05-07T20:10:48.3722730Z 2025-05-07T20:10:48.3722735Z 2025-05-07T20:10:48.3741721Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.YD75UAQFOD.symbols.txt 2025-05-07T20:10:48.3742284Z 2025-05-07T20:10:48.3767422Z 2025-05-07T20:10:48.3792266Z [CHECK] Total Number of symbols: 357 2025-05-07T20:10:48.3807862Z [CHECK] Number of fbgemm symbols: 57 2025-05-07T20:10:48.3824445Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.OTtPjoODYh.usymbols.txt 2025-05-07T20:10:48.3825026Z 2025-05-07T20:10:48.3842467Z 2025-05-07T20:10:48.3866668Z [CHECK] Listing out undefined symbols (118 total): 2025-05-07T20:10:48.3884511Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.3885337Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.3885859Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:48.3886209Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.3886589Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.3887141Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.3887510Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:48.3888000Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:48.3888323Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:48.3888678Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.3889007Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:48.3889290Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:48.3889638Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:48.3889922Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:48.3890228Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:48.3890532Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:48.3890833Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:48.3891126Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:48.3891431Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:48.3891744Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:48.3892482Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.3893725Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.3894636Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:48.3895025Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:48.3895365Z U c10::IntType::get() 2025-05-07T20:10:48.3895708Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:48.3896105Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:48.3896537Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.3897242Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:48.3897864Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:48.3898207Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:48.3898535Z U c10::TensorType::get() 2025-05-07T20:10:48.3898849Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:48.3899877Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:48.3901037Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:48.3901503Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:48.3901875Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:48.3902241Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:48.3902574Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:48.3902925Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:48.3903393Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:48.3903872Z U c10::cuda::device_count() 2025-05-07T20:10:48.3904237Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:48.3904613Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:48.3905019Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:48.3905411Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:48.3905869Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:48.3906249Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:48.3907016Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.3907904Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:48.3908810Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.3909781Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:48.3910837Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.3911650Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:48.3912001Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:48.3912352Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:48.3912841Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:48.3913257Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:48.3913577Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:48.3913978Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:48.3914386Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:48.3914761Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:48.3915117Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:48.3915445Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:48.3915796Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:48.3916114Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:48.3916451Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:48.3916807Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:48.3917176Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:48.3917528Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:48.3917858Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:48.3918189Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:48.3918554Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:48.3918914Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:48.3919253Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.3919658Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:48.3920075Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.3920396Z U memcpy@GLIBC_2.14 2025-05-07T20:10:48.3920680Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:48.3920955Z U memset@GLIBC_2.2.5 2025-05-07T20:10:48.3921257Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:48.3921773Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.3922881Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:48.3923826Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.3924660Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:48.3925575Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:48.3926239Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.3926628Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:48.3927120Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.3927541Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.3928075Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:48.3928622Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:48.3929610Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.3930472Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:48.3930850Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:48.3931254Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:48.3931619Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:48.3932078Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.3932712Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.3933218Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:48.3933613Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:48.3933942Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:48.3934296Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:48.3935161Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:48.3936346Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.3937212Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.3937992Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:48.3938702Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.3939273Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:48.3939710Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:48.3940259Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.3940912Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.3941601Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:48.3942096Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:48.3942445Z w _ITM_registerTMCloneTable 2025-05-07T20:10:48.3942809Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:48.3943158Z w __gmon_start__ 2025-05-07T20:10:48.3943462Z w __pthread_key_create 2025-05-07T20:10:48.3943850Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:48.3944372Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:48.3944767Z 2025-05-07T20:10:48.3944910Z linux-vdso.so.1 (0x00007ffe50bf2000) 2025-05-07T20:10:48.3945224Z libtorch.so => not found 2025-05-07T20:10:48.3945517Z libc10.so => not found 2025-05-07T20:10:48.3945876Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.3946157Z libc10_cuda.so => not found 2025-05-07T20:10:48.3946465Z libnccl.so.2 => not found 2025-05-07T20:10:48.3946742Z libcuda.so.1 => not found 2025-05-07T20:10:48.3947064Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.3947365Z libtorch_cpu.so => not found 2025-05-07T20:10:48.3947689Z libtorch_cuda.so => not found 2025-05-07T20:10:48.3947981Z libcudart.so.12 => not found 2025-05-07T20:10:48.3948357Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fad2e760000) 2025-05-07T20:10:48.3948832Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fad2e70a000) 2025-05-07T20:10:48.3949282Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fad2e6dc000) 2025-05-07T20:10:48.3949706Z libc.so.6 => /lib64/libc.so.6 (0x00007fad2e4d4000) 2025-05-07T20:10:48.3950086Z /lib64/ld-linux-x86-64.so.2 (0x00007fad2ea3f000) 2025-05-07T20:10:48.3950486Z libm.so.6 => /lib64/libm.so.6 (0x00007fad2e3f9000) 2025-05-07T20:10:48.3950733Z 2025-05-07T20:10:48.3950853Z [CHECK] Displaying ELF information: 2025-05-07T20:10:48.3951376Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:48.3951769Z 2025-05-07T20:10:48.3959190Z 2025-05-07T20:10:48.3959848Z Dynamic section at offset 0x71b10 contains 39 entries: 2025-05-07T20:10:48.3960990Z Tag Type Name/Value 2025-05-07T20:10:48.3961457Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:48.3962035Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:48.3962614Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:48.3963178Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:48.3963712Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:48.3964267Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:48.3964807Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:48.3965371Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:48.3965931Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:48.3966463Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:48.3967019Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:48.3967547Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:48.3968091Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:48.3968641Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:48.3969197Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:48.3969828Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:10:48.3970338Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:10:48.3970717Z 0x000000000000000d (FINI) 0x316ac 2025-05-07T20:10:48.3971065Z 0x0000000000000019 (INIT_ARRAY) 0x71130 2025-05-07T20:10:48.3971510Z 0x000000000000001b (INIT_ARRAYSZ) 40 (bytes) 2025-05-07T20:10:48.3971874Z 0x000000000000001a (FINI_ARRAY) 0x71158 2025-05-07T20:10:48.3972324Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:48.3972743Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:48.3973119Z 0x0000000000000005 (STRTAB) 0x2ba8 2025-05-07T20:10:48.3973553Z 0x0000000000000006 (SYMTAB) 0xa18 2025-05-07T20:10:48.3973924Z 0x000000000000000a (STRSZ) 36158 (bytes) 2025-05-07T20:10:48.3974329Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:48.3974687Z 0x0000000000000003 (PLTGOT) 0x71fe8 2025-05-07T20:10:48.3975074Z 0x0000000000000002 (PLTRELSZ) 5520 (bytes) 2025-05-07T20:10:48.3975478Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:48.3975847Z 0x0000000000000017 (JMPREL) 0xdfa8 2025-05-07T20:10:48.3976216Z 0x0000000000000007 (RELA) 0xbcc8 2025-05-07T20:10:48.3976598Z 0x0000000000000008 (RELASZ) 8928 (bytes) 2025-05-07T20:10:48.3976998Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:48.3977342Z 0x000000006ffffffe (VERNEED) 0xbbb8 2025-05-07T20:10:48.3977679Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:48.3978027Z 0x000000006ffffff0 (VERSYM) 0xb8e6 2025-05-07T20:10:48.3978363Z 0x000000006ffffff9 (RELACOUNT) 162 2025-05-07T20:10:48.3978666Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:48.3978880Z 2025-05-07T20:10:48.3978991Z ################################################################################ 2025-05-07T20:10:48.3979214Z 2025-05-07T20:10:48.3979218Z 2025-05-07T20:10:48.3979341Z ################################################################################ 2025-05-07T20:10:48.3979938Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:48.3980455Z [CHECK] Listing out library size: 2025-05-07T20:10:48.3980916Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:48.3981316Z 2025-05-07T20:10:48.3981544Z 35 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:48.3981899Z 2025-05-07T20:10:48.3982313Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:48.3983320Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.3983934Z 2025-05-07T20:10:48.4092615Z GLIBC_2.2.5 2025-05-07T20:10:48.4093256Z GLIBC_2.3 2025-05-07T20:10:48.4093620Z GLIBC_2.14 2025-05-07T20:10:48.4093736Z 2025-05-07T20:10:48.4093764Z 2025-05-07T20:10:48.4094243Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:48.4095274Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.4095294Z 2025-05-07T20:10:48.4213491Z GLIBCXX_3.4 2025-05-07T20:10:48.4213691Z GLIBCXX_3.4.9 2025-05-07T20:10:48.4214194Z GLIBCXX_3.4.11 2025-05-07T20:10:48.4214308Z GLIBCXX_3.4.15 2025-05-07T20:10:48.4214396Z GLIBCXX_3.4.18 2025-05-07T20:10:48.4214732Z GLIBCXX_3.4.20 2025-05-07T20:10:48.4214818Z GLIBCXX_3.4.21 2025-05-07T20:10:48.4214825Z 2025-05-07T20:10:48.4214829Z 2025-05-07T20:10:48.4232642Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.EBX7nSVaEs.symbols.txt 2025-05-07T20:10:48.4232660Z 2025-05-07T20:10:48.4315248Z 2025-05-07T20:10:48.4341874Z [CHECK] Total Number of symbols: 1545 2025-05-07T20:10:48.4367515Z [CHECK] Number of fbgemm symbols: 211 2025-05-07T20:10:48.4385647Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.eMGi0Mzpvj.usymbols.txt 2025-05-07T20:10:48.4385674Z 2025-05-07T20:10:48.4417536Z 2025-05-07T20:10:48.4452321Z [CHECK] Listing out undefined symbols (266 total): 2025-05-07T20:10:48.4468877Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.4469238Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.4469355Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:48.4469556Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.4469719Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.4469866Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.4470211Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:48.4470351Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:48.4470478Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:48.4470636Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.4470784Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:48.4470897Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:48.4471176Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:48.4471306Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:48.4471426Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:48.4471540Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:48.4471647Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:48.4471781Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:48.4471894Z U __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:10:48.4471994Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:48.4472122Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:48.4472221Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:48.4472329Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:48.4472435Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:48.4472591Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:48.4472740Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:48.4472912Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:48.4473136Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:48.4473262Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:48.4473411Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:48.4473610Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:48.4473770Z U at::TensorMaker::make_tensor() 2025-05-07T20:10:48.4473896Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:10:48.4474069Z U at::_ops::concat::call(c10::ArrayRef, long) 2025-05-07T20:10:48.4474235Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:48.4474829Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.4475538Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.4475719Z U at::_ops::eq_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:48.4475907Z U at::_ops::eq_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:48.4476103Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:48.4476285Z U at::_ops::gt_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:48.4476621Z U at::_ops::index_add::call(at::Tensor const&, long, at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:48.4476848Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:10:48.4476984Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:10:48.4477179Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:48.4477390Z U at::_ops::narrow::call(at::Tensor const&, long, c10::SymInt, c10::SymInt) 2025-05-07T20:10:48.4477643Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:48.4477904Z U at::_ops::split_with_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:10:48.4478220Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:48.4478899Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:48.4479113Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:48.4479281Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:48.4479768Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.4480378Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.4480514Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:48.4480639Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:48.4480847Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:48.4480958Z U at::globalContext() 2025-05-07T20:10:48.4481099Z U at::has_internal_overlap(at::TensorBase const&) 2025-05-07T20:10:48.4481256Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:48.4481358Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:48.4481476Z U bool at::Tensor::item() const 2025-05-07T20:10:48.4481599Z U c10::AnyType::get() 2025-05-07T20:10:48.4481780Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:10:48.4481988Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.4482112Z U c10::BoolType::get() 2025-05-07T20:10:48.4482280Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:48.4482465Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:48.4482608Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:48.4483142Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:48.4483938Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:48.4484323Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:48.4484432Z U c10::Error::what() const 2025-05-07T20:10:48.4484543Z U c10::GradMode::is_enabled() 2025-05-07T20:10:48.4484667Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:48.4484840Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.4485002Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:48.4485142Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:48.4485257Z U c10::IValue::isBoolList() const 2025-05-07T20:10:48.4485372Z U c10::IValue::isIntList() const 2025-05-07T20:10:48.4485507Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:48.4485617Z U c10::IValue::isTensorList() const 2025-05-07T20:10:48.4485762Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:48.4485892Z U c10::IntType::get() 2025-05-07T20:10:48.4486496Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.4486656Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:48.4486807Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:48.4486955Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:48.4487077Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:48.4487344Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:48.4487524Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:48.4487625Z U c10::StringType::get() 2025-05-07T20:10:48.4487761Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:48.4488166Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:48.4488301Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:48.4488414Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:48.4488542Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:48.4488710Z U c10::SymIntType::get() 2025-05-07T20:10:48.4488866Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:48.4488997Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:48.4489607Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:10:48.4489768Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:48.4489891Z U c10::TensorType::get() 2025-05-07T20:10:48.4490084Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:10:48.4490243Z U c10::Type::is_module() const 2025-05-07T20:10:48.4490382Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:48.4491085Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:48.4491222Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:48.4491384Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:48.4491514Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:48.4491625Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:48.4491769Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:48.4491881Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:48.4492196Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:48.4492305Z U c10::cuda::device_count() 2025-05-07T20:10:48.4492441Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:48.4492600Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:48.4492747Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:48.4492890Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:48.4493071Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:48.4493185Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:48.4493613Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.4494165Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.4494421Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:48.4494949Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.4495290Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:48.4495872Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.4496171Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:48.4496445Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:48.4496651Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:10:48.4496793Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:48.4496913Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:48.4497263Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:48.4497459Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:48.4497612Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:48.4497776Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:48.4497914Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:48.4498044Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:48.4498202Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:48.4498602Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:48.4498747Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:48.4498892Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:48.4499072Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:48.4499190Z U c10::throwNullDataPtrError() 2025-05-07T20:10:48.4501265Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:10:48.4501403Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:48.4501531Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:48.4501731Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:48.4501863Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:48.4502033Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:48.4502168Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:48.4502316Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:48.4502468Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:48.4502615Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:48.4502739Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:48.4502879Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:48.4503025Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:48.4503156Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:48.4503300Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:48.4503463Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:48.4503625Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:48.4503745Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:48.4503894Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:48.4504035Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:48.4504169Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:48.4504398Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:10:48.4504601Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.4504705Z U free@GLIBC_2.2.5 2025-05-07T20:10:48.4504858Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.4504993Z U log2f@GLIBC_2.2.5 2025-05-07T20:10:48.4505110Z U long at::Tensor::item() const 2025-05-07T20:10:48.4505288Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:48.4505458Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.4505614Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.4505722Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:48.4505848Z U memcpy@GLIBC_2.14 2025-05-07T20:10:48.4505957Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:48.4506092Z U memset@GLIBC_2.2.5 2025-05-07T20:10:48.4506240Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:48.4506371Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.4506479Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:48.4506701Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:48.4507067Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:48.4507466Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.4507814Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:48.4508195Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:48.4508319Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.4508463Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:48.4508637Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.4508783Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.4508975Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:48.4509110Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:48.4509260Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:48.4509508Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:48.4510102Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.4510240Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:48.4510384Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:48.4510502Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:48.4510625Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:48.4510744Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:48.4510944Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.4511218Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.4511350Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:48.4511533Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:48.4511673Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:48.4511888Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:48.4512343Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:48.4512493Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:48.4512613Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:48.4512733Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:48.4512836Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:48.4512974Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:48.4513599Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:48.4514082Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.4514378Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.4514525Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:48.4514831Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:48.4515022Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:48.4515257Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:48.4515453Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:48.4515824Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:48.4516097Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:48.4516295Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:48.4516475Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:48.4516635Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:48.4516751Z U torch::autograd::Node::metadata() 2025-05-07T20:10:48.4516896Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:48.4517155Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:48.4517422Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:48.4517571Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:48.4517793Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:48.4518015Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:48.4524496Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:48.4524709Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:48.4524897Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:48.4525074Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:48.4525240Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:48.4525680Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:48.4526060Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:48.4526620Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:48.4526792Z U typeinfo for c10::Error 2025-05-07T20:10:48.4526907Z U typeinfo for c10::Type 2025-05-07T20:10:48.4527135Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:48.4527297Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:48.4527444Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:48.4527570Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:48.4527724Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:48.4527914Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:48.4528088Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:48.4528250Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.4528437Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.4528552Z U vtable for c10::Error 2025-05-07T20:10:48.4528663Z U vtable for c10::ListType 2025-05-07T20:10:48.4529019Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.4529161Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:48.4529447Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:48.4529593Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:10:48.4529716Z U vtable for torch::autograd::Node 2025-05-07T20:10:48.4529907Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:48.4530044Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:48.4530155Z w _ITM_registerTMCloneTable 2025-05-07T20:10:48.4530266Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:48.4530384Z w __gmon_start__ 2025-05-07T20:10:48.4530485Z w __pthread_key_create 2025-05-07T20:10:48.4530607Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:48.4530729Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:48.4530902Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:48.4531144Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:48.4531152Z 2025-05-07T20:10:48.4531301Z linux-vdso.so.1 (0x00007ffc101f4000) 2025-05-07T20:10:48.4531415Z libc10.so => not found 2025-05-07T20:10:48.4531521Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.4531621Z libc10_cuda.so => not found 2025-05-07T20:10:48.4531790Z libnccl.so.2 => not found 2025-05-07T20:10:48.4531881Z libcuda.so.1 => not found 2025-05-07T20:10:48.4532444Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fc29d859000) 2025-05-07T20:10:48.4533006Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fc29c600000) 2025-05-07T20:10:48.4533111Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.4533210Z libtorch.so => not found 2025-05-07T20:10:48.4533317Z libtorch_cpu.so => not found 2025-05-07T20:10:48.4533435Z libtorch_cuda.so => not found 2025-05-07T20:10:48.4533532Z libcudart.so.12 => not found 2025-05-07T20:10:48.4533704Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fc29c39c000) 2025-05-07T20:10:48.4533860Z libm.so.6 => /lib64/libm.so.6 (0x00007fc29d77e000) 2025-05-07T20:10:48.4534011Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fc29fe11000) 2025-05-07T20:10:48.4534139Z libc.so.6 => /lib64/libc.so.6 (0x00007fc29c194000) 2025-05-07T20:10:48.4534300Z /lib64/ld-linux-x86-64.so.2 (0x00007fc29fe47000) 2025-05-07T20:10:48.4534397Z libc10.so => not found 2025-05-07T20:10:48.4534495Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.4534592Z libc10_cuda.so => not found 2025-05-07T20:10:48.4534718Z libnccl.so.2 => not found 2025-05-07T20:10:48.4534846Z libcuda.so.1 => not found 2025-05-07T20:10:48.4534952Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.4535078Z libtorch.so => not found 2025-05-07T20:10:48.4535173Z libtorch_cpu.so => not found 2025-05-07T20:10:48.4535272Z libtorch_cuda.so => not found 2025-05-07T20:10:48.4535369Z libcudart.so.12 => not found 2025-05-07T20:10:48.4535498Z libtorch.so => not found 2025-05-07T20:10:48.4535591Z libc10.so => not found 2025-05-07T20:10:48.4535689Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.4535813Z libc10_cuda.so => not found 2025-05-07T20:10:48.4535905Z libnccl.so.2 => not found 2025-05-07T20:10:48.4536003Z libcuda.so.1 => not found 2025-05-07T20:10:48.4536108Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.4536225Z libtorch_cpu.so => not found 2025-05-07T20:10:48.4536329Z libtorch_cuda.so => not found 2025-05-07T20:10:48.4536435Z libcudart.so.12 => not found 2025-05-07T20:10:48.4536623Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fc29fdb3000) 2025-05-07T20:10:48.4536629Z 2025-05-07T20:10:48.4536743Z [CHECK] Displaying ELF information: 2025-05-07T20:10:48.4537009Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:48.4537014Z 2025-05-07T20:10:48.4567132Z 2025-05-07T20:10:48.4567811Z Dynamic section at offset 0x220d958 contains 42 entries: 2025-05-07T20:10:48.4568009Z Tag Type Name/Value 2025-05-07T20:10:48.4568298Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:48.4568588Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:48.4568916Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:48.4569175Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:48.4569454Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:48.4569801Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:48.4570164Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:48.4570530Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:48.4570898Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:48.4571292Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:48.4571660Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:48.4572086Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:48.4572302Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:48.4572506Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:48.4572743Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:48.4573029Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:48.4573261Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:48.4573548Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:10:48.4573749Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:48.4573879Z 0x000000000000000c (INIT) 0x56000 2025-05-07T20:10:48.4574037Z 0x000000000000000d (FINI) 0x1515ac 2025-05-07T20:10:48.4574164Z 0x0000000000000019 (INIT_ARRAY) 0x220b430 2025-05-07T20:10:48.4574303Z 0x000000000000001b (INIT_ARRAYSZ) 144 (bytes) 2025-05-07T20:10:48.4574441Z 0x000000000000001a (FINI_ARRAY) 0x220b4c0 2025-05-07T20:10:48.4574601Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:48.4574729Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:48.4574846Z 0x0000000000000005 (STRTAB) 0xbb50 2025-05-07T20:10:48.4574999Z 0x0000000000000006 (SYMTAB) 0x2a60 2025-05-07T20:10:48.4575208Z 0x000000000000000a (STRSZ) 242227 (bytes) 2025-05-07T20:10:48.4575333Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:48.4575506Z 0x0000000000000003 (PLTGOT) 0x220efe8 2025-05-07T20:10:48.4575661Z 0x0000000000000002 (PLTRELSZ) 16872 (bytes) 2025-05-07T20:10:48.4575782Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:48.4575930Z 0x0000000000000017 (JMPREL) 0x512d8 2025-05-07T20:10:48.4576054Z 0x0000000000000007 (RELA) 0x47af8 2025-05-07T20:10:48.4576197Z 0x0000000000000008 (RELASZ) 38880 (bytes) 2025-05-07T20:10:48.4576329Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:48.4576488Z 0x000000006ffffffe (VERNEED) 0x47998 2025-05-07T20:10:48.4576600Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:10:48.4576726Z 0x000000006ffffff0 (VERSYM) 0x46d84 2025-05-07T20:10:48.4576886Z 0x000000006ffffff9 (RELACOUNT) 571 2025-05-07T20:10:48.4577005Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:48.4577012Z 2025-05-07T20:10:48.4577138Z ################################################################################ 2025-05-07T20:10:48.4577178Z 2025-05-07T20:10:48.4577182Z 2025-05-07T20:10:48.4577328Z ################################################################################ 2025-05-07T20:10:48.4577576Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:48.4577695Z [CHECK] Listing out library size: 2025-05-07T20:10:48.4577971Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:48.4577976Z 2025-05-07T20:10:48.4583568Z 73 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:48.4584733Z 2025-05-07T20:10:48.4586013Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:48.4586649Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.4586660Z 2025-05-07T20:10:48.5015190Z GLIBC_2.2.5 2025-05-07T20:10:48.5015365Z GLIBC_2.3 2025-05-07T20:10:48.5015958Z GLIBC_2.14 2025-05-07T20:10:48.5015974Z 2025-05-07T20:10:48.5015987Z 2025-05-07T20:10:48.5016444Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:48.5017144Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.5017151Z 2025-05-07T20:10:48.5431038Z GLIBCXX_3.4 2025-05-07T20:10:48.5431143Z GLIBCXX_3.4.9 2025-05-07T20:10:48.5431240Z GLIBCXX_3.4.11 2025-05-07T20:10:48.5431361Z GLIBCXX_3.4.14 2025-05-07T20:10:48.5431455Z GLIBCXX_3.4.15 2025-05-07T20:10:48.5431702Z GLIBCXX_3.4.18 2025-05-07T20:10:48.5431800Z GLIBCXX_3.4.19 2025-05-07T20:10:48.5431931Z GLIBCXX_3.4.20 2025-05-07T20:10:48.5432029Z GLIBCXX_3.4.21 2025-05-07T20:10:48.5435319Z 2025-05-07T20:10:48.5435324Z 2025-05-07T20:10:48.5456005Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.4iWZ1Lb1Aa.symbols.txt 2025-05-07T20:10:48.5456027Z 2025-05-07T20:10:48.5805155Z 2025-05-07T20:10:48.5838541Z [CHECK] Total Number of symbols: 6648 2025-05-07T20:10:48.5862732Z [CHECK] Number of fbgemm symbols: 4516 2025-05-07T20:10:48.5881926Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.YInp99cJ3S.usymbols.txt 2025-05-07T20:10:48.5881969Z 2025-05-07T20:10:48.5919194Z 2025-05-07T20:10:48.5948516Z [CHECK] Listing out undefined symbols (465 total): 2025-05-07T20:10:48.5972829Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.5974792Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.5975515Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:48.5975722Z U __assert_fail@GLIBC_2.2.5 2025-05-07T20:10:48.5975994Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.5976164Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.5976324Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.5976470Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:48.5976606Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:48.5976736Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:48.5976900Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.5977023Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:48.5977138Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:48.5977284Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:48.5977395Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:48.5977516Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:48.5977653Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:48.5977825Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:48.5977928Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:48.5978031Z U __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:10:48.5978144Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:48.5978252Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:48.5978353Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:48.5978476Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:48.5978580Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:10:48.5978677Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:48.5978870Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:48.5978998Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:48.5979122Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:48.5979269Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:48.5979402Z U at::SplitUntil32Bit::begin() const 2025-05-07T20:10:48.5979513Z U at::SplitUntil32Bit::end() const 2025-05-07T20:10:48.5979658Z U at::SplitUntil32Bit::iterator::operator*() const 2025-05-07T20:10:48.5979892Z U at::SplitUntil32Bit::iterator::operator++() 2025-05-07T20:10:48.5980223Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:10:48.5980412Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:48.5980658Z U at::TensorIteratorBase::build(at::TensorIteratorConfig&) 2025-05-07T20:10:48.5980855Z U at::TensorIteratorBase::can_use_32bit_indexing() const 2025-05-07T20:10:48.5980994Z U at::TensorIteratorBase::data_ptr(long) const 2025-05-07T20:10:48.5981157Z U at::TensorIteratorBase::is_contiguous() const 2025-05-07T20:10:48.5981281Z U at::TensorIteratorBase::numel() const 2025-05-07T20:10:48.5981435Z U at::TensorIteratorBase::with_32bit_indexing() const 2025-05-07T20:10:48.5981662Z U at::TensorIteratorConfig::add_borrowed_input(at::TensorBase const&) 2025-05-07T20:10:48.5981884Z U at::TensorIteratorConfig::add_borrowed_output(at::TensorBase const&) 2025-05-07T20:10:48.5981994Z U at::TensorMaker::make_tensor() 2025-05-07T20:10:48.5982150Z U at::_ops::_is_all_true::call(at::Tensor const&) 2025-05-07T20:10:48.6004315Z U at::_ops::_unique::call(at::Tensor const&, bool, bool) 2025-05-07T20:10:48.6004650Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:48.6005032Z U at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:48.6005163Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:10:48.6005524Z U at::_ops::baddbmm::call(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) 2025-05-07T20:10:48.6005769Z U at::_ops::broadcast_to::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:48.6005939Z U at::_ops::cat::call(c10::IListRef const&, long) 2025-05-07T20:10:48.6006149Z U at::_ops::cat_out::call(c10::IListRef const&, long, at::Tensor&) 2025-05-07T20:10:48.6006348Z U at::_ops::clamp_max::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:48.6006562Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:10:48.6006746Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:10:48.6006930Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:48.6007193Z U at::_ops::cumsum::call(at::Tensor const&, long, std::optional) 2025-05-07T20:10:48.6007526Z U at::_ops::diff::call(at::Tensor const&, long, long, std::optional const&, std::optional const&) 2025-05-07T20:10:48.6007726Z U at::_ops::div_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:48.6008309Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.6008941Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.6009133Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:48.6009322Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:48.6009475Z U at::_ops::floor::call(at::Tensor const&) 2025-05-07T20:10:48.6010004Z U at::_ops::full::call(c10::ArrayRef, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.6010213Z U at::_ops::ge_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:48.6010548Z U at::_ops::index_put_::call(at::Tensor&, c10::List > const&, at::Tensor const&, bool) 2025-05-07T20:10:48.6010793Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:10:48.6010928Z U at::_ops::item::call(at::Tensor const&) 2025-05-07T20:10:48.6011127Z U at::_ops::le_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:48.6011251Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:10:48.6011429Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:48.6012025Z U at::_ops::ones_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.6012209Z U at::_ops::permute::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:48.6012723Z U at::_ops::range::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.6012945Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:48.6013272Z U at::_ops::resize_::call(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:10:48.6013453Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:48.6013911Z U at::_ops::set__source_Storage_storage_offset::call(at::Tensor&, c10::Storage, c10::SymInt, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.6014441Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:10:48.6014621Z U at::_ops::sort::call(at::Tensor const&, long, bool) 2025-05-07T20:10:48.6014863Z U at::_ops::split_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:10:48.6015016Z U at::_ops::squeeze_dim::call(at::Tensor const&, long) 2025-05-07T20:10:48.6015279Z U at::_ops::sub_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:48.6015483Z U at::_ops::sum::call(at::Tensor const&, std::optional) 2025-05-07T20:10:48.6015775Z U at::_ops::tensor_split_indices::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:10:48.6016116Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:48.6016766Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:48.6016942Z U at::_ops::transpose_int::call(at::Tensor const&, long, long) 2025-05-07T20:10:48.6017234Z U at::_ops::unique_consecutive::call(at::Tensor const&, bool, bool, std::optional) 2025-05-07T20:10:48.6017391Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:10:48.6017577Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:48.6017768Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:48.6017888Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:48.6018376Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.6019009Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.6019321Z U at::checkScalarTypes(char const*, at::TensorArg const&, c10::ArrayRef) 2025-05-07T20:10:48.6019461Z U at::cuda::getCurrentCUDABlasHandle() 2025-05-07T20:10:48.6019627Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:48.6019862Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:10:48.6020021Z U at::cuda::get_p2p_access(signed char, signed char) 2025-05-07T20:10:48.6020397Z U at::detail::computeStorageNbytes(c10::ArrayRef, c10::ArrayRef, unsigned long, unsigned long) 2025-05-07T20:10:48.6020533Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:48.6020700Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:48.6020885Z U at::get_num_threads() 2025-05-07T20:10:48.6020991Z U at::get_thread_num() 2025-05-07T20:10:48.6021223Z U at::internal::OpaqueOptionalTensorRef::~OpaqueOptionalTensorRef() 2025-05-07T20:10:48.6021368Z U at::internal::set_thread_num(int) 2025-05-07T20:10:48.6021649Z U at::native::_rowwise_prune(at::Tensor const&, at::Tensor const&, c10::ScalarType) 2025-05-07T20:10:48.6022445Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.6023112Z U at::native::empty_meta_symint(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.6023399Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:10:48.6023567Z U at::print(std::ostream&, at::Tensor const&, long) 2025-05-07T20:10:48.6023705Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:48.6023887Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:10:48.6023995Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:48.6024145Z U bool at::Tensor::item() const 2025-05-07T20:10:48.6024290Z U bool* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.6024519Z U bool* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.6024651Z U c10::AnyType::get() 2025-05-07T20:10:48.6024830Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:10:48.6025018Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.6025258Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.6025366Z U c10::BoolType::get() 2025-05-07T20:10:48.6025486Z U c10::DeviceObjType::get() 2025-05-07T20:10:48.6025688Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:48.6025885Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:48.6026013Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:48.6026573Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:48.6027225Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:48.6027646Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:48.6027785Z U c10::Error::what() const 2025-05-07T20:10:48.6027895Z U c10::FloatType::get() 2025-05-07T20:10:48.6028047Z U c10::GradMode::is_enabled() 2025-05-07T20:10:48.6028190Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:48.6028354Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.6028538Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.6028726Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:48.6028856Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:48.6028974Z U c10::IValue::isBoolList() const 2025-05-07T20:10:48.6029112Z U c10::IValue::isIntList() const 2025-05-07T20:10:48.6029237Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:48.6029358Z U c10::IValue::isTensorList() const 2025-05-07T20:10:48.6029512Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:48.6029652Z U c10::InferenceMode::is_enabled() 2025-05-07T20:10:48.6029764Z U c10::IntType::get() 2025-05-07T20:10:48.6030256Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.6030499Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:48.6030639Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:48.6030771Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:48.6030923Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:48.6031158Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.6031296Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:48.6031450Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:48.6031570Z U c10::ScalarTypeType::get() 2025-05-07T20:10:48.6031859Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:48.6032217Z U c10::SmallVectorBase::mallocForGrow(unsigned long, unsigned long, unsigned long&) 2025-05-07T20:10:48.6032388Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:48.6032527Z U c10::StringType::get() 2025-05-07T20:10:48.6032704Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:48.6032854Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:48.6033009Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:48.6033448Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:48.6033600Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:48.6033746Z U c10::SymInt::operator%(c10::SymInt const&) const 2025-05-07T20:10:48.6033898Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:10:48.6034028Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:48.6034140Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:48.6034295Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:48.6034435Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:10:48.6034551Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:48.6034684Z U c10::SymIntType::get() 2025-05-07T20:10:48.6034887Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:48.6035015Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:48.6035489Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:10:48.6035683Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:48.6035800Z U c10::TensorType::get() 2025-05-07T20:10:48.6036621Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:10:48.6036822Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:10:48.6036932Z U c10::Type::is_module() const 2025-05-07T20:10:48.6037096Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:48.6037820Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:48.6037961Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:48.6038195Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:10:48.6038471Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:10:48.6038820Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:10:48.6038973Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:48.6039100Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:48.6039225Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:48.6039376Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:48.6039491Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:48.6039748Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:48.6039917Z U c10::cuda::current_device() 2025-05-07T20:10:48.6040153Z U c10::cuda::device_count() 2025-05-07T20:10:48.6040424Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:48.6040666Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:48.6040993Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:48.6041245Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:48.6041534Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:48.6041761Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:48.6042419Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.6043127Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.6043462Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:48.6044235Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.6044906Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:48.6045772Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.6046064Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:48.6046354Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:48.6048133Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:10:48.6048286Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:48.6048425Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:48.6048749Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:48.6048939Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:48.6049085Z U c10::impl::PyObjectSlot::PyObjectSlot() 2025-05-07T20:10:48.6049225Z U c10::impl::PyObjectSlot::~PyObjectSlot() 2025-05-07T20:10:48.6049378Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:48.6049575Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:48.6049697Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:48.6049820Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:48.6050112Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:48.6050487Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:48.6050620Z U c10::operator*(c10::SymInt const&, int) 2025-05-07T20:10:48.6050769Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:10:48.6050916Z U c10::operator+(c10::SymInt const&, unsigned long) 2025-05-07T20:10:48.6051044Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:48.6051218Z U c10::operator-(c10::SymInt const&, unsigned long) 2025-05-07T20:10:48.6051346Z U c10::operator/(c10::SymInt const&, int) 2025-05-07T20:10:48.6051472Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:10:48.6051643Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:48.6051783Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:48.6051955Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:48.6052130Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:48.6052297Z U c10::operator==(c10::SymInt const&, int) 2025-05-07T20:10:48.6052424Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:10:48.6052557Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:10:48.6052703Z U c10::report_overflow(char const*) 2025-05-07T20:10:48.6052831Z U c10::throwNullDataPtrError() 2025-05-07T20:10:48.6052967Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:10:48.6053105Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:48.6053230Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:48.6053444Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:48.6053596Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:48.6053699Z U ceil@GLIBC_2.2.5 2025-05-07T20:10:48.6053827Z U cublasGemmStridedBatchedEx 2025-05-07T20:10:48.6053956Z U cublasSetStream_v2 2025-05-07T20:10:48.6054100Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:48.6054243Z U cudaDeviceGetByPCIBusId@libcudart.so.12 2025-05-07T20:10:48.6054377Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:48.6054569Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:48.6054693Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:48.6054828Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:48.6054970Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:48.6055127Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:48.6055260Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:48.6055376Z U cudaFree@libcudart.so.12 2025-05-07T20:10:48.6055534Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:48.6055667Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:48.6055789Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:48.6055942Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:48.6056087Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:48.6056217Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:48.6056369Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:48.6056512Z U cudaHostGetDevicePointer@libcudart.so.12 2025-05-07T20:10:48.6056638Z U cudaHostRegister@libcudart.so.12 2025-05-07T20:10:48.6056768Z U cudaHostUnregister@libcudart.so.12 2025-05-07T20:10:48.6056920Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:48.6057081Z U cudaMallocManaged@libcudart.so.12 2025-05-07T20:10:48.6057200Z U cudaMemAdvise@libcudart.so.12 2025-05-07T20:10:48.6057363Z U cudaMemPrefetchAsync@libcudart.so.12 2025-05-07T20:10:48.6057491Z U cudaMemcpy2DAsync@libcudart.so.12 2025-05-07T20:10:48.6057615Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:10:48.6057768Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:10:48.6058076Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:48.6058237Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:48.6058363Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:48.6058502Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:48.6058640Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:48.6058775Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:48.6058952Z U double* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.6059129Z U double* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.6059231Z U exit@GLIBC_2.2.5 2025-05-07T20:10:48.6059383Z U exp10@GLIBC_2.2.5 2025-05-07T20:10:48.6059484Z U exp2@GLIBC_2.2.5 2025-05-07T20:10:48.6059583Z U exp@GLIBC_2.2.5 2025-05-07T20:10:48.6059681Z U expf@GLIBC_2.2.5 2025-05-07T20:10:48.6059995Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:10:48.6060203Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:48.6060409Z U fbgemm_gpu::asynchronous_exclusive_cumsum_cpu(at::Tensor const&) 2025-05-07T20:10:48.6060637Z U fbgemm_gpu::asynchronous_exclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:48.6060837Z U fbgemm_gpu::asynchronous_inclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:48.6060988Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.6061175Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.6061277Z U fmod@GLIBC_2.2.5 2025-05-07T20:10:48.6061377Z U free@GLIBC_2.2.5 2025-05-07T20:10:48.6061520Z U get_info_B_num_bits_from_T(int, int) 2025-05-07T20:10:48.6061639Z U int at::Tensor::item() const 2025-05-07T20:10:48.6061835Z U int const* at::TensorBase::const_data_ptr() const 2025-05-07T20:10:48.6061992Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.6062145Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.6062248Z U isnan@GLIBC_2.2.5 2025-05-07T20:10:48.6062382Z U lgamma@GLIBC_2.2.5 2025-05-07T20:10:48.6062506Z U llrint@GLIBC_2.2.5 2025-05-07T20:10:48.6062617Z U llround@GLIBC_2.2.5 2025-05-07T20:10:48.6062715Z U log10@GLIBC_2.2.5 2025-05-07T20:10:48.6062839Z U log2@GLIBC_2.2.5 2025-05-07T20:10:48.6062939Z U log@GLIBC_2.2.5 2025-05-07T20:10:48.6063036Z U logl@GLIBC_2.2.5 2025-05-07T20:10:48.6063181Z U long at::Tensor::item() const 2025-05-07T20:10:48.6063363Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:48.6063541Z U long const* at::TensorBase::const_data_ptr() const 2025-05-07T20:10:48.6063682Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.6063865Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.6063966Z U lrint@GLIBC_2.2.5 2025-05-07T20:10:48.6064072Z U madvise@GLIBC_2.2.5 2025-05-07T20:10:48.6064195Z U malloc@GLIBC_2.2.5 2025-05-07T20:10:48.6064324Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:48.6064429Z U memcpy@GLIBC_2.14 2025-05-07T20:10:48.6064562Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:48.6064659Z U memset@GLIBC_2.2.5 2025-05-07T20:10:48.6064764Z U nextafter@GLIBC_2.2.5 2025-05-07T20:10:48.6064880Z U nvmlDeviceGetCount_v2 2025-05-07T20:10:48.6065033Z U nvmlDeviceGetHandleByIndex_v2 2025-05-07T20:10:48.6065176Z U nvmlDeviceGetNvLinkRemotePciInfo_v2 2025-05-07T20:10:48.6065300Z U nvmlDeviceGetNvLinkState 2025-05-07T20:10:48.6065440Z U nvmlDeviceGetPciInfo_v3 2025-05-07T20:10:48.6065536Z U nvmlInit_v2 2025-05-07T20:10:48.6065659Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:48.6065791Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.6065945Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.6066052Z U pow@GLIBC_2.2.5 2025-05-07T20:10:48.6066154Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:48.6066343Z U short* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.6066628Z U signed char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.6066732Z U sin@GLIBC_2.2.5 2025-05-07T20:10:48.6066976Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:48.6067160Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:48.6067360Z U std::_Rb_tree_increment(std::_Rb_tree_node_base const*)@GLIBCXX_3.4 2025-05-07T20:10:48.6067566Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:48.6067957Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:10:48.6068313Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:48.6068743Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.6069080Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:48.6069491Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.6069898Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:48.6070048Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:10:48.6070174Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:10:48.6070320Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.6070443Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:48.6070559Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:10:48.6070722Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:48.6070877Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.6071024Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.6071205Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.6071386Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:48.6071527Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:48.6071697Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:48.6071914Z U std::basic_filebuf >::close()@GLIBCXX_3.4 2025-05-07T20:10:48.6072294Z U std::basic_ifstream >::basic_ifstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:10:48.6072571Z U std::basic_ifstream >::~basic_ifstream()@GLIBCXX_3.4 2025-05-07T20:10:48.6072820Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:48.6073171Z U std::basic_ofstream >::basic_ofstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:10:48.6073447Z U std::basic_ofstream >::~basic_ofstream()@GLIBCXX_3.4 2025-05-07T20:10:48.6074039Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.6074228Z U std::chrono::_V2::system_clock::now()@GLIBCXX_3.4.19 2025-05-07T20:10:48.6074338Z U std::cout@GLIBCXX_3.4 2025-05-07T20:10:48.6074507Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:10:48.6074667Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:48.6074816Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:48.6074949Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:48.6075072Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:48.6075224Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:48.6075429Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:10:48.6075617Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.6075886Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.6076014Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:10:48.6076147Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:48.6076296Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:10:48.6076449Z U std::ostream::write(char const*, long)@GLIBCXX_3.4 2025-05-07T20:10:48.6076621Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:48.6076788Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:48.6077002Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:48.6077440Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:48.6077633Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:48.6077752Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:48.6077858Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:48.6077978Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:48.6078079Z U sysconf@GLIBC_2.2.5 2025-05-07T20:10:48.6078214Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:48.6078844Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:48.6079324Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.6079839Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:10:48.6080128Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.6080287Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:48.6080594Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:48.6080811Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:48.6081028Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:48.6081224Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:48.6081609Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:48.6081770Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:48.6081975Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:48.6082186Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:48.6082317Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:48.6082440Z U torch::autograd::Node::metadata() 2025-05-07T20:10:48.6082639Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:48.6082902Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:48.6083183Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:48.6083362Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:48.6083582Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:48.6083813Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:48.6086542Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:48.6086739Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:48.6086921Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:48.6087089Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:48.6087279Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:48.6087695Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:48.6088069Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:48.6088491Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:48.6088697Z U torch::pickle_load(std::vector > const&) 2025-05-07T20:10:48.6088864Z U torch::pickle_save(c10::IValue const&) 2025-05-07T20:10:48.6089900Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:48.6090147Z U typeinfo for c10::Error 2025-05-07T20:10:48.6090343Z U typeinfo for c10::Type 2025-05-07T20:10:48.6090623Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:48.6090839Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:48.6091080Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:48.6091334Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:48.6091526Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:48.6091811Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.6092220Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.6093020Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(float const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:48.6093732Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:48.6094284Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, float const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:48.6094829Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:48.6095293Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, float*) 2025-05-07T20:10:48.6095842Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, unsigned short*) 2025-05-07T20:10:48.6096331Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:10:48.6096919Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:10:48.6097450Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:10:48.6098079Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:10:48.6098707Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:10:48.6098867Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:48.6099069Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:48.6099257Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:48.6099421Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.6099590Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.6099724Z U vtable for at::TensorIterator 2025-05-07T20:10:48.6099938Z U vtable for at::TensorIteratorBase 2025-05-07T20:10:48.6100047Z U vtable for c10::Error 2025-05-07T20:10:48.6100179Z U vtable for c10::ListType 2025-05-07T20:10:48.6100517Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.6100654Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:48.6100945Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:48.6101123Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:10:48.6101242Z U vtable for torch::autograd::Node 2025-05-07T20:10:48.6101448Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:48.6101569Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:48.6101682Z w _ITM_registerTMCloneTable 2025-05-07T20:10:48.6101794Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:48.6101920Z w __gmon_start__ 2025-05-07T20:10:48.6102021Z w __pthread_key_create 2025-05-07T20:10:48.6102139Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:48.6102287Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:48.6102393Z w pthread_once 2025-05-07T20:10:48.6102545Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:48.6102730Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:48.6102757Z 2025-05-07T20:10:48.6102875Z linux-vdso.so.1 (0x00007ffef2df4000) 2025-05-07T20:10:48.6102972Z libc10.so => not found 2025-05-07T20:10:48.6103078Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.6103230Z libc10_cuda.so => not found 2025-05-07T20:10:48.6103331Z libnccl.so.2 => not found 2025-05-07T20:10:48.6103431Z libcuda.so.1 => not found 2025-05-07T20:10:48.6103834Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f8bd3800000) 2025-05-07T20:10:48.6104415Z fbgemm_gpu_embedding_inplace_ops.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so (0x00007f8bd3d8d000) 2025-05-07T20:10:48.6104937Z fbgemm_gpu_tbe_index_select.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so (0x00007f8bd1400000) 2025-05-07T20:10:48.6105425Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f8bcfc00000) 2025-05-07T20:10:48.6105942Z fbgemm_gpu_tbe_optimizers.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so (0x00007f8bcf200000) 2025-05-07T20:10:48.6106056Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.6106180Z libtorch.so => not found 2025-05-07T20:10:48.6106733Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f8bcf059000) 2025-05-07T20:10:48.6107392Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f8bcde00000) 2025-05-07T20:10:48.6107599Z libtorch_cpu.so => not found 2025-05-07T20:10:48.6107806Z libtorch_cuda.so => not found 2025-05-07T20:10:48.6107969Z libcudart.so.12 => not found 2025-05-07T20:10:48.6108311Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f8bcdb9c000) 2025-05-07T20:10:48.6108545Z libm.so.6 => /lib64/libm.so.6 (0x00007f8bd1325000) 2025-05-07T20:10:48.6108839Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f8bd8b74000) 2025-05-07T20:10:48.6109068Z libc.so.6 => /lib64/libc.so.6 (0x00007f8bcd994000) 2025-05-07T20:10:48.6109329Z /lib64/ld-linux-x86-64.so.2 (0x00007f8bd8bac000) 2025-05-07T20:10:48.6109500Z libc10.so => not found 2025-05-07T20:10:48.6109675Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.6109834Z libc10_cuda.so => not found 2025-05-07T20:10:48.6110027Z libnccl.so.2 => not found 2025-05-07T20:10:48.6110173Z libcuda.so.1 => not found 2025-05-07T20:10:48.6110691Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f8bd3789000) 2025-05-07T20:10:48.6110858Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.6110985Z libtorch.so => not found 2025-05-07T20:10:48.6111113Z libtorch_cpu.so => not found 2025-05-07T20:10:48.6111251Z libtorch_cuda.so => not found 2025-05-07T20:10:48.6111401Z libtorch.so => not found 2025-05-07T20:10:48.6111580Z libc10.so => not found 2025-05-07T20:10:48.6111744Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.6111925Z libc10_cuda.so => not found 2025-05-07T20:10:48.6112083Z libnccl.so.2 => not found 2025-05-07T20:10:48.6112227Z libcuda.so.1 => not found 2025-05-07T20:10:48.6112453Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.6112668Z libtorch_cpu.so => not found 2025-05-07T20:10:48.6112855Z libtorch_cuda.so => not found 2025-05-07T20:10:48.6113025Z libcudart.so.12 => not found 2025-05-07T20:10:48.6113319Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f8bd12cf000) 2025-05-07T20:10:48.6113472Z libc10.so => not found 2025-05-07T20:10:48.6113632Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.6113798Z libc10_cuda.so => not found 2025-05-07T20:10:48.6113951Z libnccl.so.2 => not found 2025-05-07T20:10:48.6114108Z libcuda.so.1 => not found 2025-05-07T20:10:48.6114277Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.6114453Z libtorch.so => not found 2025-05-07T20:10:48.6114641Z libtorch_cpu.so => not found 2025-05-07T20:10:48.6114841Z libtorch_cuda.so => not found 2025-05-07T20:10:48.6115035Z libcudart.so.12 => not found 2025-05-07T20:10:48.6115222Z libtorch.so => not found 2025-05-07T20:10:48.6115441Z libc10.so => not found 2025-05-07T20:10:48.6115599Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.6115772Z libc10_cuda.so => not found 2025-05-07T20:10:48.6115925Z libnccl.so.2 => not found 2025-05-07T20:10:48.6116089Z libcuda.so.1 => not found 2025-05-07T20:10:48.6116261Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.6116458Z libtorch_cpu.so => not found 2025-05-07T20:10:48.6116646Z libtorch_cuda.so => not found 2025-05-07T20:10:48.6116808Z libcudart.so.12 => not found 2025-05-07T20:10:48.6117011Z libtorch.so => not found 2025-05-07T20:10:48.6117172Z libc10.so => not found 2025-05-07T20:10:48.6117328Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.6117491Z libc10_cuda.so => not found 2025-05-07T20:10:48.6117642Z libnccl.so.2 => not found 2025-05-07T20:10:48.6117784Z libcuda.so.1 => not found 2025-05-07T20:10:48.6117919Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.6118073Z libtorch_cpu.so => not found 2025-05-07T20:10:48.6118208Z libtorch_cuda.so => not found 2025-05-07T20:10:48.6118374Z libcudart.so.12 => not found 2025-05-07T20:10:48.6118523Z libc10.so => not found 2025-05-07T20:10:48.6118713Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.6118887Z libc10_cuda.so => not found 2025-05-07T20:10:48.6119067Z libnccl.so.2 => not found 2025-05-07T20:10:48.6119266Z libcuda.so.1 => not found 2025-05-07T20:10:48.6119433Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.6119649Z libtorch.so => not found 2025-05-07T20:10:48.6119818Z libtorch_cpu.so => not found 2025-05-07T20:10:48.6120004Z libtorch_cuda.so => not found 2025-05-07T20:10:48.6120170Z libcudart.so.12 => not found 2025-05-07T20:10:48.6120321Z libtorch.so => not found 2025-05-07T20:10:48.6120486Z libc10.so => not found 2025-05-07T20:10:48.6120730Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.6120946Z libc10_cuda.so => not found 2025-05-07T20:10:48.6121113Z libnccl.so.2 => not found 2025-05-07T20:10:48.6121245Z libcuda.so.1 => not found 2025-05-07T20:10:48.6121350Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.6121455Z libtorch_cpu.so => not found 2025-05-07T20:10:48.6121580Z libtorch_cuda.so => not found 2025-05-07T20:10:48.6121681Z libcudart.so.12 => not found 2025-05-07T20:10:48.6121779Z libtorch.so => not found 2025-05-07T20:10:48.6121872Z libc10.so => not found 2025-05-07T20:10:48.6122154Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.6122253Z libc10_cuda.so => not found 2025-05-07T20:10:48.6122358Z libnccl.so.2 => not found 2025-05-07T20:10:48.6122476Z libcuda.so.1 => not found 2025-05-07T20:10:48.6122581Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.6122689Z libtorch_cpu.so => not found 2025-05-07T20:10:48.6122791Z libtorch_cuda.so => not found 2025-05-07T20:10:48.6122990Z librt.so.1 => /lib64/librt.so.1 (0x00007f8bd3d88000) 2025-05-07T20:10:48.6123321Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f8bd3d83000) 2025-05-07T20:10:48.6123327Z 2025-05-07T20:10:48.6123445Z [CHECK] Displaying ELF information: 2025-05-07T20:10:48.6123682Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:48.6123689Z 2025-05-07T20:10:48.6123726Z 2025-05-07T20:10:48.6123896Z Dynamic section at offset 0x48e4fa8 contains 47 entries: 2025-05-07T20:10:48.6124050Z Tag Type Name/Value 2025-05-07T20:10:48.6124251Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:48.6124468Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:48.6124668Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:48.6124898Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:48.6125105Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:48.6125302Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:48.6125595Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:10:48.6125840Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:10:48.6126114Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:48.6126372Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:10:48.6126593Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:48.6126794Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:48.6127064Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:48.6127292Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:48.6127503Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:48.6127742Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:48.6127956Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:48.6128169Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:48.6128367Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:48.6128599Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:48.6128828Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:48.6129053Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:48.6129283Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_py.so] 2025-05-07T20:10:48.6129507Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:48.6129633Z 0x000000000000000c (INIT) 0x1bb000 2025-05-07T20:10:48.6129779Z 0x000000000000000d (FINI) 0x75816c 2025-05-07T20:10:48.6129905Z 0x0000000000000019 (INIT_ARRAY) 0x48d6858 2025-05-07T20:10:48.6130041Z 0x000000000000001b (INIT_ARRAYSZ) 1160 (bytes) 2025-05-07T20:10:48.6130173Z 0x000000000000001a (FINI_ARRAY) 0x48d6ce0 2025-05-07T20:10:48.6130325Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:48.6130448Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:48.6130572Z 0x0000000000000005 (STRTAB) 0x33248 2025-05-07T20:10:48.6130714Z 0x0000000000000006 (SYMTAB) 0xc2f0 2025-05-07T20:10:48.6130861Z 0x000000000000000a (STRSZ) 1276767 (bytes) 2025-05-07T20:10:48.6130983Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:48.6131130Z 0x0000000000000003 (PLTGOT) 0x48eafe8 2025-05-07T20:10:48.6131270Z 0x0000000000000002 (PLTRELSZ) 68808 (bytes) 2025-05-07T20:10:48.6131412Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:48.6131528Z 0x0000000000000017 (JMPREL) 0x1a9648 2025-05-07T20:10:48.6131663Z 0x0000000000000007 (RELA) 0x16e320 2025-05-07T20:10:48.6131806Z 0x0000000000000008 (RELASZ) 242472 (bytes) 2025-05-07T20:10:48.6131929Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:48.6132074Z 0x000000006ffffffe (VERNEED) 0x16e1a0 2025-05-07T20:10:48.6132188Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:10:48.6132308Z 0x000000006ffffff0 (VERSYM) 0x16ada8 2025-05-07T20:10:48.6132431Z 0x000000006ffffff9 (RELACOUNT) 2870 2025-05-07T20:10:48.6132559Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:48.6132564Z 2025-05-07T20:10:48.6132682Z ################################################################################ 2025-05-07T20:10:48.6132687Z 2025-05-07T20:10:48.6132691Z 2025-05-07T20:10:48.6132829Z ################################################################################ 2025-05-07T20:10:48.6133149Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:48.6133258Z [CHECK] Listing out library size: 2025-05-07T20:10:48.6133591Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:48.6133616Z 2025-05-07T20:10:48.6133854Z 904 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:48.6133859Z 2025-05-07T20:10:48.6134290Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:48.6134848Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.6134853Z 2025-05-07T20:10:48.8131554Z GLIBC_2.2.5 2025-05-07T20:10:48.8131888Z GLIBC_2.3 2025-05-07T20:10:48.8132142Z GLIBC_2.14 2025-05-07T20:10:48.8132284Z 2025-05-07T20:10:48.8132289Z 2025-05-07T20:10:48.8132782Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:48.8133966Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.8134628Z 2025-05-07T20:10:49.0167031Z GLIBCXX_3.4 2025-05-07T20:10:49.0167321Z GLIBCXX_3.4.9 2025-05-07T20:10:49.0167606Z GLIBCXX_3.4.11 2025-05-07T20:10:49.0168017Z GLIBCXX_3.4.14 2025-05-07T20:10:49.0168288Z GLIBCXX_3.4.15 2025-05-07T20:10:49.0168518Z GLIBCXX_3.4.18 2025-05-07T20:10:49.0168783Z GLIBCXX_3.4.20 2025-05-07T20:10:49.0169017Z GLIBCXX_3.4.21 2025-05-07T20:10:49.0169188Z 2025-05-07T20:10:49.0169194Z 2025-05-07T20:10:49.0195579Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.9lbTFTKLLp.symbols.txt 2025-05-07T20:10:49.0197174Z 2025-05-07T20:10:49.2147434Z 2025-05-07T20:10:49.2230913Z [CHECK] Total Number of symbols: 12682 2025-05-07T20:10:49.2376538Z [CHECK] Number of fbgemm symbols: 2318 2025-05-07T20:10:49.2398749Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.laG5GPqFjO.usymbols.txt 2025-05-07T20:10:49.2399339Z 2025-05-07T20:10:49.2461843Z 2025-05-07T20:10:49.2497204Z [CHECK] Listing out undefined symbols (273 total): 2025-05-07T20:10:49.2514095Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.2515660Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.2516246Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:49.2516633Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.2517085Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.2517689Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.2518126Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:49.2518555Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:49.2518937Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:49.2519359Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.2519751Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:49.2520138Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:49.2520485Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:49.2520854Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:49.2521225Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:49.2521573Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:49.2522163Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:49.2522509Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:49.2522867Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:49.2523244Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:49.2523722Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:49.2524046Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:49.2524407Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:49.2524783Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:49.2525201Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:49.2525663Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:49.2526097Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:49.2526545Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:49.2526921Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:49.2527341Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:49.2527824Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:49.2528465Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:10:49.2529307Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:49.2530209Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.2531459Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.2532465Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:49.2533486Z U at::_ops::sparse_coo_tensor_indices_size::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.2534569Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:49.2535133Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:10:49.2535565Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:49.2536276Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.2537408Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.2538287Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:49.2538695Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:49.2539074Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:49.2539459Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:49.2539949Z U at::get_thread_num() 2025-05-07T20:10:49.2540432Z U at::globalContext() 2025-05-07T20:10:49.2540801Z U at::internal::set_thread_num(int) 2025-05-07T20:10:49.2541220Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:49.2541647Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:10:49.2542130Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:10:49.2542496Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:49.2542832Z U c10::AnyType::get() 2025-05-07T20:10:49.2543247Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.2543747Z U c10::BoolType::get() 2025-05-07T20:10:49.2544147Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:49.2544616Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:49.2545061Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:49.2545822Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:49.2547182Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:49.2548469Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:49.2549058Z U c10::Error::what() const 2025-05-07T20:10:49.2549398Z U c10::FloatType::get() 2025-05-07T20:10:49.2549752Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:49.2550086Z U c10::GradMode::is_enabled() 2025-05-07T20:10:49.2550439Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:49.2550849Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.2551311Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.2551757Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:49.2552167Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:49.2552556Z U c10::IValue::isBoolList() const 2025-05-07T20:10:49.2552893Z U c10::IValue::isIntList() const 2025-05-07T20:10:49.2553360Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:49.2553684Z U c10::IValue::isTensorList() const 2025-05-07T20:10:49.2554055Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:49.2554428Z U c10::IntType::get() 2025-05-07T20:10:49.2554784Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:49.2555194Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:49.2555541Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:49.2555913Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:49.2556353Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:49.2556830Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:49.2557202Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:49.2557717Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:49.2558272Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.2558637Z U c10::StringType::get() 2025-05-07T20:10:49.2559002Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:49.2559415Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:49.2560050Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:49.2560686Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:49.2561037Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:49.2561393Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:49.2561727Z U c10::SymIntType::get() 2025-05-07T20:10:49.2562082Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:49.2562487Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:49.2562904Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.2563334Z U c10::TensorType::get() 2025-05-07T20:10:49.2563814Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:49.2564782Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:49.2565750Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:49.2566103Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:49.2566461Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:49.2566799Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:49.2567150Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:49.2567505Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:49.2567969Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:49.2568620Z U c10::cuda::device_count() 2025-05-07T20:10:49.2569163Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:49.2569737Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:49.2570184Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:49.2570582Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:49.2571049Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:49.2571520Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:49.2572227Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:49.2573344Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:49.2574246Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:49.2575163Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.2576164Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:49.2577222Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.2578111Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:49.2578459Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:49.2579031Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:49.2579674Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:49.2580214Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:49.2580662Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:49.2581060Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:49.2581421Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:49.2581819Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:49.2582460Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:49.2583089Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:49.2583462Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:49.2583919Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:49.2584343Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:49.2584762Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:49.2585159Z U c10::operator<=(c10::SymInt const&, int) 2025-05-07T20:10:49.2585515Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:10:49.2585880Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:10:49.2586248Z U c10::throwNullDataPtrError() 2025-05-07T20:10:49.2586686Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:49.2587007Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:49.2587569Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:49.2588054Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:49.2588410Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:49.2588787Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:49.2589177Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:49.2589535Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:49.2589931Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:49.2590274Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:49.2590620Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:49.2590962Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:49.2591356Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:49.2591719Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:49.2592103Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:49.2592456Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:49.2592791Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:49.2593252Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:49.2593584Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:49.2593934Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:49.2594872Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:49.2596027Z U fbgemm::SparseAdaGradSignature::Type fbgemm::GenerateSparseAdaGrad(int, bool, int, bool) 2025-05-07T20:10:49.2596581Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:10:49.2597009Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:49.2597430Z U float at::Tensor::item() const 2025-05-07T20:10:49.2597790Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.2598172Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.2598528Z U free@GLIBC_2.2.5 2025-05-07T20:10:49.2598824Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.2599200Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.2599596Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:49.2600008Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.2600389Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.2600722Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:49.2601008Z U memcpy@GLIBC_2.14 2025-05-07T20:10:49.2601273Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:49.2601557Z U memset@GLIBC_2.2.5 2025-05-07T20:10:49.2601840Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:49.2602216Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:49.2602755Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.2603452Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.2604355Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.2605102Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.2605863Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.2606621Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.2607139Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:49.2607783Z U split_embedding_codegen_forward_cpu(at::Tensor, at::Tensor, at::Tensor, c10::SymInt, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long) 2025-05-07T20:10:49.2608783Z U split_embedding_codegen_grad_indice_weights_cpu(at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor) 2025-05-07T20:10:49.2609491Z U sqrt@GLIBC_2.2.5 2025-05-07T20:10:49.2609775Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:10:49.2610186Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:49.2610824Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:49.2611637Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:49.2612410Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:49.2613182Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:49.2613741Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:49.2614077Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:49.2614426Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:49.2614789Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.2615175Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.2615603Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:49.2616009Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:49.2616383Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:49.2616840Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:49.2617730Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.2618487Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:49.2618848Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:49.2619196Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:49.2619524Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:49.2619951Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:49.2620517Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.2622776Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.2623277Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:49.2623685Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:49.2624130Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:49.2624815Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:49.2625516Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:49.2625908Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:49.2626223Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:49.2626535Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:49.2626847Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:49.2627699Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:49.2628979Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.2629839Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.2630375Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:49.2630963Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:49.2631599Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:49.2632145Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:49.2632668Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:49.2633361Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:49.2634136Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:49.2634582Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:49.2635069Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:49.2635476Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:49.2635827Z U torch::autograd::Node::metadata() 2025-05-07T20:10:49.2636354Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:49.2637004Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:49.2637680Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:49.2638347Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:49.2638839Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:49.2639416Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:49.2642600Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:49.2645652Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:49.2646097Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:49.2646573Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:49.2647030Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:49.2647759Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:49.2648676Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:49.2649703Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:49.2650507Z U typeinfo for c10::Error 2025-05-07T20:10:49.2651049Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:49.2651489Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:49.2651890Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:49.2652278Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:49.2652673Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:49.2654039Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:10:49.2656328Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:10:49.2657676Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:49.2658137Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:49.2658579Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:49.2659048Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:49.2659490Z U vtable for c10::Error 2025-05-07T20:10:49.2660121Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.2660751Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:49.2661228Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:49.2661715Z U vtable for torch::autograd::Node 2025-05-07T20:10:49.2662162Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:49.2662576Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:49.2662947Z w _ITM_registerTMCloneTable 2025-05-07T20:10:49.2663349Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:49.2663666Z w __gmon_start__ 2025-05-07T20:10:49.2663981Z w __pthread_key_create 2025-05-07T20:10:49.2664305Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:49.2664674Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:49.2665082Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:49.2665643Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:49.2666010Z 2025-05-07T20:10:49.2666155Z linux-vdso.so.1 (0x00007ffcc6fde000) 2025-05-07T20:10:49.2666464Z libc10.so => not found 2025-05-07T20:10:49.2666751Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2667038Z libc10_cuda.so => not found 2025-05-07T20:10:49.2667350Z libnccl.so.2 => not found 2025-05-07T20:10:49.2667622Z libcuda.so.1 => not found 2025-05-07T20:10:49.2668287Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f9746800000) 2025-05-07T20:10:49.2669366Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f9746400000) 2025-05-07T20:10:49.2670503Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f9781d29000) 2025-05-07T20:10:49.2671305Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2671594Z libtorch.so => not found 2025-05-07T20:10:49.2672149Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f9745e00000) 2025-05-07T20:10:49.2673140Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f9744c00000) 2025-05-07T20:10:49.2673825Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2674137Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2674427Z libcudart.so.12 => not found 2025-05-07T20:10:49.2674813Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f974499c000) 2025-05-07T20:10:49.2675258Z libm.so.6 => /lib64/libm.so.6 (0x00007f9747f25000) 2025-05-07T20:10:49.2675684Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f9747ef7000) 2025-05-07T20:10:49.2676120Z libc.so.6 => /lib64/libc.so.6 (0x00007f9744794000) 2025-05-07T20:10:49.2676499Z /lib64/ld-linux-x86-64.so.2 (0x00007f9781ed6000) 2025-05-07T20:10:49.2676876Z libtorch.so => not found 2025-05-07T20:10:49.2677149Z libc10.so => not found 2025-05-07T20:10:49.2677452Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2677742Z libc10_cuda.so => not found 2025-05-07T20:10:49.2678177Z libnccl.so.2 => not found 2025-05-07T20:10:49.2678447Z libcuda.so.1 => not found 2025-05-07T20:10:49.2678754Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2679048Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2679357Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2679663Z libcudart.so.12 => not found 2025-05-07T20:10:49.2679996Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f97467aa000) 2025-05-07T20:10:49.2680390Z libc10.so => not found 2025-05-07T20:10:49.2680654Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2680967Z libc10_cuda.so => not found 2025-05-07T20:10:49.2681277Z libnccl.so.2 => not found 2025-05-07T20:10:49.2681581Z libcuda.so.1 => not found 2025-05-07T20:10:49.2682200Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f9781d14000) 2025-05-07T20:10:49.2682884Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2683243Z libtorch.so => not found 2025-05-07T20:10:49.2683686Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2683998Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2684280Z libcudart.so.12 => not found 2025-05-07T20:10:49.2684579Z libc10.so => not found 2025-05-07T20:10:49.2684838Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2685133Z libc10_cuda.so => not found 2025-05-07T20:10:49.2685405Z libnccl.so.2 => not found 2025-05-07T20:10:49.2685720Z libcuda.so.1 => not found 2025-05-07T20:10:49.2686001Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2686310Z libtorch.so => not found 2025-05-07T20:10:49.2686611Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2686899Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2687253Z libcudart.so.12 => not found 2025-05-07T20:10:49.2687538Z libc10.so => not found 2025-05-07T20:10:49.2687885Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2688165Z libc10_cuda.so => not found 2025-05-07T20:10:49.2688472Z libnccl.so.2 => not found 2025-05-07T20:10:49.2688744Z libcuda.so.1 => not found 2025-05-07T20:10:49.2689301Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f9746733000) 2025-05-07T20:10:49.2689891Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2690212Z libtorch.so => not found 2025-05-07T20:10:49.2690505Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2690791Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2691098Z libtorch.so => not found 2025-05-07T20:10:49.2691365Z libc10.so => not found 2025-05-07T20:10:49.2691655Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2691932Z libc10_cuda.so => not found 2025-05-07T20:10:49.2692232Z libnccl.so.2 => not found 2025-05-07T20:10:49.2692503Z libcuda.so.1 => not found 2025-05-07T20:10:49.2692801Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2693093Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2693402Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2693716Z libcudart.so.12 => not found 2025-05-07T20:10:49.2694002Z libtorch.so => not found 2025-05-07T20:10:49.2694286Z libc10.so => not found 2025-05-07T20:10:49.2694541Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2694846Z libc10_cuda.so => not found 2025-05-07T20:10:49.2695153Z libnccl.so.2 => not found 2025-05-07T20:10:49.2695443Z libcuda.so.1 => not found 2025-05-07T20:10:49.2695720Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2696143Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2696425Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2696836Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f9747ef2000) 2025-05-07T20:10:49.2697257Z libtorch.so => not found 2025-05-07T20:10:49.2697517Z libc10.so => not found 2025-05-07T20:10:49.2697800Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2698066Z libc10_cuda.so => not found 2025-05-07T20:10:49.2698366Z libnccl.so.2 => not found 2025-05-07T20:10:49.2698633Z libcuda.so.1 => not found 2025-05-07T20:10:49.2698927Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2699205Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2699501Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2699914Z librt.so.1 => /lib64/librt.so.1 (0x00007f9747eed000) 2025-05-07T20:10:49.2700353Z 2025-05-07T20:10:49.2700476Z [CHECK] Displaying ELF information: 2025-05-07T20:10:49.2700996Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:49.2701391Z 2025-05-07T20:10:49.2701396Z 2025-05-07T20:10:49.2701569Z Dynamic section at offset 0x38775ba0 contains 45 entries: 2025-05-07T20:10:49.2701995Z Tag Type Name/Value 2025-05-07T20:10:49.2702419Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:49.2703007Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:49.2703567Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:49.2704096Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:49.2704647Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:49.2705187Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:49.2705790Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:49.2706390Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:49.2707009Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:49.2707555Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:49.2708064Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:49.2708620Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:49.2709212Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:49.2709772Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:49.2710316Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:49.2710874Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:49.2711442Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:49.2711959Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:49.2712627Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:49.2713154Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:49.2713773Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:49.2714361Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:49.2714781Z 0x000000000000000c (INIT) 0x652000 2025-05-07T20:10:49.2715164Z 0x000000000000000d (FINI) 0x2f6443c 2025-05-07T20:10:49.2715528Z 0x0000000000000019 (INIT_ARRAY) 0x3871d880 2025-05-07T20:10:49.2715934Z 0x000000000000001b (INIT_ARRAYSZ) 1824 (bytes) 2025-05-07T20:10:49.2716335Z 0x000000000000001a (FINI_ARRAY) 0x3871dfa0 2025-05-07T20:10:49.2716731Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:49.2717200Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:49.2717564Z 0x0000000000000005 (STRTAB) 0x62978 2025-05-07T20:10:49.2717926Z 0x0000000000000006 (SYMTAB) 0x18470 2025-05-07T20:10:49.2718315Z 0x000000000000000a (STRSZ) 5120077 (bytes) 2025-05-07T20:10:49.2718712Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:49.2719063Z 0x0000000000000003 (PLTGOT) 0x38788fe8 2025-05-07T20:10:49.2719459Z 0x0000000000000002 (PLTRELSZ) 63264 (bytes) 2025-05-07T20:10:49.2719808Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:49.2720153Z 0x0000000000000017 (JMPREL) 0x641978 2025-05-07T20:10:49.2720508Z 0x0000000000000007 (RELA) 0x54ae50 2025-05-07T20:10:49.2720867Z 0x0000000000000008 (RELASZ) 1010472 (bytes) 2025-05-07T20:10:49.2721269Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:49.2721615Z 0x000000006ffffffe (VERNEED) 0x54ace0 2025-05-07T20:10:49.2722124Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:10:49.2722443Z 0x000000006ffffff0 (VERSYM) 0x5449c6 2025-05-07T20:10:49.2722986Z 0x000000006ffffff9 (RELACOUNT) 28262 2025-05-07T20:10:49.2723360Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:49.2723669Z 2025-05-07T20:10:49.2723793Z ################################################################################ 2025-05-07T20:10:49.2724028Z 2025-05-07T20:10:49.2724032Z 2025-05-07T20:10:49.2724187Z ################################################################################ 2025-05-07T20:10:49.2724755Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:49.2725329Z [CHECK] Listing out library size: 2025-05-07T20:10:49.2725854Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:49.2726309Z 2025-05-07T20:10:49.2726566Z 328 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:49.2726948Z 2025-05-07T20:10:49.2727425Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:49.2728529Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.2729201Z 2025-05-07T20:10:49.3289356Z GLIBC_2.2.5 2025-05-07T20:10:49.3290453Z GLIBC_2.3 2025-05-07T20:10:49.3291002Z GLIBC_2.14 2025-05-07T20:10:49.3291331Z 2025-05-07T20:10:49.3291344Z 2025-05-07T20:10:49.3292756Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:49.3296049Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.3296838Z 2025-05-07T20:10:49.3927174Z GLIBCXX_3.4 2025-05-07T20:10:49.3927575Z GLIBCXX_3.4.9 2025-05-07T20:10:49.3927836Z GLIBCXX_3.4.11 2025-05-07T20:10:49.3928089Z GLIBCXX_3.4.18 2025-05-07T20:10:49.3928409Z GLIBCXX_3.4.20 2025-05-07T20:10:49.3928695Z GLIBCXX_3.4.21 2025-05-07T20:10:49.3928841Z 2025-05-07T20:10:49.3928846Z 2025-05-07T20:10:49.3947980Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.2aGTPhLAeX.symbols.txt 2025-05-07T20:10:49.3949599Z 2025-05-07T20:10:49.4542954Z 2025-05-07T20:10:49.4581987Z [CHECK] Total Number of symbols: 3739 2025-05-07T20:10:49.4623092Z [CHECK] Number of fbgemm symbols: 551 2025-05-07T20:10:49.4641780Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.r8b3layKCX.usymbols.txt 2025-05-07T20:10:49.4642375Z 2025-05-07T20:10:49.4669293Z 2025-05-07T20:10:49.4697217Z [CHECK] Listing out undefined symbols (178 total): 2025-05-07T20:10:49.4721880Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.4723201Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.4723819Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:49.4724230Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.4724650Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.4725086Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.4725487Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:49.4725912Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:49.4726286Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:49.4726695Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.4727087Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:49.4727418Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:49.4727776Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:49.4728105Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:49.4728461Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:49.4728883Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:49.4729242Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:49.4729605Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:49.4729968Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:49.4730411Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:49.4730855Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:49.4731337Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:49.4731820Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:49.4732728Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.4734118Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.4735179Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:49.4735935Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:49.4736931Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.4738058Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.4738863Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:49.4739303Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:49.4739647Z U at::globalContext() 2025-05-07T20:10:49.4740352Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.4740801Z U c10::BoolType::get() 2025-05-07T20:10:49.4741293Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:49.4741711Z U c10::FloatType::get() 2025-05-07T20:10:49.4742109Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:49.4742563Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.4743019Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:49.4743419Z U c10::IntType::get() 2025-05-07T20:10:49.4743865Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:49.4744294Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:49.4744724Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.4745160Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:49.4745603Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:49.4746053Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:10:49.4746540Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:49.4747266Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:49.4747949Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:49.4748397Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:49.4748793Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:49.4749249Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:49.4749681Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:49.4750077Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:49.4750486Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:49.4750835Z U c10::SymIntType::get() 2025-05-07T20:10:49.4751253Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:49.4751738Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.4752134Z U c10::TensorType::get() 2025-05-07T20:10:49.4752520Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:49.4753607Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:49.4754610Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:49.4755021Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:49.4755421Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:49.4755925Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:49.4756264Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:49.4756629Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:49.4757097Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:49.4773959Z U c10::cuda::device_count() 2025-05-07T20:10:49.4774453Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:49.4774891Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:49.4775372Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:49.4775778Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:49.4776206Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:49.4776597Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:49.4777338Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:49.4778310Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:49.4779141Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.4780443Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:49.4781543Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.4782394Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:49.4782804Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:49.4783228Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:49.4783683Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:49.4784139Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:49.4784512Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:10:49.4784917Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:49.4785316Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:49.4785772Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:49.4786243Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:49.4786638Z U c10::throwNullDataPtrError() 2025-05-07T20:10:49.4787020Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:49.4787368Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:49.4787829Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:49.4788305Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:49.4788680Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:49.4789100Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:49.4789491Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:49.4789894Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:49.4790270Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:49.4790665Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:49.4791027Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:49.4791422Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:49.4791832Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:49.4793737Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:49.4794156Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:49.4794521Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:49.4794917Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:49.4795280Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:49.4795678Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:49.4796086Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:49.4798661Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:10:49.4801202Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:49.4801698Z U float at::Tensor::item() const 2025-05-07T20:10:49.4802089Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.4802663Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.4803104Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.4803548Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.4804016Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:49.4804480Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.4804886Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.4805282Z U memcpy@GLIBC_2.14 2025-05-07T20:10:49.4805590Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:49.4805915Z U memset@GLIBC_2.2.5 2025-05-07T20:10:49.4806236Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:49.4806667Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:49.4807265Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.4808018Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.4808785Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.4809577Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.4810367Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.4811342Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.4812145Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:49.4813035Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:49.4814025Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:49.4814838Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:49.4815505Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:49.4815857Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:49.4816262Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.4816692Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.4817126Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:49.4817574Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:49.4818173Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:49.4819304Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.4820403Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:49.4820780Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:49.4821176Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:49.4821531Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:49.4822196Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.4822878Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.4823380Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:49.4823786Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:49.4824175Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:49.4824541Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:49.4825424Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:49.4826618Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.4827505Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.4828287Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:49.4829331Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:49.4832027Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:49.4836221Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:49.4840302Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:49.4844338Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:49.4848368Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:49.4852351Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:49.4856185Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:10:49.4858162Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:49.4858626Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:49.4859067Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:49.4859826Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.4860724Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:49.4861203Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:49.4861594Z w _ITM_registerTMCloneTable 2025-05-07T20:10:49.4861938Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:49.4862293Z w __gmon_start__ 2025-05-07T20:10:49.4862594Z w __pthread_key_create 2025-05-07T20:10:49.4862949Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:49.4863304Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:49.4863713Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:49.4864271Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:49.4864656Z 2025-05-07T20:10:49.4864865Z linux-vdso.so.1 (0x00007ffdec9ef000) 2025-05-07T20:10:49.4865183Z libc10.so => not found 2025-05-07T20:10:49.4865485Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.4865816Z libc10_cuda.so => not found 2025-05-07T20:10:49.4866129Z libnccl.so.2 => not found 2025-05-07T20:10:49.4866408Z libcuda.so.1 => not found 2025-05-07T20:10:49.4867194Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f966b400000) 2025-05-07T20:10:49.4868009Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.4868303Z libtorch.so => not found 2025-05-07T20:10:49.4868614Z libtorch_cpu.so => not found 2025-05-07T20:10:49.4868898Z libtorch_cuda.so => not found 2025-05-07T20:10:49.4869210Z libcudart.so.12 => not found 2025-05-07T20:10:49.4869563Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f966b19c000) 2025-05-07T20:10:49.4870023Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f96ba224000) 2025-05-07T20:10:49.4870473Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f96ba1f6000) 2025-05-07T20:10:49.4870868Z libc.so.6 => /lib64/libc.so.6 (0x00007f966af94000) 2025-05-07T20:10:49.4871274Z /lib64/ld-linux-x86-64.so.2 (0x00007f96ba282000) 2025-05-07T20:10:49.4871614Z libc10.so => not found 2025-05-07T20:10:49.4871905Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.4872188Z libc10_cuda.so => not found 2025-05-07T20:10:49.4872491Z libnccl.so.2 => not found 2025-05-07T20:10:49.4872765Z libcuda.so.1 => not found 2025-05-07T20:10:49.4873472Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f9669800000) 2025-05-07T20:10:49.4874726Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f9669400000) 2025-05-07T20:10:49.4875814Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f9669259000) 2025-05-07T20:10:49.4876567Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.4876845Z libtorch.so => not found 2025-05-07T20:10:49.4877370Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f9668c00000) 2025-05-07T20:10:49.4878275Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f9667a00000) 2025-05-07T20:10:49.4878918Z libtorch_cpu.so => not found 2025-05-07T20:10:49.4879221Z libtorch_cuda.so => not found 2025-05-07T20:10:49.4879493Z libcudart.so.12 => not found 2025-05-07T20:10:49.4879810Z libm.so.6 => /lib64/libm.so.6 (0x00007f96ba115000) 2025-05-07T20:10:49.4880135Z libtorch.so => not found 2025-05-07T20:10:49.4880586Z libc10.so => not found 2025-05-07T20:10:49.4880846Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.4881142Z libc10_cuda.so => not found 2025-05-07T20:10:49.4881476Z libnccl.so.2 => not found 2025-05-07T20:10:49.4881916Z libcuda.so.1 => not found 2025-05-07T20:10:49.4882264Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.4882559Z libtorch_cpu.so => not found 2025-05-07T20:10:49.4882882Z libtorch_cuda.so => not found 2025-05-07T20:10:49.4883168Z libcudart.so.12 => not found 2025-05-07T20:10:49.4883475Z libc10.so => not found 2025-05-07T20:10:49.4883744Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.4884060Z libc10_cuda.so => not found 2025-05-07T20:10:49.4884334Z libnccl.so.2 => not found 2025-05-07T20:10:49.4884628Z libcuda.so.1 => not found 2025-05-07T20:10:49.4885281Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f96a51f5000) 2025-05-07T20:10:49.4885950Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.4886275Z libtorch.so => not found 2025-05-07T20:10:49.4886551Z libtorch_cpu.so => not found 2025-05-07T20:10:49.4886867Z libtorch_cuda.so => not found 2025-05-07T20:10:49.4887164Z libcudart.so.12 => not found 2025-05-07T20:10:49.4887475Z libc10.so => not found 2025-05-07T20:10:49.4887742Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.4888052Z libc10_cuda.so => not found 2025-05-07T20:10:49.4888369Z libnccl.so.2 => not found 2025-05-07T20:10:49.4888666Z libcuda.so.1 => not found 2025-05-07T20:10:49.4888971Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.4889257Z libtorch.so => not found 2025-05-07T20:10:49.4889556Z libtorch_cpu.so => not found 2025-05-07T20:10:49.4889839Z libtorch_cuda.so => not found 2025-05-07T20:10:49.4890149Z libcudart.so.12 => not found 2025-05-07T20:10:49.4890427Z libc10.so => not found 2025-05-07T20:10:49.4890708Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.4890985Z libc10_cuda.so => not found 2025-05-07T20:10:49.4891288Z libnccl.so.2 => not found 2025-05-07T20:10:49.4891564Z libcuda.so.1 => not found 2025-05-07T20:10:49.4892140Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f96a5178000) 2025-05-07T20:10:49.4892869Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.4893151Z libtorch.so => not found 2025-05-07T20:10:49.4893440Z libtorch_cpu.so => not found 2025-05-07T20:10:49.4893836Z libtorch_cuda.so => not found 2025-05-07T20:10:49.4894128Z libtorch.so => not found 2025-05-07T20:10:49.4894377Z libc10.so => not found 2025-05-07T20:10:49.4894636Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.4894892Z libc10_cuda.so => not found 2025-05-07T20:10:49.4895169Z libnccl.so.2 => not found 2025-05-07T20:10:49.4895427Z libcuda.so.1 => not found 2025-05-07T20:10:49.4895712Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.4896001Z libtorch_cpu.so => not found 2025-05-07T20:10:49.4896256Z libtorch_cuda.so => not found 2025-05-07T20:10:49.4896539Z libcudart.so.12 => not found 2025-05-07T20:10:49.4896807Z libtorch.so => not found 2025-05-07T20:10:49.4897112Z libc10.so => not found 2025-05-07T20:10:49.4897359Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.4897639Z libc10_cuda.so => not found 2025-05-07T20:10:49.4897896Z libnccl.so.2 => not found 2025-05-07T20:10:49.4898325Z libcuda.so.1 => not found 2025-05-07T20:10:49.4898588Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.4898884Z libtorch_cpu.so => not found 2025-05-07T20:10:49.4899167Z libtorch_cuda.so => not found 2025-05-07T20:10:49.4899484Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f96a516b000) 2025-05-07T20:10:49.4899944Z libtorch.so => not found 2025-05-07T20:10:49.4900359Z libc10.so => not found 2025-05-07T20:10:49.4900606Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.4900940Z libc10_cuda.so => not found 2025-05-07T20:10:49.4901205Z libnccl.so.2 => not found 2025-05-07T20:10:49.4901449Z libcuda.so.1 => not found 2025-05-07T20:10:49.4901721Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.4902000Z libtorch_cpu.so => not found 2025-05-07T20:10:49.4902295Z libtorch_cuda.so => not found 2025-05-07T20:10:49.4902628Z librt.so.1 => /lib64/librt.so.1 (0x00007f96a5162000) 2025-05-07T20:10:49.4902952Z 2025-05-07T20:10:49.4903069Z [CHECK] Displaying ELF information: 2025-05-07T20:10:49.4903591Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:49.4904003Z 2025-05-07T20:10:49.4904007Z 2025-05-07T20:10:49.4904177Z Dynamic section at offset 0x147859a8 contains 41 entries: 2025-05-07T20:10:49.4904599Z Tag Type Name/Value 2025-05-07T20:10:49.4905050Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:49.4905570Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:49.4906123Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:49.4906656Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:49.4907194Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:49.4907771Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:49.4908387Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:49.4908940Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:49.4909494Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:49.4910046Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:49.4910583Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:49.4911145Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:49.4911667Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:49.4912215Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:49.4912762Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:49.4913293Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:49.4913932Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_vbe.so] 2025-05-07T20:10:49.4914736Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:49.4915151Z 0x000000000000000c (INIT) 0x1dc000 2025-05-07T20:10:49.4915476Z 0x000000000000000d (FINI) 0xe754cc 2025-05-07T20:10:49.4915827Z 0x0000000000000019 (INIT_ARRAY) 0x1476a588 2025-05-07T20:10:49.4916194Z 0x000000000000001b (INIT_ARRAYSZ) 680 (bytes) 2025-05-07T20:10:49.4916567Z 0x000000000000001a (FINI_ARRAY) 0x1476a830 2025-05-07T20:10:49.4916929Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:49.4917260Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:49.4917604Z 0x0000000000000005 (STRTAB) 0x1c8a0 2025-05-07T20:10:49.4917961Z 0x0000000000000006 (SYMTAB) 0x6a00 2025-05-07T20:10:49.4918332Z 0x000000000000000a (STRSZ) 1486798 (bytes) 2025-05-07T20:10:49.4918706Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:49.4919043Z 0x0000000000000003 (PLTGOT) 0x1478afe8 2025-05-07T20:10:49.4919417Z 0x0000000000000002 (PLTRELSZ) 22152 (bytes) 2025-05-07T20:10:49.4919757Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:49.4920097Z 0x0000000000000017 (JMPREL) 0x1d5988 2025-05-07T20:10:49.4920423Z 0x0000000000000007 (RELA) 0x1896c8 2025-05-07T20:10:49.4920974Z 0x0000000000000008 (RELASZ) 312000 (bytes) 2025-05-07T20:10:49.4921341Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:49.4921752Z 0x000000006ffffffe (VERNEED) 0x1895a8 2025-05-07T20:10:49.4922470Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:49.4922841Z 0x000000006ffffff0 (VERSYM) 0x18786e 2025-05-07T20:10:49.4923300Z 0x000000006ffffff9 (RELACOUNT) 8035 2025-05-07T20:10:49.4923808Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:49.4924044Z 2025-05-07T20:10:49.4924166Z ################################################################################ 2025-05-07T20:10:49.4924407Z 2025-05-07T20:10:49.4924411Z 2025-05-07T20:10:49.4924529Z ################################################################################ 2025-05-07T20:10:49.4925111Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:49.4925684Z [CHECK] Listing out library size: 2025-05-07T20:10:49.4926199Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:49.4926646Z 2025-05-07T20:10:49.4926901Z 142 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:49.4927269Z 2025-05-07T20:10:49.4927718Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:49.4928839Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.4929520Z 2025-05-07T20:10:49.5110666Z GLIBC_2.2.5 2025-05-07T20:10:49.5111320Z GLIBC_2.3 2025-05-07T20:10:49.5111917Z GLIBC_2.14 2025-05-07T20:10:49.5112249Z 2025-05-07T20:10:49.5112262Z 2025-05-07T20:10:49.5113679Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:49.5116376Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.5117146Z 2025-05-07T20:10:49.5383066Z GLIBCXX_3.4 2025-05-07T20:10:49.5383712Z GLIBCXX_3.4.9 2025-05-07T20:10:49.5383964Z GLIBCXX_3.4.11 2025-05-07T20:10:49.5384219Z GLIBCXX_3.4.18 2025-05-07T20:10:49.5384447Z GLIBCXX_3.4.20 2025-05-07T20:10:49.5384696Z GLIBCXX_3.4.21 2025-05-07T20:10:49.5384841Z 2025-05-07T20:10:49.5384846Z 2025-05-07T20:10:49.5406838Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.KHsyKYgESz.symbols.txt 2025-05-07T20:10:49.5408528Z 2025-05-07T20:10:49.5649840Z 2025-05-07T20:10:49.5678955Z [CHECK] Total Number of symbols: 1629 2025-05-07T20:10:49.5717792Z [CHECK] Number of fbgemm symbols: 227 2025-05-07T20:10:49.5740701Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.0P0xj0yd4e.usymbols.txt 2025-05-07T20:10:49.5741366Z 2025-05-07T20:10:49.5769665Z 2025-05-07T20:10:49.5809865Z [CHECK] Listing out undefined symbols (171 total): 2025-05-07T20:10:49.5826189Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.5827207Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.5827822Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:49.5828296Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.5828764Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.5829174Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.5829603Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:49.5830010Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:49.5830424Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:49.5830833Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.5831205Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:49.5831564Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:49.5831901Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:49.5832262Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:49.5832662Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:49.5833041Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:49.5833379Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:49.5833757Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:49.5834205Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:49.5834644Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:49.5835135Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:49.5835615Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:49.5836507Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.5837877Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.5838899Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:49.5839552Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:49.5840500Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.5841675Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.5842570Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:49.5843031Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:49.5843397Z U at::globalContext() 2025-05-07T20:10:49.5843844Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.5844285Z U c10::BoolType::get() 2025-05-07T20:10:49.5844688Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:49.5845220Z U c10::FloatType::get() 2025-05-07T20:10:49.5845594Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:49.5846028Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.5846468Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:49.5846849Z U c10::IntType::get() 2025-05-07T20:10:49.5847257Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:49.5847702Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:49.5848125Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.5848539Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:49.5848964Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:49.5849620Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:49.5850302Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:49.5850687Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:49.5851001Z U c10::SymIntType::get() 2025-05-07T20:10:49.5851558Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:49.5851990Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.5852380Z U c10::TensorType::get() 2025-05-07T20:10:49.5852881Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:49.5853819Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:49.5854786Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:49.5855153Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:49.5855527Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:49.5855908Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:49.5856263Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:49.5856633Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:49.5857106Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:49.5857605Z U c10::cuda::device_count() 2025-05-07T20:10:49.5857984Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:49.5858371Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:49.5858826Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:49.5859225Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:49.5859667Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:49.5860338Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:49.5861120Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:49.5862046Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:49.5862927Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.5863929Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:49.5865001Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.5865834Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:49.5866247Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:49.5866654Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:49.5867099Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:49.5867565Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:49.5867955Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:49.5868395Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:49.5868777Z U c10::throwNullDataPtrError() 2025-05-07T20:10:49.5869153Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:49.5869513Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:49.5869951Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:49.5870433Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:49.5870799Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:49.5871193Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:49.5871558Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:49.5871950Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:49.5872328Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:49.5872705Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:49.5873058Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:49.5873428Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:49.5873829Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:49.5874211Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:49.5874627Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:49.5875129Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:49.5875480Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:49.5875842Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:49.5876215Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:49.5876605Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:49.5879139Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:10:49.5881538Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:49.5882143Z U float at::Tensor::item() const 2025-05-07T20:10:49.5882534Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.5882960Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.5883541Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.5883963Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.5884417Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:49.5884852Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.5885281Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.5885650Z U memcpy@GLIBC_2.14 2025-05-07T20:10:49.5885977Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:49.5886280Z U memset@GLIBC_2.2.5 2025-05-07T20:10:49.5886664Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:49.5887016Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:49.5887618Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.5888434Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.5889204Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.5890008Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.5890828Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:49.5891693Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:49.5892548Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:49.5893377Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:49.5894032Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:49.5894396Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:49.5894772Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.5895320Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.5895742Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:49.5896188Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:49.5896696Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:49.5897789Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.5898674Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:49.5899055Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:49.5899410Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:49.5899863Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:49.5900427Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.5901011Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.5901520Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:49.5901872Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:49.5902198Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:49.5902524Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:49.5903372Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:49.5904567Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.5905417Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.5906167Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:49.5907263Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:49.5909401Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.5912428Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.5915370Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.5918368Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.5921493Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.5924578Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.5928142Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:49.5932311Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:49.5936477Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:49.5940607Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:49.5944554Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:49.5948527Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:49.5952340Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:10:49.5954320Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:49.5954764Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:49.5955241Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:49.5955841Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.5956531Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:49.5956982Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:49.5957336Z w _ITM_registerTMCloneTable 2025-05-07T20:10:49.5957660Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:49.5957966Z w __gmon_start__ 2025-05-07T20:10:49.5958266Z w __pthread_key_create 2025-05-07T20:10:49.5958574Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:49.5958924Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:49.5959285Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:49.5959814Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:49.5960186Z 2025-05-07T20:10:49.5960310Z linux-vdso.so.1 (0x00007ffc1f926000) 2025-05-07T20:10:49.5960603Z libc10.so => not found 2025-05-07T20:10:49.5960865Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.5961125Z libc10_cuda.so => not found 2025-05-07T20:10:49.5961409Z libnccl.so.2 => not found 2025-05-07T20:10:49.5961731Z libcuda.so.1 => not found 2025-05-07T20:10:49.5962496Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f83cca00000) 2025-05-07T20:10:49.5963284Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.5965099Z libtorch.so => not found 2025-05-07T20:10:49.5965388Z libtorch_cpu.so => not found 2025-05-07T20:10:49.5965659Z libtorch_cuda.so => not found 2025-05-07T20:10:49.5965949Z libcudart.so.12 => not found 2025-05-07T20:10:49.5966286Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f83cc79c000) 2025-05-07T20:10:49.5966724Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f840f9a7000) 2025-05-07T20:10:49.5967178Z libc.so.6 => /lib64/libc.so.6 (0x00007f83cc594000) 2025-05-07T20:10:49.5967557Z /lib64/ld-linux-x86-64.so.2 (0x00007f840f9dd000) 2025-05-07T20:10:49.5967935Z libc10.so => not found 2025-05-07T20:10:49.5968194Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.5968296Z libc10_cuda.so => not found 2025-05-07T20:10:49.5968406Z libnccl.so.2 => not found 2025-05-07T20:10:49.5968499Z libcuda.so.1 => not found 2025-05-07T20:10:49.5968985Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f83cae00000) 2025-05-07T20:10:49.5969456Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f83caa00000) 2025-05-07T20:10:49.5970060Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f840f7fe000) 2025-05-07T20:10:49.5970170Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.5970264Z libtorch.so => not found 2025-05-07T20:10:49.5970634Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f83ca400000) 2025-05-07T20:10:49.5971098Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f83c9200000) 2025-05-07T20:10:49.5971205Z libtorch_cpu.so => not found 2025-05-07T20:10:49.5971312Z libtorch_cuda.so => not found 2025-05-07T20:10:49.5971413Z libcudart.so.12 => not found 2025-05-07T20:10:49.5971552Z libm.so.6 => /lib64/libm.so.6 (0x00007f8406725000) 2025-05-07T20:10:49.5971672Z libtorch.so => not found 2025-05-07T20:10:49.5971772Z libc10.so => not found 2025-05-07T20:10:49.5971878Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.5971981Z libc10_cuda.so => not found 2025-05-07T20:10:49.5972089Z libnccl.so.2 => not found 2025-05-07T20:10:49.5972225Z libcuda.so.1 => not found 2025-05-07T20:10:49.5972338Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.5972455Z libtorch_cpu.so => not found 2025-05-07T20:10:49.5972627Z libtorch_cuda.so => not found 2025-05-07T20:10:49.5972735Z libcudart.so.12 => not found 2025-05-07T20:10:49.5972948Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f83cc53e000) 2025-05-07T20:10:49.5973064Z libc10.so => not found 2025-05-07T20:10:49.5973171Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.5973273Z libc10_cuda.so => not found 2025-05-07T20:10:49.5973390Z libnccl.so.2 => not found 2025-05-07T20:10:49.5973492Z libcuda.so.1 => not found 2025-05-07T20:10:49.5973947Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f840671a000) 2025-05-07T20:10:49.5974047Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.5974170Z libtorch.so => not found 2025-05-07T20:10:49.5974277Z libtorch_cpu.so => not found 2025-05-07T20:10:49.5974379Z libtorch_cuda.so => not found 2025-05-07T20:10:49.5974496Z libcudart.so.12 => not found 2025-05-07T20:10:49.5974596Z libc10.so => not found 2025-05-07T20:10:49.5974702Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.5974795Z libc10_cuda.so => not found 2025-05-07T20:10:49.5974910Z libnccl.so.2 => not found 2025-05-07T20:10:49.5975010Z libcuda.so.1 => not found 2025-05-07T20:10:49.5975148Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.5975261Z libtorch.so => not found 2025-05-07T20:10:49.5975362Z libtorch_cpu.so => not found 2025-05-07T20:10:49.5975466Z libtorch_cuda.so => not found 2025-05-07T20:10:49.5975559Z libcudart.so.12 => not found 2025-05-07T20:10:49.5975710Z libc10.so => not found 2025-05-07T20:10:49.5975822Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.5975920Z libc10_cuda.so => not found 2025-05-07T20:10:49.5976042Z libnccl.so.2 => not found 2025-05-07T20:10:49.5976147Z libcuda.so.1 => not found 2025-05-07T20:10:49.5976505Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f83cad89000) 2025-05-07T20:10:49.5976613Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.5976803Z libtorch.so => not found 2025-05-07T20:10:49.5976913Z libtorch_cpu.so => not found 2025-05-07T20:10:49.5977009Z libtorch_cuda.so => not found 2025-05-07T20:10:49.5977133Z libtorch.so => not found 2025-05-07T20:10:49.5977234Z libc10.so => not found 2025-05-07T20:10:49.5977332Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.5977429Z libc10_cuda.so => not found 2025-05-07T20:10:49.5977557Z libnccl.so.2 => not found 2025-05-07T20:10:49.5977656Z libcuda.so.1 => not found 2025-05-07T20:10:49.5977758Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.5977881Z libtorch_cpu.so => not found 2025-05-07T20:10:49.5978019Z libtorch_cuda.so => not found 2025-05-07T20:10:49.5978115Z libcudart.so.12 => not found 2025-05-07T20:10:49.5978213Z libtorch.so => not found 2025-05-07T20:10:49.5978338Z libc10.so => not found 2025-05-07T20:10:49.5978444Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.5978547Z libc10_cuda.so => not found 2025-05-07T20:10:49.5978675Z libnccl.so.2 => not found 2025-05-07T20:10:49.5978779Z libcuda.so.1 => not found 2025-05-07T20:10:49.5978889Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.5978995Z libtorch_cpu.so => not found 2025-05-07T20:10:49.5979128Z libtorch_cuda.so => not found 2025-05-07T20:10:49.5979315Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f840670d000) 2025-05-07T20:10:49.5979418Z libtorch.so => not found 2025-05-07T20:10:49.5979540Z libc10.so => not found 2025-05-07T20:10:49.5979645Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.5979833Z libc10_cuda.so => not found 2025-05-07T20:10:49.5979953Z libnccl.so.2 => not found 2025-05-07T20:10:49.5980081Z libcuda.so.1 => not found 2025-05-07T20:10:49.5980194Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.5980301Z libtorch_cpu.so => not found 2025-05-07T20:10:49.5980434Z libtorch_cuda.so => not found 2025-05-07T20:10:49.5980631Z librt.so.1 => /lib64/librt.so.1 (0x00007f8406704000) 2025-05-07T20:10:49.5980637Z 2025-05-07T20:10:49.5980805Z [CHECK] Displaying ELF information: 2025-05-07T20:10:49.5981133Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:49.5981138Z 2025-05-07T20:10:49.5981170Z 2025-05-07T20:10:49.5981345Z Dynamic section at offset 0x8d68cc8 contains 40 entries: 2025-05-07T20:10:49.5981470Z Tag Type Name/Value 2025-05-07T20:10:49.5981699Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:49.5981917Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:49.5982126Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:49.5982362Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:49.5982566Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:49.5982837Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:49.5983089Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:49.5983296Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:49.5983536Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:49.5983751Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:49.5983985Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:49.5984195Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:49.5984425Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:49.5984652Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:49.5984874Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:49.5985159Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_gwd.so] 2025-05-07T20:10:49.5985370Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:49.5985500Z 0x000000000000000c (INIT) 0xbe000 2025-05-07T20:10:49.5985626Z 0x000000000000000d (FINI) 0x5f04ec 2025-05-07T20:10:49.5985785Z 0x0000000000000019 (INIT_ARRAY) 0x8d5ea18 2025-05-07T20:10:49.5985920Z 0x000000000000001b (INIT_ARRAYSZ) 200 (bytes) 2025-05-07T20:10:49.5986052Z 0x000000000000001a (FINI_ARRAY) 0x8d5eae0 2025-05-07T20:10:49.5986179Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:49.5986327Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:49.5986475Z 0x0000000000000005 (STRTAB) 0xc600 2025-05-07T20:10:49.5986594Z 0x0000000000000006 (SYMTAB) 0x2d30 2025-05-07T20:10:49.5986763Z 0x000000000000000a (STRSZ) 597451 (bytes) 2025-05-07T20:10:49.5986895Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:49.5987024Z 0x0000000000000003 (PLTGOT) 0x8d6afe8 2025-05-07T20:10:49.5987198Z 0x0000000000000002 (PLTRELSZ) 12672 (bytes) 2025-05-07T20:10:49.5987316Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:49.5987441Z 0x0000000000000017 (JMPREL) 0xbab38 2025-05-07T20:10:49.5987561Z 0x0000000000000007 (RELA) 0x9f1a8 2025-05-07T20:10:49.5987729Z 0x0000000000000008 (RELASZ) 113040 (bytes) 2025-05-07T20:10:49.5987859Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:49.5987984Z 0x000000006ffffffe (VERNEED) 0x9f088 2025-05-07T20:10:49.5988134Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:49.5988258Z 0x000000006ffffff0 (VERSYM) 0x9e3cc 2025-05-07T20:10:49.5988380Z 0x000000006ffffff9 (RELACOUNT) 3303 2025-05-07T20:10:49.5988491Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:49.5988546Z 2025-05-07T20:10:49.5988669Z ################################################################################ 2025-05-07T20:10:49.5988674Z 2025-05-07T20:10:49.5988678Z 2025-05-07T20:10:49.5988796Z ################################################################################ 2025-05-07T20:10:49.5989169Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:49.5989280Z [CHECK] Listing out library size: 2025-05-07T20:10:49.5989611Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:49.5989615Z 2025-05-07T20:10:49.5989877Z 59 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:49.5989907Z 2025-05-07T20:10:49.5990361Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:49.5990917Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.5990924Z 2025-05-07T20:10:49.6135199Z GLIBC_2.2.5 2025-05-07T20:10:49.6135311Z GLIBC_2.3 2025-05-07T20:10:49.6135403Z GLIBC_2.14 2025-05-07T20:10:49.6140147Z 2025-05-07T20:10:49.6140153Z 2025-05-07T20:10:49.6140873Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:49.6141522Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.6141528Z 2025-05-07T20:10:49.6318112Z GLIBCXX_3.4 2025-05-07T20:10:49.6318262Z GLIBCXX_3.4.9 2025-05-07T20:10:49.6318360Z GLIBCXX_3.4.11 2025-05-07T20:10:49.6318454Z GLIBCXX_3.4.15 2025-05-07T20:10:49.6318576Z GLIBCXX_3.4.18 2025-05-07T20:10:49.6318669Z GLIBCXX_3.4.20 2025-05-07T20:10:49.6318774Z GLIBCXX_3.4.21 2025-05-07T20:10:49.6318781Z 2025-05-07T20:10:49.6318785Z 2025-05-07T20:10:49.6342192Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.M4vUFQxwXW.symbols.txt 2025-05-07T20:10:49.6342212Z 2025-05-07T20:10:49.6462869Z 2025-05-07T20:10:49.6489914Z [CHECK] Total Number of symbols: 1874 2025-05-07T20:10:49.6515827Z [CHECK] Number of fbgemm symbols: 100 2025-05-07T20:10:49.6534397Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.q1gS3Y0Viz.usymbols.txt 2025-05-07T20:10:49.6534408Z 2025-05-07T20:10:49.6565589Z 2025-05-07T20:10:49.6596737Z [CHECK] Listing out undefined symbols (259 total): 2025-05-07T20:10:49.6612610Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.6613931Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.6614244Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:49.6614703Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.6615123Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.6615503Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.6615945Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:49.6616379Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:49.6616584Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:49.6616733Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.6616862Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:49.6616996Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:49.6617109Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:49.6617223Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:49.6617411Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:49.6617553Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:49.6617666Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:49.6617779Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:49.6617917Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:49.6618035Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:49.6618141Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:49.6618290Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:49.6618400Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:49.6618522Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:49.6618682Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:49.6618889Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:49.6619031Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:49.6619156Z U at::RecordFunction::end() 2025-05-07T20:10:49.6619314Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:49.6619472Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:49.6619727Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:49.6620010Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:49.6620678Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.6621340Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.6621539Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:49.6622214Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.6622831Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.6622981Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:49.6623120Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:49.6623318Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:49.6623481Z U at::globalContext() 2025-05-07T20:10:49.6623619Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:49.6623726Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:49.6623863Z U c10::AnyType::get() 2025-05-07T20:10:49.6624080Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.6624189Z U c10::BoolType::get() 2025-05-07T20:10:49.6624383Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:49.6624576Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:49.6624692Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:49.6625235Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:49.6625871Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:49.6626297Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:49.6626408Z U c10::Error::what() const 2025-05-07T20:10:49.6626508Z U c10::FloatType::get() 2025-05-07T20:10:49.6626616Z U c10::GradMode::is_enabled() 2025-05-07T20:10:49.6626748Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:49.6626925Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.6627078Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:49.6627211Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:49.6627323Z U c10::IValue::isBoolList() const 2025-05-07T20:10:49.6627430Z U c10::IValue::isIntList() const 2025-05-07T20:10:49.6627549Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:49.6627663Z U c10::IValue::isTensorList() const 2025-05-07T20:10:49.6627805Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:49.6627916Z U c10::IntType::get() 2025-05-07T20:10:49.6628086Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:49.6628207Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:49.6628379Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:49.6628504Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:49.6628733Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:49.6629063Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:49.6629230Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.6629334Z U c10::StringType::get() 2025-05-07T20:10:49.6629480Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:49.6629636Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:49.6629806Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:10:49.6629951Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:49.6630123Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:10:49.6630531Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:49.6630672Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:49.6630810Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:10:49.6630996Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:49.6631115Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:49.6631257Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:49.6631387Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:49.6631514Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:49.6631639Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:49.6631743Z U c10::SymIntType::get() 2025-05-07T20:10:49.6631897Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:49.6632033Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:49.6632190Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.6632295Z U c10::TensorType::get() 2025-05-07T20:10:49.6632434Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:49.6633155Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:49.6633317Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:49.6633453Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:49.6633571Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:49.6633685Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:49.6633820Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:49.6633930Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:49.6634180Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:49.6634299Z U c10::cuda::device_count() 2025-05-07T20:10:49.6634552Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:49.6634676Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:49.6634825Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:49.6634993Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:49.6635145Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:49.6635253Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:49.6635703Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:49.6636209Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:49.6636472Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:49.6636940Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.6637258Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:49.6637831Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.6637951Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:49.6638058Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:49.6638386Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:49.6638568Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:49.6638739Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:49.6638914Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:49.6639035Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:49.6639159Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:49.6639317Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:49.6639665Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:49.6639781Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:49.6639922Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:49.6640049Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:49.6640199Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:49.6640346Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:10:49.6640473Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:49.6640603Z U c10::throwNullDataPtrError() 2025-05-07T20:10:49.6640708Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:49.6640812Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:49.6640996Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:49.6641119Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:49.6641260Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:49.6641383Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:49.6641520Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:49.6641652Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:49.6641773Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:49.6641895Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:49.6642027Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:49.6642174Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:49.6642459Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:49.6642604Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:49.6642742Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:49.6642887Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:49.6643005Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:49.6643142Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:49.6643276Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:49.6643418Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:49.6645614Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:10:49.6645885Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:10:49.6646043Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.6646201Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.6646297Z U free@GLIBC_2.2.5 2025-05-07T20:10:49.6646464Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.6646602Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.6646774Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:49.6646931Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.6647080Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.6647184Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:49.6647284Z U memcpy@GLIBC_2.14 2025-05-07T20:10:49.6647400Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:49.6647501Z U memset@GLIBC_2.2.5 2025-05-07T20:10:49.6647620Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:49.6647762Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:49.6648090Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.6648423Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.6648570Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:49.6648780Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:49.6649117Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:49.6649517Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:49.6649836Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:49.6650212Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:49.6650599Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:49.6650722Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:49.6650841Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:49.6651003Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.6651144Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.6651342Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:49.6651498Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:49.6651635Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:49.6651899Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:49.6652496Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.6652625Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:49.6652746Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:49.6652890Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:49.6653009Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:49.6653127Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:49.6653341Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.6653578Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.6653705Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:49.6653892Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:49.6654167Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:49.6654561Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:49.6654721Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:49.6654828Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:49.6654926Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:49.6655046Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:49.6655163Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:49.6655718Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:49.6656171Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.6656416Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.6656560Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:49.6656862Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:49.6657041Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:49.6657229Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:49.6657418Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:49.6657748Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:49.6657901Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:49.6658105Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:49.6658285Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:49.6658412Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:49.6658541Z U torch::autograd::Node::metadata() 2025-05-07T20:10:49.6658676Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:49.6658940Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:49.6659214Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:49.6659355Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:49.6659674Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:49.6659995Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:49.6662836Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:49.6663025Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:49.6663216Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:49.6663381Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:49.6664207Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:10:49.6664378Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:49.6664815Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:49.6665185Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:49.6665749Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:49.6665896Z U typeinfo for c10::Error 2025-05-07T20:10:49.6666046Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:49.6666184Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:49.6666336Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:49.6666471Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:49.6666596Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:49.6668094Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.6669628Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.6671184Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.6672593Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.6674049Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.6675505Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.6675694Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:49.6675865Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:49.6676049Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:49.6676210Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:49.6676323Z U vtable for c10::Error 2025-05-07T20:10:49.6676692Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.6676838Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:49.6677073Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:49.6677226Z U vtable for torch::autograd::Node 2025-05-07T20:10:49.6677411Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:49.6677556Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:49.6677665Z w _ITM_registerTMCloneTable 2025-05-07T20:10:49.6677801Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:49.6677895Z w __gmon_start__ 2025-05-07T20:10:49.6677995Z w __pthread_key_create 2025-05-07T20:10:49.6678143Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:49.6678266Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:49.6678418Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:49.6678714Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:49.6678724Z 2025-05-07T20:10:49.6678877Z linux-vdso.so.1 (0x00007ffd6a7d9000) 2025-05-07T20:10:49.6678974Z libc10.so => not found 2025-05-07T20:10:49.6679104Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.6679207Z libc10_cuda.so => not found 2025-05-07T20:10:49.6679309Z libnccl.so.2 => not found 2025-05-07T20:10:49.6679421Z libcuda.so.1 => not found 2025-05-07T20:10:49.6680009Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f0c6b600000) 2025-05-07T20:10:49.6680122Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.6680246Z libtorch.so => not found 2025-05-07T20:10:49.6680380Z libtorch_cpu.so => not found 2025-05-07T20:10:49.6680489Z libtorch_cuda.so => not found 2025-05-07T20:10:49.6680594Z libcudart.so.12 => not found 2025-05-07T20:10:49.6680791Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f0c6b39c000) 2025-05-07T20:10:49.6680976Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f0ca91aa000) 2025-05-07T20:10:49.6681136Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f0ca917c000) 2025-05-07T20:10:49.6681289Z libc.so.6 => /lib64/libc.so.6 (0x00007f0c6b194000) 2025-05-07T20:10:49.6681429Z /lib64/ld-linux-x86-64.so.2 (0x00007f0ca9208000) 2025-05-07T20:10:49.6681531Z libc10.so => not found 2025-05-07T20:10:49.6681638Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.6681755Z libc10_cuda.so => not found 2025-05-07T20:10:49.6681853Z libnccl.so.2 => not found 2025-05-07T20:10:49.6681953Z libcuda.so.1 => not found 2025-05-07T20:10:49.6682446Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f0c69a00000) 2025-05-07T20:10:49.6682917Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f0c69600000) 2025-05-07T20:10:49.6683474Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f0c69459000) 2025-05-07T20:10:49.6683624Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.6683716Z libtorch.so => not found 2025-05-07T20:10:49.6684082Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f0c68e00000) 2025-05-07T20:10:49.6684569Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f0c67c00000) 2025-05-07T20:10:49.6684671Z libtorch_cpu.so => not found 2025-05-07T20:10:49.6684773Z libtorch_cuda.so => not found 2025-05-07T20:10:49.6684873Z libcudart.so.12 => not found 2025-05-07T20:10:49.6685028Z libm.so.6 => /lib64/libm.so.6 (0x00007f0ca909b000) 2025-05-07T20:10:49.6685127Z libtorch.so => not found 2025-05-07T20:10:49.6685214Z libc10.so => not found 2025-05-07T20:10:49.6685333Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.6685431Z libc10_cuda.so => not found 2025-05-07T20:10:49.6685527Z libnccl.so.2 => not found 2025-05-07T20:10:49.6685623Z libcuda.so.1 => not found 2025-05-07T20:10:49.6685749Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.6685848Z libtorch_cpu.so => not found 2025-05-07T20:10:49.6685947Z libtorch_cuda.so => not found 2025-05-07T20:10:49.6686072Z libcudart.so.12 => not found 2025-05-07T20:10:49.6686193Z libc10.so => not found 2025-05-07T20:10:49.6686290Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.6686384Z libc10_cuda.so => not found 2025-05-07T20:10:49.6686501Z libnccl.so.2 => not found 2025-05-07T20:10:49.6686600Z libcuda.so.1 => not found 2025-05-07T20:10:49.6687046Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f0ca908a000) 2025-05-07T20:10:49.6687169Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.6687387Z libtorch.so => not found 2025-05-07T20:10:49.6687489Z libtorch_cpu.so => not found 2025-05-07T20:10:49.6687609Z libtorch_cuda.so => not found 2025-05-07T20:10:49.6687715Z libcudart.so.12 => not found 2025-05-07T20:10:49.6687804Z libc10.so => not found 2025-05-07T20:10:49.6687911Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.6688030Z libc10_cuda.so => not found 2025-05-07T20:10:49.6688127Z libnccl.so.2 => not found 2025-05-07T20:10:49.6688231Z libcuda.so.1 => not found 2025-05-07T20:10:49.6688343Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.6688437Z libtorch.so => not found 2025-05-07T20:10:49.6688540Z libtorch_cpu.so => not found 2025-05-07T20:10:49.6688638Z libtorch_cuda.so => not found 2025-05-07T20:10:49.6688751Z libcudart.so.12 => not found 2025-05-07T20:10:49.6688840Z libc10.so => not found 2025-05-07T20:10:49.6688968Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.6689070Z libc10_cuda.so => not found 2025-05-07T20:10:49.6689166Z libnccl.so.2 => not found 2025-05-07T20:10:49.6689261Z libcuda.so.1 => not found 2025-05-07T20:10:49.6689620Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f0ca900b000) 2025-05-07T20:10:49.6689757Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.6689861Z libtorch.so => not found 2025-05-07T20:10:49.6689962Z libtorch_cpu.so => not found 2025-05-07T20:10:49.6690079Z libtorch_cuda.so => not found 2025-05-07T20:10:49.6690170Z libtorch.so => not found 2025-05-07T20:10:49.6690268Z libc10.so => not found 2025-05-07T20:10:49.6690368Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.6690478Z libc10_cuda.so => not found 2025-05-07T20:10:49.6690576Z libnccl.so.2 => not found 2025-05-07T20:10:49.6690670Z libcuda.so.1 => not found 2025-05-07T20:10:49.6690788Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.6690884Z libtorch_cpu.so => not found 2025-05-07T20:10:49.6690985Z libtorch_cuda.so => not found 2025-05-07T20:10:49.6691086Z libcudart.so.12 => not found 2025-05-07T20:10:49.6691192Z libtorch.so => not found 2025-05-07T20:10:49.6691281Z libc10.so => not found 2025-05-07T20:10:49.6691379Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.6691493Z libc10_cuda.so => not found 2025-05-07T20:10:49.6691585Z libnccl.so.2 => not found 2025-05-07T20:10:49.6691706Z libcuda.so.1 => not found 2025-05-07T20:10:49.6691812Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.6691931Z libtorch_cpu.so => not found 2025-05-07T20:10:49.6692029Z libtorch_cuda.so => not found 2025-05-07T20:10:49.6692218Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f0ca53f3000) 2025-05-07T20:10:49.6692336Z libtorch.so => not found 2025-05-07T20:10:49.6692422Z libc10.so => not found 2025-05-07T20:10:49.6692522Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.6692618Z libc10_cuda.so => not found 2025-05-07T20:10:49.6692738Z libnccl.so.2 => not found 2025-05-07T20:10:49.6692831Z libcuda.so.1 => not found 2025-05-07T20:10:49.6692931Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.6693053Z libtorch_cpu.so => not found 2025-05-07T20:10:49.6693146Z libtorch_cuda.so => not found 2025-05-07T20:10:49.6693280Z librt.so.1 => /lib64/librt.so.1 (0x00007f0ca53ea000) 2025-05-07T20:10:49.6693285Z 2025-05-07T20:10:49.6693400Z [CHECK] Displaying ELF information: 2025-05-07T20:10:49.6693723Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:49.6693728Z 2025-05-07T20:10:49.6708393Z 2025-05-07T20:10:49.6709353Z Dynamic section at offset 0x3a27010 contains 41 entries: 2025-05-07T20:10:49.6712681Z Tag Type Name/Value 2025-05-07T20:10:49.6713248Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:49.6713843Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:49.6714222Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:49.6714426Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:49.6714620Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:49.6714903Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:49.6715122Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:49.6715323Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:49.6715522Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:49.6715744Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:49.6715957Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:49.6716163Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:49.6716445Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:49.6716646Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:49.6716842Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:49.6717110Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:49.6717397Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_dense.so] 2025-05-07T20:10:49.6717587Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:49.6717724Z 0x000000000000000c (INIT) 0x80000 2025-05-07T20:10:49.6717847Z 0x000000000000000d (FINI) 0x261c5c 2025-05-07T20:10:49.6717970Z 0x0000000000000019 (INIT_ARRAY) 0x3a223b0 2025-05-07T20:10:49.6718103Z 0x000000000000001b (INIT_ARRAYSZ) 184 (bytes) 2025-05-07T20:10:49.6718242Z 0x000000000000001a (FINI_ARRAY) 0x3a22468 2025-05-07T20:10:49.6718365Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:49.6718486Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:49.6718618Z 0x0000000000000005 (STRTAB) 0xe368 2025-05-07T20:10:49.6718727Z 0x0000000000000006 (SYMTAB) 0x33a0 2025-05-07T20:10:49.6718875Z 0x000000000000000a (STRSZ) 374997 (bytes) 2025-05-07T20:10:49.6719020Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:49.6719174Z 0x0000000000000003 (PLTGOT) 0x3a28fe8 2025-05-07T20:10:49.6719312Z 0x0000000000000002 (PLTRELSZ) 18456 (bytes) 2025-05-07T20:10:49.6719430Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:49.6719570Z 0x0000000000000017 (JMPREL) 0x7b2d8 2025-05-07T20:10:49.6719682Z 0x0000000000000007 (RELA) 0x6ac28 2025-05-07T20:10:49.6719820Z 0x0000000000000008 (RELASZ) 67248 (bytes) 2025-05-07T20:10:49.6719959Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:49.6720079Z 0x000000006ffffffe (VERNEED) 0x6aae8 2025-05-07T20:10:49.6720200Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:49.6720320Z 0x000000006ffffff0 (VERSYM) 0x69c3e 2025-05-07T20:10:49.6720453Z 0x000000006ffffff9 (RELACOUNT) 1392 2025-05-07T20:10:49.6720559Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:49.6720565Z 2025-05-07T20:10:49.6720691Z ################################################################################ 2025-05-07T20:10:49.6720696Z 2025-05-07T20:10:49.6720728Z 2025-05-07T20:10:49.6720938Z [CHECK] Verifying sample subset of symbols in the built libraries ... 2025-05-07T20:10:49.6824907Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:49.6853321Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:49.7104131Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:49.7147397Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:49.7200157Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:49.7233145Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:49.7266241Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:49.7297391Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:49.7411658Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:49.7446555Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:49.7680886Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:49.7716782Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:49.7779280Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:49.7823248Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:49.7859278Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:49.7893670Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:49.8304030Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:49.8665817Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:49.8886271Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:49.9861439Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:49.9901368Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:49.9988769Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.0327853Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.0329662Z ################################################################################ 2025-05-07T20:10:50.0331038Z [BUILD] Wheel Audit: dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:10:50.0331441Z 2025-05-07T20:10:50.0331920Z + conda run --no-capture-output -n build_binary auditwheel show dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:10:50.0332977Z 2025-05-07T20:11:02.0215726Z 2025-05-07T20:11:02.0216128Z fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl is 2025-05-07T20:11:02.0216730Z consistent with the following platform tag: "linux_x86_64". 2025-05-07T20:11:02.0217210Z 2025-05-07T20:11:02.0217407Z The wheel references external versioned symbols in these 2025-05-07T20:11:02.0217895Z system-provided shared libraries: librt.so.1 with versions 2025-05-07T20:11:02.0218362Z {'GLIBC_2.2.5'}, libgcc_s.so.1 with versions {'GCC_3.0'}, 2025-05-07T20:11:02.0218801Z libstdc++.so.6 with versions {'CXXABI_1.3.5', 'GLIBCXX_3.4.15', 2025-05-07T20:11:02.0219317Z 'GLIBCXX_3.4.9', 'GLIBCXX_3.4.21', 'CXXABI_1.3.11', 'GLIBCXX_3.4.19', 2025-05-07T20:11:02.0219951Z 'GLIBCXX_3.4.18', 'CXXABI_1.3', 'GLIBCXX_3.4.11', 'GLIBCXX_3.4.14', 2025-05-07T20:11:02.0220442Z 'CXXABI_1.3.7', 'CXXABI_1.3.3', 'GLIBCXX_3.4', 'GLIBCXX_3.4.20'}, 2025-05-07T20:11:02.0220938Z libc.so.6 with versions {'GLIBC_2.2.5', 'GLIBC_2.17', 'GLIBC_2.3', 2025-05-07T20:11:02.0221438Z 'GLIBC_2.3.3', 'GLIBC_2.7', 'GLIBC_2.6', 'GLIBC_2.14', 'GLIBC_2.3.2'}, 2025-05-07T20:11:02.0221919Z libpthread.so.0 with versions {'GLIBC_2.2.5', 'GLIBC_2.3.2', 2025-05-07T20:11:02.0222562Z 'GLIBC_2.3.4'}, libm.so.6 with versions {'GLIBC_2.2.5'}, 2025-05-07T20:11:02.0223250Z libcudart.so.12 with versions {'libcudart.so.12'}, libgomp.so.1 with 2025-05-07T20:11:02.0223747Z versions {'OMP_1.0'}, libdl.so.2 with versions {'GLIBC_2.2.5', 2025-05-07T20:11:02.0224151Z 'GLIBC_2.3.4'} 2025-05-07T20:11:02.0224285Z 2025-05-07T20:11:02.0224498Z This constrains the platform tag to "manylinux_2_27_x86_64". In order 2025-05-07T20:11:02.0225120Z to achieve a more compatible tag, you would need to recompile a new 2025-05-07T20:11:02.0225632Z wheel from source on a system with earlier versions of these 2025-05-07T20:11:02.0226063Z libraries, such as a recent manylinux image. 2025-05-07T20:11:02.0979158Z 2025-05-07T20:11:02.0979957Z 2025-05-07T20:11:02.0980810Z ################################################################################ 2025-05-07T20:11:02.0981867Z [BUILD] Enumerating the built wheels ... 2025-05-07T20:11:02.0983280Z + ls -lth dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:02.0984347Z 2025-05-07T20:11:02.1001231Z -rw-r--r--. 1 root root 505M May 7 20:10 dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:02.1002667Z 2025-05-07T20:11:02.1002990Z [BUILD] Enumerating the wheel SHAs ... 2025-05-07T20:11:02.1004359Z + sha1sum dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:02.1005065Z 2025-05-07T20:11:03.0513567Z e104b3df85995724fab42c6a7def0cb48974742a dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:03.0515089Z 2025-05-07T20:11:03.0515388Z + sha256sum dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:03.0515765Z 2025-05-07T20:11:05.2613490Z a4f9d56a606e7ecc1787a658da2d9793614e9c662002d8d477027d8c65c22241 dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:05.2615436Z 2025-05-07T20:11:05.2616194Z + md5sum dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:05.2617228Z 2025-05-07T20:11:06.1094330Z 5be3b8cb7a62a79077503754dddf3143 dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:06.1094846Z 2025-05-07T20:11:06.1095016Z [BUILD] FBGEMM-GPU build + package completed 2025-05-07T20:11:06.1242637Z ##[group]Run actions/upload-artifact@v4 2025-05-07T20:11:06.1242970Z with: 2025-05-07T20:11:06.1243258Z name: fbgemm_default_x86_clang_py3.10_cu12.6.3.whl 2025-05-07T20:11:06.1243599Z path: fbgemm_gpu/dist/*.whl 2025-05-07T20:11:06.1243912Z if-no-files-found: error 2025-05-07T20:11:06.1244209Z compression-level: 6 2025-05-07T20:11:06.1244553Z overwrite: false 2025-05-07T20:11:06.1244823Z include-hidden-files: false 2025-05-07T20:11:06.1245215Z env: 2025-05-07T20:11:06.1245499Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T20:11:06.1245814Z BUILD_ENV: build_binary 2025-05-07T20:11:06.1246096Z BUILD_TARGET: default 2025-05-07T20:11:06.1246341Z BUILD_VARIANT: cuda 2025-05-07T20:11:06.1246614Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T20:11:06.1246867Z ##[endgroup] 2025-05-07T20:11:06.1251067Z ##[command]/usr/bin/docker exec 116e1204f840def26d5a12bed91c8919f60dd50f044201ed2ddf00f7f7c08ce4 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:06.5810766Z With the provided path, there will be 1 file uploaded 2025-05-07T20:11:06.5811367Z Artifact name is valid! 2025-05-07T20:11:06.5812161Z Root directory input is valid! 2025-05-07T20:11:06.6653505Z Beginning upload of artifact content to blob storage 2025-05-07T20:11:07.5331119Z Uploaded bytes 8388608 2025-05-07T20:11:07.9511210Z Uploaded bytes 16777216 2025-05-07T20:11:08.4831324Z Uploaded bytes 25165824 2025-05-07T20:11:08.7646440Z Uploaded bytes 33554432 2025-05-07T20:11:09.2036526Z Uploaded bytes 41943040 2025-05-07T20:11:09.6701834Z Uploaded bytes 50331648 2025-05-07T20:11:10.0717507Z Uploaded bytes 58720256 2025-05-07T20:11:10.5101469Z Uploaded bytes 67108864 2025-05-07T20:11:10.9342088Z Uploaded bytes 75497472 2025-05-07T20:11:11.3542894Z Uploaded bytes 83886080 2025-05-07T20:11:11.7858506Z Uploaded bytes 92274688 2025-05-07T20:11:12.2556998Z Uploaded bytes 100663296 2025-05-07T20:11:12.6450319Z Uploaded bytes 109051904 2025-05-07T20:11:13.0582916Z Uploaded bytes 117440512 2025-05-07T20:11:13.4684904Z Uploaded bytes 125829120 2025-05-07T20:11:13.9710716Z Uploaded bytes 134217728 2025-05-07T20:11:14.3680645Z Uploaded bytes 142606336 2025-05-07T20:11:14.8843887Z Uploaded bytes 150994944 2025-05-07T20:11:15.2271235Z Uploaded bytes 159383552 2025-05-07T20:11:15.7171976Z Uploaded bytes 167772160 2025-05-07T20:11:16.1013788Z Uploaded bytes 176160768 2025-05-07T20:11:16.4966770Z Uploaded bytes 184549376 2025-05-07T20:11:16.9677938Z Uploaded bytes 192937984 2025-05-07T20:11:17.4027270Z Uploaded bytes 201326592 2025-05-07T20:11:17.8577734Z Uploaded bytes 209715200 2025-05-07T20:11:18.1778111Z Uploaded bytes 218103808 2025-05-07T20:11:18.6469535Z Uploaded bytes 226492416 2025-05-07T20:11:19.1008221Z Uploaded bytes 234881024 2025-05-07T20:11:19.4452580Z Uploaded bytes 243269632 2025-05-07T20:11:19.8439543Z Uploaded bytes 251658240 2025-05-07T20:11:20.2902607Z Uploaded bytes 260046848 2025-05-07T20:11:20.6227144Z Uploaded bytes 268435456 2025-05-07T20:11:21.1149022Z Uploaded bytes 276824064 2025-05-07T20:11:21.4606070Z Uploaded bytes 285212672 2025-05-07T20:11:21.8731031Z Uploaded bytes 293601280 2025-05-07T20:11:22.4108368Z Uploaded bytes 301989888 2025-05-07T20:11:22.8138948Z Uploaded bytes 310378496 2025-05-07T20:11:23.1944142Z Uploaded bytes 318767104 2025-05-07T20:11:23.6551568Z Uploaded bytes 327155712 2025-05-07T20:11:24.0448340Z Uploaded bytes 335544320 2025-05-07T20:11:24.4710151Z Uploaded bytes 343932928 2025-05-07T20:11:24.9674771Z Uploaded bytes 352321536 2025-05-07T20:11:25.3506600Z Uploaded bytes 360710144 2025-05-07T20:11:25.8007217Z Uploaded bytes 369098752 2025-05-07T20:11:26.1942668Z Uploaded bytes 377487360 2025-05-07T20:11:26.6550947Z Uploaded bytes 385875968 2025-05-07T20:11:27.1127281Z Uploaded bytes 394264576 2025-05-07T20:11:27.5144028Z Uploaded bytes 402653184 2025-05-07T20:11:27.9229286Z Uploaded bytes 411041792 2025-05-07T20:11:28.3448418Z Uploaded bytes 419430400 2025-05-07T20:11:28.7963579Z Uploaded bytes 427819008 2025-05-07T20:11:29.1852935Z Uploaded bytes 436207616 2025-05-07T20:11:29.6690085Z Uploaded bytes 444596224 2025-05-07T20:11:30.0065768Z Uploaded bytes 452984832 2025-05-07T20:11:30.4853660Z Uploaded bytes 461373440 2025-05-07T20:11:30.8861185Z Uploaded bytes 469762048 2025-05-07T20:11:31.2528721Z Uploaded bytes 478150656 2025-05-07T20:11:31.7860728Z Uploaded bytes 486539264 2025-05-07T20:11:32.0912228Z Uploaded bytes 494927872 2025-05-07T20:11:32.4780117Z Uploaded bytes 503316480 2025-05-07T20:11:32.9626535Z Uploaded bytes 511705088 2025-05-07T20:11:33.2852645Z Uploaded bytes 518284428 2025-05-07T20:11:33.3024816Z Finished uploading artifact content to blob storage! 2025-05-07T20:11:33.3026828Z SHA256 digest of uploaded artifact zip is 22d29705339a26b6a7652e4fd9c394d802ad2d855d1b5cb8a6c9d23074ab3ad4 2025-05-07T20:11:33.3028710Z Finalizing artifact upload 2025-05-07T20:11:33.3904693Z Artifact fbgemm_default_x86_clang_py3.10_cu12.6.3.whl.zip successfully finalized. Artifact ID 3081457514 2025-05-07T20:11:33.3905739Z Artifact fbgemm_default_x86_clang_py3.10_cu12.6.3.whl has been successfully uploaded! Final size is 518284428 bytes. Artifact ID is 3081457514 2025-05-07T20:11:33.3919656Z Artifact download URL: https://github.com/pytorch/FBGEMM/actions/runs/14891846252/artifacts/3081457514 2025-05-07T20:11:33.4173794Z Post job cleanup. 2025-05-07T20:11:33.4178741Z ##[command]/usr/bin/docker exec 116e1204f840def26d5a12bed91c8919f60dd50f044201ed2ddf00f7f7c08ce4 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:33.6647469Z [command]/usr/bin/git version 2025-05-07T20:11:33.6681222Z git version 2.47.1 2025-05-07T20:11:33.6714285Z Copying '/github/home/.gitconfig' to '/__w/_temp/3513c928-e431-41ea-9174-080f661c406d/.gitconfig' 2025-05-07T20:11:33.6723750Z Temporarily overriding HOME='/__w/_temp/3513c928-e431-41ea-9174-080f661c406d' before making global git config changes 2025-05-07T20:11:33.6724878Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T20:11:33.6735893Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T20:11:33.6770305Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T20:11:33.6799903Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T20:11:33.7087673Z Entering 'external/asmjit' 2025-05-07T20:11:33.7140567Z Entering 'external/composable_kernel' 2025-05-07T20:11:33.7217295Z Entering 'external/cpuinfo' 2025-05-07T20:11:33.7270072Z Entering 'external/cutlass' 2025-05-07T20:11:33.7346655Z Entering 'external/googletest' 2025-05-07T20:11:33.7414356Z Entering 'external/hipify_torch' 2025-05-07T20:11:33.7478063Z Entering 'external/json' 2025-05-07T20:11:33.7557993Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T20:11:33.7577241Z http.https://github.com/.extraheader 2025-05-07T20:11:33.7582521Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2025-05-07T20:11:33.7612705Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T20:11:33.7888240Z Entering 'external/asmjit' 2025-05-07T20:11:33.7921028Z http.https://github.com/.extraheader 2025-05-07T20:11:33.7960181Z Entering 'external/composable_kernel' 2025-05-07T20:11:33.7994421Z http.https://github.com/.extraheader 2025-05-07T20:11:33.8047591Z Entering 'external/cpuinfo' 2025-05-07T20:11:33.8093118Z http.https://github.com/.extraheader 2025-05-07T20:11:33.8129081Z Entering 'external/cutlass' 2025-05-07T20:11:33.8173261Z http.https://github.com/.extraheader 2025-05-07T20:11:33.8218804Z Entering 'external/googletest' 2025-05-07T20:11:33.8253457Z http.https://github.com/.extraheader 2025-05-07T20:11:33.8283628Z Entering 'external/hipify_torch' 2025-05-07T20:11:33.8332934Z http.https://github.com/.extraheader 2025-05-07T20:11:33.8367151Z Entering 'external/json' 2025-05-07T20:11:33.8400604Z http.https://github.com/.extraheader 2025-05-07T20:11:33.8568852Z Stop and remove container: a1849b04f9ef420595d98b94e6cdfef5_amazonlinux2023_b685f4 2025-05-07T20:11:33.8574495Z ##[command]/usr/bin/docker rm --force 116e1204f840def26d5a12bed91c8919f60dd50f044201ed2ddf00f7f7c08ce4 2025-05-07T20:11:35.3419743Z 116e1204f840def26d5a12bed91c8919f60dd50f044201ed2ddf00f7f7c08ce4 2025-05-07T20:11:35.3454709Z Remove container network: github_network_90ef945cace04aa28be488fe06f897dc 2025-05-07T20:11:35.3459285Z ##[command]/usr/bin/docker network rm github_network_90ef945cace04aa28be488fe06f897dc 2025-05-07T20:11:36.1703012Z github_network_90ef945cace04aa28be488fe06f897dc 2025-05-07T20:11:36.1745462Z A job completed hook has been configured by the self-hosted runner administrator 2025-05-07T20:11:36.1765309Z ##[group]Run '/home/ec2-user/runner-scripts/after_job.sh' 2025-05-07T20:11:36.1770605Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T20:11:36.1770986Z ##[endgroup] 2025-05-07T20:11:36.1897541Z [!ALERT!] Swap out detected [!ALERT!] 2025-05-07T20:11:52.5739032Z Cleaning up orphan processes